Embedding channel pruning within the CNN architecture design using a bi-level evolutionary approach

被引:5
作者
Louati, Hassen [1 ]
Louati, Ali [2 ]
Bechikh, Slim [1 ]
Kariri, Elham [2 ]
机构
[1] Univ Tunis, SMART Lab, ISG, Tunis, Tunisia
[2] Prince Sattam bin Abdulaziz Univ, Coll Comp Engn & Sci, Dept Informat Syst, Al Kharj 11942, Saudi Arabia
关键词
DCNN design; Pruning; Evolutionary algorithms; Bi-level optimization; NEURAL-NETWORKS; OPTIMIZATION;
D O I
10.1007/s11227-023-05273-5
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Remarkable advancements have been achieved in machine learning and computer vision through the utilization of deep neural networks. Among the most advantageous of these networks is the convolutional neural network (CNN). It has been used in pattern recognition, medical diagnosis, and signal processing, among other things. Actually, for these networks, the challenge of choosing hyperparameters is of utmost importance. The reason behind this is that as the number of layers rises, the search space grows exponentially. In addition, all known classical and evolutionary pruning algorithms require a trained or built architecture as input. During the design phase, none of them consider the process of pruning. In order to assess the effectiveness and efficiency of any architecture created, pruning of channels must be carried out before transmitting the dataset and computing classification errors. For instance, following pruning, an architecture of medium quality in terms of classification may transform into an architecture that is both highly light and accurate, and vice versa. There exist countless potential scenarios that could occur, which prompted us to develop a bi-level optimization approach for the entire process. The upper level involves generating the architecture while the lower level optimizes channel pruning. Evolutionary algorithms (EAs) have proven effective in bi-level optimization, leading us to adopt the co-evolutionary migration-based algorithm as a search engine for our bi-level architectural optimization problem in this research. Our proposed method, CNN-D-P (bi-level CNN design and pruning), was tested on the widely used image classification benchmark datasets, CIFAR-10, CIFAR-100 and ImageNet. Our suggested technique is validated by means of a set of comparison tests with regard to relevant state-of-the-art architectures.
引用
收藏
页码:16118 / 16151
页数:34
相关论文
共 54 条
[1]  
Bengio Yoshua., 2007, ADV NEURAL INFORM PR, V19, P153
[2]   CONVEX COMBINATIONS OF STABLE POLYNOMIALS [J].
BIALAS, S ;
GARLOFF, J .
JOURNAL OF THE FRANKLIN INSTITUTE-ENGINEERING AND APPLIED MATHEMATICS, 1985, 319 (03) :373-377
[3]  
Candler W., 1962, Australian Journal of Agricultural Economics, V6, P36
[4]   An analysis of Gray versus binary encoding in genetic search [J].
Chakraborty, UK ;
Janikow, CZ .
INFORMATION SCIENCES, 2003, 156 (3-4) :253-269
[5]  
Chakraborty UK, 2003, US Patent, Patent No. [156,253-269, 156253269]
[6]  
Chen S., 2019, P 2019 2 INT C ALG C, P189
[7]  
Cheung B., 2011, Proceedings of the 2011 Tenth International Conference on Machine Learning and Applications (ICMLA 2011), P293, DOI 10.1109/ICMLA.2011.73
[8]   A survey of swarm and evolutionary computing approaches for deep learning [J].
Darwish, Ashraf ;
Hassanien, Aboul Ella ;
Das, Swagatam .
ARTIFICIAL INTELLIGENCE REVIEW, 2020, 53 (03) :1767-1812
[9]   The reusable holdout: Preserving validity in adaptive data analysis [J].
Dwork, Cynthia ;
Feldman, Vitaly ;
Hardt, Moritz ;
Pitassi, Toniann ;
Reingold, Omer ;
Roth, Aaron .
SCIENCE, 2015, 349 (6248) :636-638
[10]   Fundamental Technologies in Modern Speech Recognition [J].
Furui, Sadaoki ;
Deng, Li ;
Gales, Mark ;
Ney, Hermann ;
Tokuda, Keiichi .
IEEE SIGNAL PROCESSING MAGAZINE, 2012, 29 (06) :16-17