A review of convolutional neural network architectures and their optimizations

Citations: 154
Authors
Cong, Shuang [1]
Zhou, Yang [1]
Affiliations
[1] Univ Sci & Technol China, Dept Automat, Hefei 230027, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Machine learning; Convolutional neural network; Network architecture; OBJECT;
DOI
10.1007/s10462-022-10213-5
Chinese Library Classification
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
This paper analyzes in detail the research advances in typical convolutional neural network (CNN) architectures and their optimizations. It proposes a module-based approach to classifying CNN architectures, so that the classification can accommodate newer architectures whose multiple characteristics make them difficult to place under the original classification method. Six types of typical CNN architectures are analyzed and explained in detail through an analysis of the pros and cons of diverse network architectures and comparisons of their performance. The intrinsic characteristics of CNN architectures are also explored. Moreover, the paper provides a comprehensive classification of network compression and accelerated network architecture optimization algorithms, based on the mathematical principles of the various optimization algorithms. Finally, it analyzes the strategies of neural architecture search (NAS) algorithms, discusses the applications of CNNs, and sheds light on the challenges and prospects of current CNN architectures and their optimizations. By explaining the advantages brought by optimizing different types of network architecture, the paper gives readers a basis for choosing appropriate CNNs in specific designs and applications.
Pages: 1905-1969
Page count: 65