A review of convolutional neural networks in computer vision

被引:219
作者
Zhao, Xia [1 ]
Wang, Limin [1 ]
Zhang, Yufei [2 ]
Han, Xuming [3 ]
Deveci, Muhammet [4 ,5 ,6 ]
Parmar, Milan [7 ]
机构
[1] Guangdong Univ Finance & Econ, Sch Informat Sci, Guangzhou 510320, Peoples R China
[2] Changchun Univ Sci & Technol, Sch Comp Sci & Technol, Changchun 130022, Peoples R China
[3] Jinan Univ, Sch Informat Sci & Technol, Guangzhou 510632, Peoples R China
[4] Natl Def Univ, Turkish Naval Acad, Dept Ind Engn, TR-34942 Istanbul, Turkiye
[5] UCL, Bartlett Sch Sustainable Construction, 1-19 Torrington Pl, London WC1E 7HB, England
[6] Lebanese Amer Univ, Dept Elect & Comp Engn, Byblos, Lebanon
[7] Mississippi State Univ, Dept Comp Sci & Engn, Starkville, MS 39762 USA
关键词
Convolutional neural networks; Computer vision; Status quo review; Deep learning; MODELS;
D O I
10.1007/s10462-024-10721-6
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In computer vision, a series of exemplary advances have been made in several areas involving image classification, semantic segmentation, object detection, and image super-resolution reconstruction with the rapid development of deep convolutional neural network (CNN). The CNN has superior features for autonomous learning and expression, and feature extraction from original input data can be realized by means of training CNN models that match practical applications. Due to the rapid progress in deep learning technology, the structure of CNN is becoming more and more complex and diverse. Consequently, it gradually replaces the traditional machine learning methods. This paper presents an elementary understanding of CNN components and their functions, including input layers, convolution layers, pooling layers, activation functions, batch normalization, dropout, fully connected layers, and output layers. On this basis, this paper gives a comprehensive overview of the past and current research status of the applications of CNN models in computer vision fields, e.g., image classification, object detection, and video prediction. In addition, we summarize the challenges and solutions of the deep CNN, and future research directions are also discussed.
引用
收藏
页数:43
相关论文
共 100 条
[31]   Rich feature hierarchies for accurate object detection and semantic segmentation [J].
Girshick, Ross ;
Donahue, Jeff ;
Darrell, Trevor ;
Malik, Jitendra .
2014 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2014, :580-587
[32]  
Guo G., 2023, Vis. Intell., V1, P6
[33]   Reducing the dimensionality of data with neural networks [J].
Hinton, G. E. ;
Salakhutdinov, R. R. .
SCIENCE, 2006, 313 (5786) :504-507
[34]   Overview of behavior recognition based on deep learning [J].
Hu, Kai ;
Jin, Junlan ;
Zheng, Fei ;
Weng, Liguo ;
Ding, Yiwu .
ARTIFICIAL INTELLIGENCE REVIEW, 2023, 56 (03) :1833-1865
[35]   Abnormal Event Detection Using Deep Contrastive Learning for Intelligent Video Surveillance System [J].
Huang, Chao ;
Wu, Zhihao ;
Wen, Jie ;
Xu, Yong ;
Jiang, Qiuping ;
Wang, Yaowei .
IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2022, 18 (08) :5171-5179
[36]   Normalization Techniques in Training DNNs: Methodology, Analysis and Application [J].
Huang, Lei ;
Qin, Jie ;
Zhou, Yi ;
Zhu, Fan ;
Liu, Li ;
Shao, Ling .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (08) :10173-10196
[37]   RECEPTIVE FIELDS AND FUNCTIONAL ARCHITECTURE OF MONKEY STRIATE CORTEX [J].
HUBEL, DH ;
WIESEL, TN .
JOURNAL OF PHYSIOLOGY-LONDON, 1968, 195 (01) :215-&
[38]  
Ioffe S., 2015, International Conference on Machine Learning (PMLR), P448
[39]   Development of a Multilayer Perceptron Neural Network for Optimal Predictive Modeling in Urban Microcellular Radio Environments [J].
Isabona, Joseph ;
Imoize, Agbotiname Lucky ;
Ojo, Stephen ;
Karunwi, Olukayode ;
Kim, Yongsung ;
Lee, Cheng-Chi ;
Li, Chun-Ta .
APPLIED SCIENCES-BASEL, 2022, 12 (11)
[40]   Filtered selective search and evenly distributed convolutional neural networks for casting defects recognition [J].
Ji, Xiaoyuan ;
Yan, Qiuyu ;
Huang, Dong ;
Wu, Bo ;
Xu, Xiaojing ;
Zhang, Aibin ;
Liao, Guanglan ;
Zhou, Jianxin ;
Wu, Menghuai .
JOURNAL OF MATERIALS PROCESSING TECHNOLOGY, 2021, 292 (292)