Convolutional neural network: a review of models, methodologies and applications to object detection

被引:654
作者
Dhillon, Anamika [1 ]
Verma, Gyanendra K. [1 ]
机构
[1] Natl Inst Technol Kurukshetra, Dept Comp Engn, Kurukshetra 136119, Haryana, India
关键词
Deep learning; CNN architectures; Transfer learning; Object detection; DEEP LEARNING APPROACH; HANDGUN DETECTION; RECOGNITION; CLASSIFICATION; IDENTIFICATION; ARCHITECTURES; AUTOENCODERS; FUSION; HEALTH;
D O I
10.1007/s13748-019-00203-0
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Deep learning has developed as an effective machine learning method that takes in numerous layers of features or representation of the data and provides state-of-the-art results. The application of deep learning has shown impressive performance in various application areas, particularly in image classification, segmentation and object detection. Recent advances of deep learning techniques bring encouraging performance to fine-grained image classification which aims to distinguish subordinate-level categories. This task is extremely challenging due to high intra-class and low inter-class variance. In this paper, we provide a detailed review of various deep architectures and model highlighting characteristics of particular model. Firstly, we described the functioning of CNN architectures and its components followed by detailed description of various CNN models starting with classical LeNet model to AlexNet, ZFNet, GoogleNet, VGGNet, ResNet, ResNeXt, SENet, DenseNet, Xception, PNAS/ENAS. We mainly focus on the application of deep learning architectures to three major applications, namely (i) wild animal detection, (ii) small arm detection and (iii) human being detection. A detailed review summary including the systems, database, application and accuracy claimed is also provided for each model to serve as guidelines for future work in the above application areas.
引用
收藏
页码:85 / 112
页数:28
相关论文
共 135 条
[101]   A Survey on Deep Transfer Learning [J].
Tan, Chuanqi ;
Sun, Fuchun ;
Kong, Tao ;
Zhang, Wenchang ;
Yang, Chao ;
Liu, Chunfang .
ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING - ICANN 2018, PT III, 2018, 11141 :270-279
[102]  
Tan MX, 2019, PR MACH LEARN RES, V97
[103]   The Object Detection Based on Deep Learning [J].
Tang, Cong ;
Feng, Yunsong ;
Yang, Xing ;
Zheng, Chao ;
Zhou, Yuanpu .
2017 4TH INTERNATIONAL CONFERENCE ON INFORMATION SCIENCE AND CONTROL ENGINEERING (ICISCE), 2017, :723-728
[104]   Video Object Detection for Tractability with Deep Learning Method [J].
Tian, Bing ;
Li, Liang ;
Qu, Yansheng ;
Yan, Li .
2017 FIFTH INTERNATIONAL CONFERENCE ON ADVANCED CLOUD AND BIG DATA (CBD), 2017, :397-401
[105]   Deep Convolutional Neural Networks for pedestrian detection [J].
Tome, D. ;
Monti, F. ;
Baroffio, L. ;
Bondi, L. ;
Tagliasacchi, M. ;
Tubaro, S. .
SIGNAL PROCESSING-IMAGE COMMUNICATION, 2016, 47 :482-489
[106]  
Vincent P, 2010, J MACH LEARN RES, V11, P3371
[107]  
Vinyals O, 2015, PROC CVPR IEEE, P3156, DOI 10.1109/CVPR.2015.7298935
[108]   Large-Margin Multi-Modal Deep Learning for RGB-D Object Recognition [J].
Wang, Anran ;
Lu, Jiwen ;
Cai, Jianfei ;
Cham, Tat-Jen ;
Wang, Gang .
IEEE TRANSACTIONS ON MULTIMEDIA, 2015, 17 (11) :1887-1898
[109]  
Wang JG, 2017, TENCON IEEE REGION, P321, DOI 10.1109/TENCON.2017.8227883
[110]  
Wang Xue-jun, 2012, 2012 8th International Conference on Natural Computation, P44, DOI 10.1109/ICNC.2012.6234772