Review of deep learning: concepts, CNN architectures, challenges, applications, future directions

被引:3087
作者
Alzubaidi, Laith [1 ,5 ]
Zhang, Jinglan [1 ]
Humaidi, Amjad J. [2 ]
Al-Dujaili, Ayad [3 ]
Duan, Ye [4 ]
Al-Shamma, Omran [5 ]
Santamaria, J. [6 ]
Fadhel, Mohammed A. [7 ]
Al-Amidie, Muthana [4 ]
Farhan, Laith [8 ]
机构
[1] Queensland Univ Technol, Sch Comp Sci, Brisbane, Qld 4000, Australia
[2] Univ Technol Baghdad, Control & Syst Engn Dept, Baghdad 10001, Iraq
[3] Middle Tech Univ, Elect Engn Tech Coll, Baghdad 10001, Iraq
[4] Univ Missouri, Fac Elect Engn & Comp Sci, Columbia, MO 65211 USA
[5] Univ Informat Technol & Commun, AlNidhal Campus, Baghdad 10001, Iraq
[6] Univ Jaen, Dept Comp Sci, Jaen 23071, Spain
[7] Univ Sumer, Coll Comp Sci & Informat Technol, Thi Qar 64005, Iraq
[8] Manchester Metropolitan Univ, Sch Engn, Manchester M1 5GD, Lancs, England
关键词
Deep learning; Machine learning; Convolution neural network (CNN); Deep neural network architectures; Deep learning applications; Image classification; Transfer learning; Medical image analysis; Supervised learning; FPGA; GPU; CONVOLUTIONAL NEURAL-NETWORKS; BRAIN-TUMOR SEGMENTATION; X-RAY IMAGES; OF-THE-ART; COMPUTER-AIDED DETECTION; AUTOMATIC SEGMENTATION; CANCER CLASSIFICATION; PNEUMONIA DETECTION; MITOSIS DETECTION; DATA AUGMENTATION;
D O I
10.1186/s40537-021-00444-8
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
In the last few years, the deep learning (DL) computing paradigm has been deemed the Gold Standard in the machine learning (ML) community. Moreover, it has gradually become the most widely used computational approach in the field of ML, thus achieving outstanding results on several complex cognitive tasks, matching or even beating those provided by human performance. One of the benefits of DL is the ability to learn massive amounts of data. The DL field has grown fast in the last few years and it has been extensively used to successfully address a wide range of traditional applications. More importantly, DL has outperformed well-known ML techniques in many domains, e.g., cybersecurity, natural language processing, bioinformatics, robotics and control, and medical information processing, among many others. Despite it has been contributed several works reviewing the State-of-the-Art on DL, all of them only tackled one aspect of the DL, which leads to an overall lack of knowledge about it. Therefore, in this contribution, we propose using a more holistic approach in order to provide a more suitable starting point from which to develop a full understanding of DL. Specifically, this review attempts to provide a more comprehensive survey of the most important aspects of DL and including those enhancements recently added to the field. In particular, this paper outlines the importance of DL, presents the types of DL techniques and networks. It then presents convolutional neural networks (CNNs) which the most utilized DL network type and describes the development of CNNs architectures together with their main features, e.g., starting with the AlexNet network and closing with the High-Resolution network (HR.Net). Finally, we further present the challenges and suggested solutions to help researchers understand the existing research gaps. It is followed by a list of the major DL applications. Computational tools including FPGA, GPU, and CPU are summarized along with a description of their influence on DL. The paper ends with the evolution matrix, benchmark datasets, and summary and conclusion.
引用
收藏
页数:74
相关论文
共 336 条
[1]   Two-phase multi-model automatic brain tumour diagnosis system from magnetic resonance images using convolutional neural networks [J].
Abd-Ellah, Mahmoud Khaled ;
Awad, Ali Ismail ;
Khalaf, Ashraf A. M. ;
Hamed, Hesham F. A. .
EURASIP JOURNAL ON IMAGE AND VIDEO PROCESSING, 2018,
[2]   Computer-aided detection of COVID-19 from X-ray images using multi-CNN and Bayesnet classifier [J].
Abraham, Bejoy ;
Nair, Madhu S. .
BIOCYBERNETICS AND BIOMEDICAL ENGINEERING, 2020, 40 (04) :1436-1445
[3]   Contextual deep learning-based audio-visual switching for speech enhancement in real-world environments [J].
Adeel, Ahsan ;
Gogate, Mandar ;
Hussain, Amir .
INFORMATION FUSION, 2020, 59 :163-170
[4]   Deep learning techniques for skin lesion analysis and melanoma cancer detection: a survey of state-of-the-art [J].
Adegun, Adekanmi ;
Viriri, Serestina .
ARTIFICIAL INTELLIGENCE REVIEW, 2021, 54 (02) :811-841
[5]   Deep Learning for Brain MRI Segmentation: State of the Art and Future Directions [J].
Akkus, Zeynettin ;
Galimzianova, Alfiia ;
Hoogi, Assaf ;
Rubin, Daniel L. ;
Erickson, Bradley J. .
JOURNAL OF DIGITAL IMAGING, 2017, 30 (04) :449-459
[6]   DeepCryoPicker: fully automated deep neural network for single protein particle picking in cryo-EM [J].
Al-Azzawi, Adil ;
Ouadou, Anes ;
Max, Highsmith ;
Duan, Ye ;
Tanner, John J. ;
Cheng, Jianlin .
BMC BIOINFORMATICS, 2020, 21 (01)
[7]   Benchmarking HEp-2 specimen cells classification using linear discriminant analysis on higher order spectra features of cell shape [J].
Al-Dulaimi, Khamael ;
Chandran, Vinod ;
Nguyen, Kien ;
Banks, Jasmine ;
Tomeo-Reyes, Inmaculada .
PATTERN RECOGNITION LETTERS, 2019, 125 :534-541
[8]   AggNet: Deep Learning From Crowds for Mitosis Detection in Breast Cancer Histology Images [J].
Albarqouni, Shadi ;
Baur, Christoph ;
Achilles, Felix ;
Belagiannis, Vasileios ;
Demirci, Stefanie ;
Navab, Nassir .
IEEE TRANSACTIONS ON MEDICAL IMAGING, 2016, 35 (05) :1313-1321
[9]   A State-of-the-Art Survey on Deep Learning Theory and Architectures [J].
Alom, Md Zahangir ;
Taha, Tarek M. ;
Yakopcic, Chris ;
Westberg, Stefan ;
Sidike, Paheding ;
Nasrin, Mst Shamima ;
Hasan, Mahmudul ;
Van Essen, Brian C. ;
Awwal, Abdul A. S. ;
Asari, Vijayan K. .
ELECTRONICS, 2019, 8 (03)
[10]   Towards a Better Understanding of Transfer Learning for Medical Imaging: A Case Study [J].
Alzubaidi, Laith ;
Fadhel, Mohammed A. ;
Al-Shamma, Omran ;
Zhang, Jinglan ;
Santamaria, J. ;
Duan, Ye ;
Oleiwi, Sameer R. .
APPLIED SCIENCES-BASEL, 2020, 10 (13)