Deep learning modelling techniques: current progress, applications, advantages, and challenges

被引：308

作者：

Ahmed, Shams Forruque ^{[1
]}

Alam, Md. Sakib Bin ^{[2
]}

Hassan, Maruf ^{[1
]}

Rozbu, Mahtabin Rodela ^{[3
]}

Ishtiak, Taoseef ^{[4
]}

Rafa, Nazifa ^{[5
]}

Mofijur, M. ^{[6
,7
]}

Ali, A. B. M. Shawkat ^{[8
,9
]}

Gandomi, Amir H. ^{[10
,11
]}

机构：

[1] Asian Univ Women, Sci & Math Program, Chattogram 4000, Bangladesh

[2] Asian Inst Technol, Data Sci & Artificial Intelligence, Chang Wat, Pathum Thani 12120, Thailand

[3] Carnegie Mellon Univ, Dept Computat Biol, Pittsburgh, PA 15213 USA

[4] Carleton Univ, Sch Comp Sci, Ottawa, ON K1S 5B6, Canada

[5] Univ Cambridge, Dept Geog, Downing Pl, Cambridge CB2 3EN, England

[6] Univ Technol Sydney, Ctr Technol Water & Wastewater, Sch Civil & Environm Engn, Ultimo, NSW 2007, Australia

[7] Prince Mohammad Bin Fahd Univ, Mech Engn Dept, Al Khobar 31952, Saudi Arabia

[8] Cent Queensland Univ, Sch Engn & Technol, 300, Melbourne, Vic, Australia

[9] Univ Fiji, Sch Sci & Technol, Lautoka, Fiji

[10] Univ Technol Sydney, Fac Engn & Informat Technol, Sydney, NSW 2007, Australia

[11] Obuda Univ, Univ Res & Innovat Ctr EKIK, H-1034 Budapest, Hungary

来源：

ARTIFICIAL INTELLIGENCE REVIEW | 2023年 / 56卷 / 11期

关键词：

Deep learning; Deep learning architecture; Neural network; Boltzmann machine; Deep belief network; Autoencoders; RECURRENT NEURAL-NETWORKS; BOLTZMANN MACHINE; CLASSIFICATION; LSTM; ARCHITECTURES; INTELLIGENCE; COMPUTATION; CAPSNET;

D O I：

10.1007/s10462-023-10466-8

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Deep learning (DL) is revolutionizing evidence-based decision-making techniques that can be applied across various sectors. Specifically, it possesses the ability to utilize two or more levels of non-linear feature transformation of the given data via representation learning in order to overcome limitations posed by large datasets. As a multidisciplinary field that is still in its nascent phase, articles that survey DL architectures encompassing the full scope of the field are rather limited. Thus, this paper comprehensively reviews the state-of-art DL modelling techniques and provides insights into their advantages and challenges. It was found that many of the models exhibit a highly domain-specific efficiency and could be trained by two or more methods. However, training DL models can be very time-consuming, expensive, and requires huge samples for better accuracy. Since DL is also susceptible to deception and misclassification and tends to get stuck on local minima, improved optimization of parameters is required to create more robust models. Regardless, DL has already been leading to groundbreaking results in the healthcare, education, security, commercial, industrial, as well as government sectors. Some models, like the convolutional neural network (CNN), generative adversarial networks (GAN), recurrent neural network (RNN), recursive neural networks, and autoencoders, are frequently used, while the potential of other models remains widely unexplored. Pertinently, hybrid conventional DL architectures have the capacity to overcome the challenges experienced by conventional models. Considering that capsule architectures may dominate future DL models, this work aimed to compile information for stakeholders involved in the development and use of DL models in the contemporary world.

引用

页码：13521 / 13617

页数：97

共 301 条

[31] MUTAN: Multimodal Tucker Fusion for Visual Question Answering [J].

Ben-younes, Hedi ;

Cadene, Remi ;

Cord, Matthieu ;

Thome, Nicolas .

2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, :2631-2639

[32] LEARNING LONG-TERM DEPENDENCIES WITH GRADIENT DESCENT IS DIFFICULT [J].

BENGIO, Y ;

SIMARD, P ;

FRASCONI, P .

IEEE TRANSACTIONS ON NEURAL NETWORKS, 1994, 5 (02) :157-166

[33] Representation Learning: A Review and New Perspectives [J].

Bengio, Yoshua ;

Courville, Aaron ;

Vincent, Pascal .

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2013, 35 (08) :1798-1828

[34] Recursive neural network model for analysis and forecast of PM10 and PM2.5 [J].

Biancofiore, Fabio ;

Busilacchio, Marcella ;

Verdecchia, Marco ;

Tomassetti, Barbara ;

Aruffo, Eleonora ;

Bianco, Sebastiano ;

Di Tommaso, Sinibaldo ;

Colangeli, Carlo ;

Rosatelli, Gianluigi ;

Di Carlo, Piero .

ATMOSPHERIC POLLUTION RESEARCH, 2017, 8 (04) :652-659

[35]

Bordes A, 2014, Arxiv, DOI arXiv:1406.3676

[36]

Bousmalis K, 2016, ADV NEUR IN, V29

[37]

Brahma S, 2018, Arxiv, DOI [arXiv:1805.07340, DOI 10.48550/ARXIV.1805.07340]

[38] Authorship verification using deep belief network systems [J].

Brocardo, Marcelo Luiz ;

Traore, Issa ;

Woungang, Isaac ;

Obaidat, Mohammad S. .

INTERNATIONAL JOURNAL OF COMMUNICATION SYSTEMS, 2017, 30 (12)

[39]

Brock A., 2019, 7 INT C LEARNING REP

[40] Neural Sign Language Translation [J].

Camgoz, Necati Cihan ;

Hadfield, Simon ;

Koller, Oscar ;

Ney, Hermann ;

Bowden, Richard .

2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, :7784-7793

← 1 2 3 4 5 6 7 8 9 10 →