A comprehensive review of recent advances on deep vision systems

被引:64
作者
Abbas, Qaisar [1 ]
Ibrahim, Mostafa E. A. [1 ,2 ]
Jaffar, M. Arfan [1 ]
机构
[1] Al Imam Mohammad Ibn Saud Islamic Univ IMSIU1, Dept Comp Sci, Riyadh 11432, Saudi Arabia
[2] Benha Univ, Benha Fac Engn, Banha, Egypt
关键词
Computer vision; Video processing; Object detection; Object tracking; Object recognition; Deep learning; Convolutional neural network; Deep belief network; Deep residual learning; CONVOLUTIONAL NEURAL-NETWORKS; IMAGE; REPRESENTATION; LOCALIZATION; RECOGNITION; TRACKING;
D O I
10.1007/s10462-018-9633-3
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Real-time video objects detection, tracking, and recognition are challenging issues due to the real-time processing requirements of the machine learning algorithms. In recent years, video processing is performed by deep learning (DL) based techniques that achieve higher accuracy but require higher computations cost. This paper presents a recent survey of the state-of-the-art DL platforms and architectures used for deep vision systems. It highlights the contributions and challenges from over numerous research studies. In particular, this paper first describes the architecture of various DL models such as AutoEncoders, deep Boltzmann machines, convolution neural networks, recurrent neural networks and deep residual learning. Next, deep real-time video objects detection, tracking and recognition studies are highlighted to illustrate the key trends in terms of cost of computation, number of layers and the accuracy of results. Finally, the paper discusses the challenges of applying DL for real-time video processing and draw some directions for the future of DL algorithms.
引用
收藏
页码:39 / 76
页数:38
相关论文
共 110 条
[1]  
[Anonymous], DEEP LEARNING VERESU
[2]  
[Anonymous], P 3 INT C LEARNING R
[3]  
[Anonymous], ABS160200577 ARXIV
[4]  
[Anonymous], P INT C LEARN REPR I
[5]  
[Anonymous], PROC CVPR IEEE
[6]  
[Anonymous], 2015, PROC CVPR IEEE
[7]  
[Anonymous], J NEURAL COMPUT APPL
[8]  
[Anonymous], TIP
[9]  
[Anonymous], 2017, INT J PROD RES, DOI DOI 10.1080/00207543.2016.1154209
[10]  
[Anonymous], 2016, COMPUTER VISION METR