A tutorial survey of architectures, algorithms, and applications for deep learning

Cited by: 348
Authors
Deng, Li [1]
Affiliations
[1] Microsoft Res, Redmond, WA 98052 USA
Keywords
Deep learning; Algorithms; Information processing;
DOI
10.1017/atsip.2013.9
Chinese Library Classification
TM [Electrical engineering]; TN [Electronics and communication technology];
Discipline classification code
0808 ; 0809 ;
Abstract
This invited paper expands and updates my overview material on the same topic, presented in the plenary overview session of APSIPA-2011, together with the tutorial material presented at the same conference [1], to include more recent developments in deep learning. Both the previous and the updated materials cover theory and applications, and analyze the future directions of the field. The goal of this tutorial survey is to introduce the emerging area of deep learning, or hierarchical learning, to the APSIPA community. Deep learning refers to a class of machine learning techniques, developed largely since 2006, in which many stages of non-linear information processing in hierarchical architectures are exploited for pattern classification and for feature learning. In the more recent literature, it is also connected to representation learning, which involves a hierarchy of features or concepts where higher-level concepts are defined from lower-level ones and where the same lower-level concepts help to define higher-level ones. In this tutorial survey, a brief history of deep learning research is discussed first. Then, a classificatory scheme is developed to analyze and summarize major work reported in the recent deep learning literature. Using this scheme, I provide a taxonomy-oriented survey of the existing deep architectures and algorithms in the literature, and categorize them into three classes: generative, discriminative, and hybrid. Three representative deep architectures, one from each of the three classes, are presented in more detail: deep autoencoders, deep stacking networks with their generalization to the temporal domain (recurrent networks), and deep neural networks (pretrained with deep belief networks). Next, selected applications of deep learning are reviewed in broad areas of signal and information processing, including audio/speech, image/vision, multimodality, language modeling, natural language processing, and information retrieval. Finally, future directions of deep learning are discussed and analyzed.
Pages: 29
Related papers
215 in total
[1] Abdel-Hamid O., 2012, ICASSP
[2] Abdel-Hamid O., 2013, P INTERSPEECH
[3] [Anonymous], 2010, ADV NEURAL INFORM PR, DOI 10.5555/2997189.2997242
[4] [Anonymous], 2012, NEW YORK TIMES
[5] [Anonymous], 2013, RE CURSIVE DEEP MODE
[6] [Anonymous], P ICML
[7] [Anonymous], 2012, P ACL
[8] Arel, Itamar; Rose, Derek C.; Karnowski, Thomas P. Deep Machine Learning - A New Frontier in Artificial Intelligence Research [J]. IEEE COMPUTATIONAL INTELLIGENCE MAGAZINE, 2010, 5 (04): 13-18
[9] As Mikolov T., 2012, THESIS
[10] Baker, Janet M.; Deng, Li; Khudanpur, Sanjeev; Lee, Chin-Hui; Glass, James R.; Morgan, Nelson; O'Shaughnessy, Douglas. Updated MINDS Report on Speech Recognition and Understanding, Part 2 [J]. IEEE SIGNAL PROCESSING MAGAZINE, 2009, 26 (04): 78-85