Convolutional Deep Rectifier Neural Nets for Phone Recognition

被引:0
作者
Toth, Laszlo [1 ,2 ]
机构
[1] Hungarian Acad Sci, Res Grp Artificial Intelligence, Budapest, Hungary
[2] Univ Szeged, Szeged, Hungary
来源
14TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2013), VOLS 1-5 | 2013年
关键词
Deep neural networks; sparse rectifier neural networks; phone recognition;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Rectifier neurons differ from standard ones only in that the sigmoid activation function is replaced by the rectifier function, max(0, x). Several recent studies suggest that rectifier units may be more suitable building units for deep nets. For example, we found that with deep rectifier networks one can attain a similar speech recognition performance than that with sigmoid nets, but without the need for the time-consuming pre-training procedure. Here, we extend the previous results by modifying the rectifier network so that it has a convolutional structure. As convolutional networks are inherently deep, rectifier neurons seem to be an ideal choice as their building units. Indeed, on the TIMIT phone recognition task we report a 6% relative error reduction compared to our earlier results, giving an 18.6% error rate on the core test set. Then, with the application of the recently proposed 'dropout' training method we reduce the error rate further to 17.8%, which, to our knowledge, is the best result to date on this database.
引用
收藏
页码:1721 / 1725
页数:5
相关论文
共 50 条
[41]   Recent advances in efficient computation of deep convolutional neural networks [J].
Jian Cheng ;
Pei-song Wang ;
Gang Li ;
Qing-hao Hu ;
Han-qing Lu .
Frontiers of Information Technology & Electronic Engineering, 2018, 19 :64-77
[42]   Learn a Deep Convolutional Neural Network for Image Smoke Detection [J].
Liu, Maoshen ;
Gu, Ke ;
Wu, Li ;
Xu, Xin ;
Qiao, Junfei .
DIGITAL TV AND MULTIMEDIA COMMUNICATION, 2019, 1009 :217-226
[43]   Recent advances in efficient computation of deep convolutional neural networks [J].
Cheng, Jian ;
Wang, Pei-song ;
Li, Gang ;
Hu, Qing-hao ;
Lu, Han-qing .
FRONTIERS OF INFORMATION TECHNOLOGY & ELECTRONIC ENGINEERING, 2018, 19 (01) :64-77
[44]   A HIERARCHICAL, CONTEXT-DEPENDENT NEURAL NETWORK ARCHITECTURE FOR IMPROVED PHONE RECOGNITION [J].
Toth, Laszlo .
2011 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2011, :5040-5043
[45]   A Deep Normalization and Convolutional Neural Network for Image Smoke Detection [J].
Yin, Zhijian ;
Wan, Boyang ;
Yuan, Feiniu ;
Xia, Xue ;
Shi, Jinting .
IEEE ACCESS, 2017, 5 :18429-18438
[46]   Semi-Supervised Convolutional Neural Networks for Human Activity Recognition\ [J].
Zeng, Ming ;
Yu, Tong ;
Wang, Xiao ;
Nguyen, Le T. ;
Mengshoel, Ole J. ;
Lane, Ian .
2017 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2017, :522-529
[47]   Age Recognition from Facial Images using Convolutional Neural Networks [J].
Pakulich, D. V. ;
Yakimov, S. A. ;
Alyamkin, S. A. .
OPTOELECTRONICS INSTRUMENTATION AND DATA PROCESSING, 2019, 55 (03) :255-262
[48]   Age Recognition from Facial Images using Convolutional Neural Networks [J].
D. V. Pakulich ;
S. A. Yakimov ;
S. A. Alyamkin .
Optoelectronics, Instrumentation and Data Processing, 2019, 55 :255-262
[49]   DELVING DEEP INTO INTERPRETING NEURAL NETS WITH PIECE-WISE AFFINE REPRESENTATION [J].
Chen, Yifu ;
Saporta, Antoine ;
Dapogny, Arnaud ;
Cord, Matthieu .
2019 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2019, :609-613
[50]   Unconstrained ear recognition using deep neural networks [J].
Dodge, Samuel ;
Mounsef, Jinane ;
Karam, Lina .
IET BIOMETRICS, 2018, 7 (03) :207-214