Convolutional Deep Rectifier Neural Nets for Phone Recognition

被引:0
作者
Toth, Laszlo [1 ,2 ]
机构
[1] Hungarian Acad Sci, Res Grp Artificial Intelligence, Budapest, Hungary
[2] Univ Szeged, Szeged, Hungary
来源
14TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2013), VOLS 1-5 | 2013年
关键词
Deep neural networks; sparse rectifier neural networks; phone recognition;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Rectifier neurons differ from standard ones only in that the sigmoid activation function is replaced by the rectifier function, max(0, x). Several recent studies suggest that rectifier units may be more suitable building units for deep nets. For example, we found that with deep rectifier networks one can attain a similar speech recognition performance than that with sigmoid nets, but without the need for the time-consuming pre-training procedure. Here, we extend the previous results by modifying the rectifier network so that it has a convolutional structure. As convolutional networks are inherently deep, rectifier neurons seem to be an ideal choice as their building units. Indeed, on the TIMIT phone recognition task we report a 6% relative error reduction compared to our earlier results, giving an 18.6% error rate on the core test set. Then, with the application of the recently proposed 'dropout' training method we reduce the error rate further to 17.8%, which, to our knowledge, is the best result to date on this database.
引用
收藏
页码:1721 / 1725
页数:5
相关论文
共 50 条
[31]   Fingerprint Recognition by Deep Neural Networks and Fingercodes [J].
Basturk, Alper ;
Sarikaya Basturk, Nurcan ;
Qurbanov, Orxan .
2018 26TH SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE (SIU), 2018,
[32]   DEEP NEURAL NETWORKS FOR AUDIO SCENE RECOGNITION [J].
Petetin, Yohan ;
Laroche, Cyrille ;
Mayoue, Aurelien .
2015 23RD EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2015, :125-129
[33]   Insights into Deep Neural Networks for Speaker Recognition [J].
Garcia-Romero, Daniel ;
McCree, Alan .
16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5, 2015, :1141-1145
[34]   SPEECH RECOGNITION WITH DEEP RECURRENT NEURAL NETWORKS [J].
Graves, Alex ;
Mohamed, Abdel-rahman ;
Hinton, Geoffrey .
2013 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2013, :6645-6649
[35]   Recent advances in efficient computation of deep convolutional neural networks [J].
Jian CHENG ;
Peisong WANG ;
Gang LI ;
Qinghao HU ;
Hanqing LU .
FrontiersofInformationTechnology&ElectronicEngineering, 2018, 19 (01) :64-77
[36]   Neonatal Seizure Detection Using Deep Convolutional Neural Networks [J].
Ansari, Amir H. ;
Cherian, Perumpillichira J. ;
Caicedo, Alexander ;
Naulaers, Gunnar ;
De Vos, Maarten ;
Van Huffel, Sabine .
INTERNATIONAL JOURNAL OF NEURAL SYSTEMS, 2019, 29 (04)
[37]   High performance reconfigurable accelerator for deep convolutional neural networks [J].
Qiao R. ;
Chen G. ;
Gong G. ;
Lu H. .
Xi'an Dianzi Keji Daxue Xuebao/Journal of Xidian University, 2019, 46 (03) :130-139
[38]   A novel radioactive particle tracking algorithm based on deep rectifier neural network [J].
de Freitas Dam, Roos Sophia ;
dos Santos, Marcelo Carvalho ;
Moreira do Desterro, Filipe Santana ;
Salgado, William Luna ;
Schirru, Roberto ;
Salgado, Cesar Marques .
NUCLEAR ENGINEERING AND TECHNOLOGY, 2021, 53 (07) :2334-2340
[39]   Estimation of crowd density based on deep convolutional neural networks [J].
Tan, Zhiyong ;
Yuan, Jiazheng ;
Bao, Hong ;
Liu, Hongzhe ;
Li, Qing .
Engineering Intelligent Systems, 2016, 24 (3-4) :131-138
[40]   Prediction to Atrial Fibrillation Using Deep Convolutional Neural Networks [J].
Cho, Jungrae ;
Kim, Yoonnyun ;
Lee, Minho .
PREDICTIVE INTELLIGENCE IN MEDICINE, 2018, 11121 :164-171