Convolutional Deep Rectifier Neural Nets for Phone Recognition

Cited by: 0
Authors
Toth, Laszlo [1, 2]
Affiliations
[1] Hungarian Acad Sci, Res Grp Artificial Intelligence, Budapest, Hungary
[2] Univ Szeged, Szeged, Hungary
Source
14TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2013), VOLS 1-5 | 2013
Keywords
Deep neural networks; sparse rectifier neural networks; phone recognition
DOI
Not available
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Rectifier neurons differ from standard ones only in that the sigmoid activation function is replaced by the rectifier function, max(0, x). Several recent studies suggest that rectifier units may be more suitable building units for deep nets. For example, we found that with deep rectifier networks one can attain speech recognition performance similar to that of sigmoid nets, but without the need for the time-consuming pre-training procedure. Here, we extend the previous results by modifying the rectifier network so that it has a convolutional structure. As convolutional networks are inherently deep, rectifier neurons seem to be an ideal choice as their building units. Indeed, on the TIMIT phone recognition task we report a 6% relative error reduction compared to our earlier results, giving an 18.6% error rate on the core test set. Then, by applying the recently proposed 'dropout' training method, we reduce the error rate further to 17.8%, which, to our knowledge, is the best result to date on this database.
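As a rough illustration of the terms used in the abstract, the following Python sketch contrasts the rectifier function max(0, x) with the sigmoid, applies a simple form of dropout, and computes a relative error reduction from the two error rates quoted above. It is not the author's implementation: the function names, the sample inputs, the drop probability of 0.5, and the "inverted" scaling of surviving units are all assumptions made for the example, since the abstract does not specify them.

```python
import math
import random


def rectifier(x):
    """Rectifier activation from the abstract: max(0, x)."""
    return max(0.0, x)


def sigmoid(x):
    """Standard logistic sigmoid, the activation the rectifier replaces."""
    return 1.0 / (1.0 + math.exp(-x))


def dropout(activations, drop_prob, rng):
    """Zero each unit with probability drop_prob (training-time only).

    Assumption: survivors are scaled by 1 / (1 - drop_prob) ("inverted"
    dropout); the abstract does not say which variant was used.
    """
    keep_prob = 1.0 - drop_prob
    return [a / keep_prob if rng.random() < keep_prob else 0.0
            for a in activations]


def relative_reduction(old_error, new_error):
    """Relative error reduction when going from old_error to new_error."""
    return (old_error - new_error) / old_error


# Hypothetical pre-activation values, chosen only for illustration.
inputs = [-2.0, -0.5, 0.0, 0.5, 2.0]
rect_out = [rectifier(x) for x in inputs]   # [0.0, 0.0, 0.0, 0.5, 2.0]
sig_out = [sigmoid(x) for x in inputs]

rng = random.Random(0)                      # fixed seed for repeatability
dropped = dropout(rect_out, 0.5, rng)

# Relative gain of the dropout result (17.8%) over the 18.6% result.
gain = relative_reduction(18.6, 17.8)
print(rect_out, dropped, round(gain, 3))
```

Negative inputs map exactly to zero under the rectifier, which is the source of the sparsity named in the keywords, whereas the sigmoid output is never exactly zero. By the same arithmetic, the step from 18.6% to 17.8% corresponds to a relative reduction of about 4.3%.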
Pages: 1721-1725
Number of pages: 5