Convolutional Deep Rectifier Neural Nets for Phone Recognition

被引：0

作者：

Toth, Laszlo ^{[1
,2
]}

机构：

[1] Hungarian Acad Sci, Res Grp Artificial Intelligence, Budapest, Hungary

[2] Univ Szeged, Szeged, Hungary

来源：

14TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2013), VOLS 1-5 | 2013年

关键词：

Deep neural networks; sparse rectifier neural networks; phone recognition;

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Rectifier neurons differ from standard ones only in that the sigmoid activation function is replaced by the rectifier function, max(0, x). Several recent studies suggest that rectifier units may be more suitable building units for deep nets. For example, we found that with deep rectifier networks one can attain a similar speech recognition performance than that with sigmoid nets, but without the need for the time-consuming pre-training procedure. Here, we extend the previous results by modifying the rectifier network so that it has a convolutional structure. As convolutional networks are inherently deep, rectifier neurons seem to be an ideal choice as their building units. Indeed, on the TIMIT phone recognition task we report a 6% relative error reduction compared to our earlier results, giving an 18.6% error rate on the core test set. Then, with the application of the recently proposed 'dropout' training method we reduce the error rate further to 17.8%, which, to our knowledge, is the best result to date on this database.

引用

页码：1721 / 1725

页数：5

共 50 条

[1] PHONE RECOGNITION WITH DEEP SPARSE RECTIFIER NEURAL NETWORKS
Toth, Laszlo
2013 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2013, : 6985 - 6989
[2] Convolutional Deep Maxout Networks for Phone Recognition
Toth, Laszlo
15TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2014), VOLS 1-4, 2014, : 1078 - 1082
[3] A Sequence Training Method for Deep Rectifier Neural Networks in Speech Recognition
Grosz, Tamas
Gosztolya, Gabor
Toth, Laszlo
SPEECH AND COMPUTER, 2014, 8773 : 81 - 88
[4] Deep convolutional neural networks are not mechanistic explanations of object recognition
Grujicic, Bojana
SYNTHESE, 2024, 203 (01)
[5] Deep Convolutional Neural Networks based on Manifold for Smoke Recognition
Cheng, Ming
Ma, Pei
He, Ruhan
Chen, Jia
Zhang, Zili
Huang, Jin
2021 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2021,
[6] INVESTIGATION OF DEEP BOLTZMANN MACHINES FOR PHONE RECOGNITION
You, Zhao
Wang, Xiaorui
Xu, Bo
2013 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2013, : 7600 - 7603
[7] BOOSTING ATTRIBUTE AND PHONE ESTIMATION ACCURACIES WITH DEEP NEURAL NETWORKS FOR DETECTION-BASED SPEECH RECOGNITION
Yu, Dong
Siniscalchi, Sabato Marco
Deng, Li
Lee, Chin-Hui
2012 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2012, : 4169 - 4172
[8] A COMPARISON BETWEEN DEEP NEURAL NETS AND KERNEL ACOUSTIC MODELS FOR SPEECH RECOGNITION
Lu, Zhiyun
Guo, Dong
Garakani, Alireza Bagheri
Liu, Kuan
May, Avner
Bellet, Aurelien
Fan, Linxi
Collins, Michael
Kingsbury, Brian
Picheny, Michael
Sha, Fei
2016 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING PROCEEDINGS, 2016, : 5070 - 5074
[9] RGB-D OBJECT RECOGNITION WITH MULTIMODAL DEEP CONVOLUTIONAL NEURAL NETWORKS
Rahman, Mohammad Muntasir
Tan, Yanhao
Xue, Jian
Lu, Ke
2017 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME), 2017, : 991 - 996
[10] Deep Convolutional Neural Network with Multi-Task Learning Scheme for Modulations Recognition
Mossad, Omar S.
ElNainay, Mustafa
Torki, Marwan
2019 15TH INTERNATIONAL WIRELESS COMMUNICATIONS & MOBILE COMPUTING CONFERENCE (IWCMC), 2019, : 1644 - 1649

← 1 2 3 4 5 →