Convolutional Deep Rectifier Neural Nets for Phone Recognition

被引:0
|
作者
Toth, Laszlo [1 ,2 ]
机构
[1] Hungarian Acad Sci, Res Grp Artificial Intelligence, Budapest, Hungary
[2] Univ Szeged, Szeged, Hungary
来源
14TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2013), VOLS 1-5 | 2013年
关键词
Deep neural networks; sparse rectifier neural networks; phone recognition;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Rectifier neurons differ from standard ones only in that the sigmoid activation function is replaced by the rectifier function, max(0, x). Several recent studies suggest that rectifier units may be more suitable building units for deep nets. For example, we found that with deep rectifier networks one can attain a similar speech recognition performance than that with sigmoid nets, but without the need for the time-consuming pre-training procedure. Here, we extend the previous results by modifying the rectifier network so that it has a convolutional structure. As convolutional networks are inherently deep, rectifier neurons seem to be an ideal choice as their building units. Indeed, on the TIMIT phone recognition task we report a 6% relative error reduction compared to our earlier results, giving an 18.6% error rate on the core test set. Then, with the application of the recently proposed 'dropout' training method we reduce the error rate further to 17.8%, which, to our knowledge, is the best result to date on this database.
引用
收藏
页码:1721 / 1725
页数:5
相关论文
共 50 条
  • [1] PHONE RECOGNITION WITH DEEP SPARSE RECTIFIER NEURAL NETWORKS
    Toth, Laszlo
    2013 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2013, : 6985 - 6989
  • [2] Convolutional Deep Maxout Networks for Phone Recognition
    Toth, Laszlo
    15TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2014), VOLS 1-4, 2014, : 1078 - 1082
  • [3] A Sequence Training Method for Deep Rectifier Neural Networks in Speech Recognition
    Grosz, Tamas
    Gosztolya, Gabor
    Toth, Laszlo
    SPEECH AND COMPUTER, 2014, 8773 : 81 - 88
  • [4] Deep convolutional neural networks are not mechanistic explanations of object recognition
    Grujicic, Bojana
    SYNTHESE, 2024, 203 (01)
  • [5] Deep Convolutional Neural Networks based on Manifold for Smoke Recognition
    Cheng, Ming
    Ma, Pei
    He, Ruhan
    Chen, Jia
    Zhang, Zili
    Huang, Jin
    2021 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2021,
  • [6] INVESTIGATION OF DEEP BOLTZMANN MACHINES FOR PHONE RECOGNITION
    You, Zhao
    Wang, Xiaorui
    Xu, Bo
    2013 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2013, : 7600 - 7603
  • [7] BOOSTING ATTRIBUTE AND PHONE ESTIMATION ACCURACIES WITH DEEP NEURAL NETWORKS FOR DETECTION-BASED SPEECH RECOGNITION
    Yu, Dong
    Siniscalchi, Sabato Marco
    Deng, Li
    Lee, Chin-Hui
    2012 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2012, : 4169 - 4172
  • [8] A COMPARISON BETWEEN DEEP NEURAL NETS AND KERNEL ACOUSTIC MODELS FOR SPEECH RECOGNITION
    Lu, Zhiyun
    Guo, Dong
    Garakani, Alireza Bagheri
    Liu, Kuan
    May, Avner
    Bellet, Aurelien
    Fan, Linxi
    Collins, Michael
    Kingsbury, Brian
    Picheny, Michael
    Sha, Fei
    2016 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING PROCEEDINGS, 2016, : 5070 - 5074
  • [9] RGB-D OBJECT RECOGNITION WITH MULTIMODAL DEEP CONVOLUTIONAL NEURAL NETWORKS
    Rahman, Mohammad Muntasir
    Tan, Yanhao
    Xue, Jian
    Lu, Ke
    2017 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME), 2017, : 991 - 996
  • [10] Deep Convolutional Neural Network with Multi-Task Learning Scheme for Modulations Recognition
    Mossad, Omar S.
    ElNainay, Mustafa
    Torki, Marwan
    2019 15TH INTERNATIONAL WIRELESS COMMUNICATIONS & MOBILE COMPUTING CONFERENCE (IWCMC), 2019, : 1644 - 1649