Ensemble of Gaussian Mixture Localized Neural Networks with Application to Phone Recognition

被引:0
作者
Travadi, Ruchir [1 ]
Narayanan, Shrikanth [1 ]
机构
[1] Univ Southern Calif, Signal Anal & Interpretat Lab, Los Angeles, CA 90089 USA
来源
16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5 | 2015年
关键词
Neural Networks; Acoustic Modeling; Speech Recognition; Phone Recognition;
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
In this paper we present Ensemble of Gaussian Mixture Localized Neural Networks (EGMLNNs), a model for the joint probability density of input as well as output variables of any general mapping to be estimated. The model aims at identifying clusters in the input data, thereby replacing one complex classifier with an ensemble of relatively simpler classifiers, each of which is localized to operate within its associated cluster. We present an algorithm for maximum likelihood parameter estimation for this model using Expectation Maximization (EM). The reported results on phone recognition task on TIMIT database show that the model is able to obtain performance improvement over a single complex classifier while also reducing the computational complexity required for testing.
引用
收藏
页码:1903 / 1907
页数:5
相关论文
共 50 条
  • [31] Neural Network Phone Duration Model for Speech Recognition
    Alumae, Tanel
    15TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2014), VOLS 1-4, 2014, : 1204 - 1208
  • [32] DEEP MAXOUT NEURAL NETWORKS FOR SPEECH RECOGNITION
    Cai, Meng
    Shi, Yongzhe
    Liu, Jia
    2013 IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING (ASRU), 2013, : 291 - 296
  • [33] Voice Recognition Technology Using Neural Networks
    Zaatri, Abdelouahab
    Azzizi, Norelhouda
    Rahmani, Fouad Lazhar
    JOURNAL OF NEW TECHNOLOGY AND MATERIALS, 2015, 5 (01) : 27 - 31
  • [34] Android Application for Object Recognition using Neural Networks for the Visually Impaired
    Dosi, Sanika
    Sambare, Shivani
    Singh, Shashank
    Lokhande, Netra
    Garware, Bhushan
    2018 FOURTH INTERNATIONAL CONFERENCE ON COMPUTING COMMUNICATION CONTROL AND AUTOMATION (ICCUBEA), 2018,
  • [35] Application of Pretrained Deep Neural Networks to Large Vocabulary Speech Recognition
    Jaitly, Navdeep
    Patrick Nguyen
    Senior, Andrew
    Vanhoucke, Vincent
    13TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2012 (INTERSPEECH 2012), VOLS 1-3, 2012, : 2577 - 2580
  • [36] Application of Transfer Learning for Object Recognition Using Convolutional Neural Networks
    Diaz Salazar, Nicolas
    Lopez Sotelo, Jesus Alfonso
    Salazar Gomez, Gustavo Andres
    2018 IEEE 1ST COLOMBIAN CONFERENCE ON APPLICATIONS IN COMPUTATIONAL INTELLIGENCE (COLCACI), 2018,
  • [37] Application of concurrent generalized regression neural networks for arabic speech recognition
    Shoaib, M
    Awais, M
    Masud, S
    Shamail, S
    Akhtar, J
    Proceedings of the Second IASTED International Conference on Neural Networks and Computational Intelligence, 2004, : 206 - 210
  • [38] Application of Transfer Learning for Object Recognition Using Convolutional Neural Networks
    Lopez Sotelo, Jesus Alfonso
    Diaz Salazar, Nicolas
    Salazar Gomez, Gustavo Andres
    APPLICATIONS OF COMPUTATIONAL INTELLIGENCE, COLCACI 2018, 2018, 833 : 14 - 25
  • [39] Phone duration modeling for LVCSR using neural networks
    Hadian, Hossein
    Povey, Daniel
    Sameti, Hossein
    Khudanpur, Sanjeev
    18TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2017), VOLS 1-6: SITUATED INTERACTION, 2017, : 518 - 522
  • [40] Application of Convolutional Neural Networks for Pattern Recognition Circuits of Railway Automatics. Specifics of this Application
    Blagoveschenskaya, E. A.
    Zuev, D. V.
    Garbaruk, V. V.
    Gerasimenko, V. A.
    Sedykh, D. V.
    Kunets, D. S.
    PROCEEDINGS OF 2017 XX IEEE INTERNATIONAL CONFERENCE ON SOFT COMPUTING AND MEASUREMENTS (SCM), 2017, : 434 - 435