Ensemble of Gaussian Mixture Localized Neural Networks with Application to Phone Recognition

Cited by: 0
Authors
Travadi, Ruchir [1]
Narayanan, Shrikanth [1]
Affiliation
[1] Univ Southern Calif, Signal Anal & Interpretat Lab, Los Angeles, CA 90089 USA
Source
16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5, 2015
Keywords
Neural Networks; Acoustic Modeling; Speech Recognition; Phone Recognition;
DOI
Not available
Chinese Library Classification number
O42 [Acoustics];
Discipline classification codes
070206 ; 082403 ;
Abstract
In this paper we present Ensemble of Gaussian Mixture Localized Neural Networks (EGMLNNs), a model for the joint probability density of the input and output variables of any general mapping to be estimated. The model aims to identify clusters in the input data, thereby replacing one complex classifier with an ensemble of simpler classifiers, each of which is localized to operate within its associated cluster. We present an algorithm for maximum likelihood parameter estimation for this model using Expectation Maximization (EM). Results on a phone recognition task on the TIMIT database show that the model improves on the performance of a single complex classifier while also reducing the computational cost of testing.
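The idea of localizing simple classifiers to Gaussian clusters can be illustrated with a minimal sketch. It is not the authors' implementation: the record does not give the model equations, so the sketch assumes diagonal-covariance Gaussians, uses a single softmax layer as a hypothetical stand-in for each localized network, and combines the local outputs by weighting them with the cluster posteriors. All function names and the toy parameters are invented for illustration, and the EM training procedure is not shown.

```python
import math


def diag_gaussian_logpdf(x, mean, var):
    """Log density of a diagonal-covariance Gaussian evaluated at x."""
    return sum(
        -0.5 * (math.log(2.0 * math.pi * v) + (xi - m) ** 2 / v)
        for xi, m, v in zip(x, mean, var)
    )


def softmax(scores):
    """Numerically stable softmax over a list of scores."""
    top = max(scores)
    exps = [math.exp(s - top) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def responsibilities(x, weights, means, variances):
    """Posterior probability of each cluster given the input x."""
    log_joint = [
        math.log(w) + diag_gaussian_logpdf(x, m, v)
        for w, m, v in zip(weights, means, variances)
    ]
    return softmax(log_joint)


def local_classifier(x, W, b):
    """Hypothetical stand-in for a localized network: one softmax layer."""
    scores = [
        sum(wi * xi for wi, xi in zip(row, x)) + bi
        for row, bi in zip(W, b)
    ]
    return softmax(scores)


def ensemble_posterior(x, weights, means, variances, classifiers):
    """Mix the local class posteriors, weighted by cluster responsibility."""
    gamma = responsibilities(x, weights, means, variances)
    n_classes = len(classifiers[0][1])
    mixed = [0.0] * n_classes
    for g, (W, b) in zip(gamma, classifiers):
        local = local_classifier(x, W, b)
        for c in range(n_classes):
            mixed[c] += g * local[c]
    return gamma, mixed


# Toy setup (assumed values): two clusters in 2-D, two output classes.
weights = [0.5, 0.5]
means = [[-2.0, 0.0], [2.0, 0.0]]
variances = [[1.0, 1.0], [1.0, 1.0]]
classifiers = [
    ([[1.0, 0.0], [-1.0, 0.0]], [0.0, 0.0]),  # classifier for cluster 0
    ([[0.0, 1.0], [0.0, -1.0]], [0.0, 0.0]),  # classifier for cluster 1
]

gamma, posterior = ensemble_posterior(
    [2.0, 1.0], weights, means, variances, classifiers
)
```

In this toy setup the input lies at the centre of the second cluster along the first axis, so nearly all of the responsibility falls on that cluster and its classifier dominates the mixed posterior. One plausible way such a model could save computation at test time, under the same assumptions, is to evaluate only the classifiers whose responsibility is non-negligible.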
Pages: 1903-1907
Page count: 5