Ensemble of Gaussian Mixture Localized Neural Networks with Application to Phone Recognition

Cited by: 0
Authors
Travadi, Ruchir [1]
Narayanan, Shrikanth [1]
Affiliations
[1] Univ Southern Calif, Signal Anal & Interpretat Lab, Los Angeles, CA 90089 USA
Source
16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5 | 2015
Keywords
Neural Networks; Acoustic Modeling; Speech Recognition; Phone Recognition
DOI
Not available
Chinese Library Classification number
O42 [Acoustics]
Discipline classification codes
070206; 082403
Abstract
In this paper we present Ensemble of Gaussian Mixture Localized Neural Networks (EGMLNNs), a model for the joint probability density of the input and output variables of a general mapping to be estimated. The model identifies clusters in the input data and thereby replaces one complex classifier with an ensemble of simpler classifiers, each localized to operate within its associated cluster. We present an algorithm for maximum likelihood parameter estimation for this model using Expectation Maximization (EM). Results on a phone recognition task on the TIMIT database show that the model improves on the performance of a single complex classifier while also reducing the computational cost of testing.
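The idea in the abstract, soft cluster assignments from a Gaussian mixture that weight the outputs of per-cluster classifiers, might be sketched roughly as follows. This is only an illustrative sketch and not the authors' implementation: the diagonal covariances, the linear-softmax local classifiers (standing in for neural networks), the random parameters, and all function names are assumptions, and the EM training step is not shown.

```python
import numpy as np

def gaussian_log_pdf(x, mean, var):
    """Row-wise log density of a diagonal-covariance Gaussian (assumed form)."""
    diff = x - mean
    return -0.5 * np.sum(np.log(2.0 * np.pi * var) + diff * diff / var, axis=1)

def responsibilities(x, weights, means, variances):
    """Posterior p(k | x) over clusters for each row of x."""
    log_joint = np.stack(
        [np.log(w) + gaussian_log_pdf(x, m, v)
         for w, m, v in zip(weights, means, variances)],
        axis=1,
    )
    # Subtract the row maximum before exponentiating for numerical stability.
    log_joint -= log_joint.max(axis=1, keepdims=True)
    post = np.exp(log_joint)
    return post / post.sum(axis=1, keepdims=True)

def softmax(z):
    z = z - z.max(axis=1, keepdims=True)
    e = np.exp(z)
    return e / e.sum(axis=1, keepdims=True)

def ensemble_predict(x, weights, means, variances, local_params):
    """p(y | x) = sum_k p(k | x) * p(y | x, k).

    Each local classifier is a hypothetical linear-softmax model (W, b),
    used here as a stand-in for a cluster-localized neural network.
    """
    resp = responsibilities(x, weights, means, variances)
    out = 0.0
    for k, (W, b) in enumerate(local_params):
        out = out + resp[:, [k]] * softmax(x @ W + b)
    return out

# Toy setup with made-up sizes and random parameters (illustration only).
rng = np.random.default_rng(0)
n_clusters, dim, n_classes = 3, 4, 5
weights = np.full(n_clusters, 1.0 / n_clusters)
means = rng.normal(size=(n_clusters, dim))
variances = np.ones((n_clusters, dim))
local_params = [
    (rng.normal(size=(dim, n_classes)), np.zeros(n_classes))
    for _ in range(n_clusters)
]
x = rng.normal(size=(6, dim))
probs = ensemble_predict(x, weights, means, variances, local_params)
```

Each row of `probs` is a distribution over classes. If the cluster posteriors are sharply peaked, only the dominant cluster's classifier would need to be evaluated for a given input, which is one plausible reading of the reduced test-time cost mentioned in the abstract.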
Pages: 1903-1907
Page count: 5
Related papers
50 in total
  • [1] FACE RECOGNITION USING ENSEMBLE OF NEURAL NETWORKS
    Alekseichevs, M.
    Glazs, A.
    ICAART 2009: PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON AGENTS AND ARTIFICIAL INTELLIGENCE, 2009, : 144 - +
  • [2] Gaussian Process Neural Networks for Speech Recognition
    Lam, Max W. Y.
    Hu, Shoukang
    Xie, Xurong
    Liu, Shansong
    Yu, Jianwei
    Su, Rongfeng
    Liu, Xunying
    Meng, Helen
    19TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2018), VOLS 1-6: SPEECH RESEARCH FOR EMERGING MARKETS IN MULTILINGUAL SOCIETIES, 2018, : 1778 - 1782
  • [3] PHONE RECOGNITION WITH DEEP SPARSE RECTIFIER NEURAL NETWORKS
    Toth, Laszlo
    2013 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2013, : 6985 - 6989
  • [4] An adaptive Gaussian mixture method for nonlinear uncertainty propagation in neural networks
    Zhang, Bin
    Shin, Yung C.
    NEUROCOMPUTING, 2021, 458 (458) : 170 - 183
  • [5] APPLICATION OF NEURAL NETWORKS IN IMAGE DEFINITION RECOGNITION
    Chen Guojin
    Zhu Miaofen
    Yu Honghao
    Li Yan
    ICSPC: 2007 IEEE INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING AND COMMUNICATIONS, VOLS 1-3, PROCEEDINGS, 2007, : 1207 - 1210
  • [6] Comparison of Subspace Methods for Gaussian Mixture Models in Speech Recognition
    Varjokallio, Matti
    Kurimo, Mikko
    INTERSPEECH 2007: 8TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION, VOLS 1-4, 2007, : 181 - 184
  • [7] Boosted Mixture Learning of Gaussian Mixture HMMs for Speech Recognition
    Du, Jun
    Hu, Yu
    Jiang, Hui
    11TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2010 (INTERSPEECH 2010), VOLS 3 AND 4, 2010, : 2942 - +
  • [8] Application of Neural Networks and Machine Learning in Image Recognition
    Galić, Dario
    Stojanović, Zvezdan
    Čajić, Elvir
    TEHNICKI VJESNIK-TECHNICAL GAZETTE, 2024, 31 (01): : 316 - 323
  • [9] Automatic genre classification of TV programmes using Gaussian mixture models and neural networks
    Montagnuolo, Maurizio
    Messina, Alberto
    DEXA 2007: 18TH INTERNATIONAL CONFERENCE ON DATABASE AND EXPERT SYSTEMS APPLICATIONS, PROCEEDINGS, 2007, : 99 - +
  • [10] SUBSPACE GAUSSIAN MIXTURE MODELS FOR SPEECH RECOGNITION
    Povey, Daniel
    Burget, Lukas
    Agarwal, Mohit
    Akyazi, Pinar
    Feng, Kai
    Ghoshal, Arnab
    Glembek, Ondrej
    Goel, Nagendra Kumar
    Karafiat, Martin
    Rastrow, Ariya
    Rose, Richard C.
    Schwarz, Petr
    Thomas, Samuel
    2010 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2010, : 4330 - 4333