DATA SAMPLING BASED ENSEMBLE ACOUSTIC MODELLING

被引:3
作者
Chen, Xin [1 ]
Zhao, Yunxin [1 ]
机构
[1] Univ Missouri, Dept Comp Sci, Columbia, MO 65211 USA
来源
2009 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1- 8, PROCEEDINGS | 2009年
关键词
ensemble classifier; acoustic modeling; hierarchical mixture ensemble; data sampling; cross validation;
D O I
10.1109/ICASSP.2009.4960456
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
In this paper, we propose a novel technique of using Cross Validation (CV) data sampling to construct an ensemble of acoustic models for conversational speech recognition. We further propose using Hierarchical Gaussian Mixture Model (HGMM) and repartition training data to increase the ensemble size and diversity. The proposed methods are found to work well together for ensemble acoustic modeling. We also evaluated the quality of the ensemble acoustic models by using the measures of classification margin, average correct score and variance of correct score. We have found that the ensemble of acoustic models increases the margin and the average correct score, and reduces the variance. We compared the performance of our proposed method with a recently reported method of CV Expectation Maximization (CVEM) for single acoustic models. Our experimental results on a telemedicine automatic captioning task showed that the proposed ensemble acoustic modeling has led to significant improvements in word recognition accuracy.
引用
收藏
页码:3805 / 3808
页数:4
相关论文
共 12 条
  • [1] Random forests
    Breiman, L
    [J]. MACHINE LEARNING, 2001, 45 (01) : 5 - 32
  • [2] CHEN X, 2008, THESIS U MISSOURI
  • [3] Ensemble methods in machine learning
    Dietterich, TG
    [J]. MULTIPLE CLASSIFIER SYSTEMS, 2000, 1857 : 1 - 15
  • [4] A post-processing system to yield reduced word error rates: Recognizer output voting error reduction (ROVER)
    Fiscus, JG
    [J]. 1997 IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING, PROCEEDINGS, 1997, : 347 - 354
  • [5] *HTK, HTK TOOLK
  • [6] Ko AHR, 2007, LECT NOTES COMPUT SC, V4472, P52
  • [7] Shinozaki T, 2007, INT CONF ACOUST SPEE, P437
  • [8] Siohan O, 2005, INT CONF ACOUST SPEE, P197
  • [9] Random forests of phonetic decision trees for acoustic modeling in conversational speech recognition
    Xue, Jian
    Zhao, Yunxin
    [J]. IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2008, 16 (03): : 519 - 528
  • [10] Young S., 1994, P ARPA WORKSHOP HUMA, P307