Discriminative optimisation of large vocabulary recognition systems

被引:0
|
作者
Valtchev, V
Woodland, PC
Young, SJ
机构
来源
ICSLP 96 - FOURTH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, VOLS 1-4 | 1996年
关键词
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
This paper describes a framework for optimising the structure and parameters of a continuous density HMM-based large vocabulary recognition system using the Maximum Mutual Information Estimation (MMIE) criterion. To reduce the computational complexity of the MMIE training algorithm, confusable segments of speech are identified and stored as word lattices of alternative utterance hypotheses. An iterative mixture splitting procedure is also employed to adjust the number of mixture components in each state during training such that the optimal balance between number of parameters and available Paining data is achieved. Experiments are presented on various test sets from the Wall Street Journal database using the full SI-284 training set. These show that the use of lattices makes MMIE training practicable for very complex recognition systems and large Paining sets. Furthermore, experimental results demonstrate that MMIE optimisation of system structure and parameters can yield useful increases in recognition accuracy.
引用
收藏
页码:18 / 21
页数:4
相关论文
共 50 条
  • [21] Compound words in large-vocabulary German speech recognition systems
    Berton, A
    Fetter, P
    RegelBrietzmann, P
    ICSLP 96 - FOURTH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, VOLS 1-4, 1996, : 1165 - 1168
  • [22] Large vocabulary speech recognition in French
    Adda-Decker, M
    Adda, G
    Gauvain, JL
    Lamel, L
    ICASSP '99: 1999 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, PROCEEDINGS VOLS I-VI, 1999, : 45 - 48
  • [23] Advances in Large Vocabulary Speech Recognition
    Gauvain, JL
    De Mori, R
    Lamel, L
    COMPUTER SPEECH AND LANGUAGE, 2002, 16 (01): : 1 - 3
  • [24] Large vocabulary speech recognition in French
    Adda-Decker, Martine
    Adda, Gilles
    Gauvain, Jean-Luc
    Lamel, Lori
    ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, 1999, 1 : 45 - 48
  • [25] An Exploration of Large Vocabulary Tools for Small Vocabulary Phonetic Recognition
    Sainath, Tara N.
    Ramabhadran, Bhuvana
    Picheny, Michael
    2009 IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION & UNDERSTANDING (ASRU 2009), 2009, : 359 - 364
  • [26] Discriminative and generative vocabulary tree: With application to vein image authentication and recognition
    Wang, Jinjun
    Xiao, Jing
    Lin, Weiyao
    Luo, Chuanfei
    IMAGE AND VISION COMPUTING, 2015, 34 : 51 - 62
  • [27] Context-Aware Discriminative Vocabulary Learning for Mobile Landmark Recognition
    Chen, Tao
    Yap, Kim-Hui
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2013, 23 (09) : 1611 - 1621
  • [28] Continuous sign language recognition: Towards large vocabulary statistical recognition systems handling multiple signers
    Koller, Oscar
    Forster, Jens
    Ney, Hermann
    COMPUTER VISION AND IMAGE UNDERSTANDING, 2015, 141 : 108 - 125
  • [29] Large Vocabulary Speech Recognition on Parallel Architectures
    Cardinal, Patrick
    Dumouchel, Pierre
    Boulianne, Gilles
    IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2013, 21 (11): : 2290 - 2300
  • [30] Croatian Large Vocabulary Automatic Speech Recognition
    Martincic-Ipsic, Sanda
    Pobar, Miran
    Ipsic, Ivo
    AUTOMATIKA, 2011, 52 (02) : 147 - 157