Discriminative optimisation of large vocabulary recognition systems

被引：0

作者：

Valtchev, V

Woodland, PC

Young, SJ

机构：

来源：

ICSLP 96 - FOURTH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, VOLS 1-4 | 1996年

关键词：

D O I：

暂无

中图分类号：

O42 [声学];

学科分类号：

070206 ; 082403 ;

摘要：

This paper describes a framework for optimising the structure and parameters of a continuous density HMM-based large vocabulary recognition system using the Maximum Mutual Information Estimation (MMIE) criterion. To reduce the computational complexity of the MMIE training algorithm, confusable segments of speech are identified and stored as word lattices of alternative utterance hypotheses. An iterative mixture splitting procedure is also employed to adjust the number of mixture components in each state during training such that the optimal balance between number of parameters and available Paining data is achieved. Experiments are presented on various test sets from the Wall Street Journal database using the full SI-284 training set. These show that the use of lattices makes MMIE training practicable for very complex recognition systems and large Paining sets. Furthermore, experimental results demonstrate that MMIE optimisation of system structure and parameters can yield useful increases in recognition accuracy.

引用

页码：18 / 21

页数：4

共 50 条

[21] Compound words in large-vocabulary German speech recognition systems
Berton, A
Fetter, P
RegelBrietzmann, P
ICSLP 96 - FOURTH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, VOLS 1-4, 1996, : 1165 - 1168
[22] Large vocabulary speech recognition in French
Adda-Decker, M
Adda, G
Gauvain, JL
Lamel, L
ICASSP '99: 1999 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, PROCEEDINGS VOLS I-VI, 1999, : 45 - 48
[23] Advances in Large Vocabulary Speech Recognition
Gauvain, JL
De Mori, R
Lamel, L
COMPUTER SPEECH AND LANGUAGE, 2002, 16 (01): : 1 - 3
[24] Large vocabulary speech recognition in French
Adda-Decker, Martine
Adda, Gilles
Gauvain, Jean-Luc
Lamel, Lori
ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, 1999, 1 : 45 - 48
[25] An Exploration of Large Vocabulary Tools for Small Vocabulary Phonetic Recognition
Sainath, Tara N.
Ramabhadran, Bhuvana
Picheny, Michael
2009 IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION & UNDERSTANDING (ASRU 2009), 2009, : 359 - 364
[26] Discriminative and generative vocabulary tree: With application to vein image authentication and recognition
Wang, Jinjun
Xiao, Jing
Lin, Weiyao
Luo, Chuanfei
IMAGE AND VISION COMPUTING, 2015, 34 : 51 - 62
[27] Context-Aware Discriminative Vocabulary Learning for Mobile Landmark Recognition
Chen, Tao
Yap, Kim-Hui
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2013, 23 (09) : 1611 - 1621
[28] Continuous sign language recognition: Towards large vocabulary statistical recognition systems handling multiple signers
Koller, Oscar
Forster, Jens
Ney, Hermann
COMPUTER VISION AND IMAGE UNDERSTANDING, 2015, 141 : 108 - 125
[29] Large Vocabulary Speech Recognition on Parallel Architectures
Cardinal, Patrick
Dumouchel, Pierre
Boulianne, Gilles
IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2013, 21 (11): : 2290 - 2300
[30] Croatian Large Vocabulary Automatic Speech Recognition
Martincic-Ipsic, Sanda
Pobar, Miran
Ipsic, Ivo
AUTOMATIKA, 2011, 52 (02) : 147 - 157

← 1 2 3 4 5 →