A new hybrid decoding algorithm for speech recognition and utterance verification

被引：0

作者：

Koo, MW ^{[1
]}

Lee, CH ^{[1
]}

Juang, BH ^{[1
]}

机构：

[1] AT&T Bell Labs, Lucent Technol, Multimedia Commun Res Lab, Murray Hill, NJ 07974 USA

来源：

1997 IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING, PROCEEDINGS | 1997年

关键词：

D O I：

10.1109/ASRU.1997.659104

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

We propose a new search algorithm for one-pass utterance verification in which verification and recognition are simultaneously done. The basic idea is that we make a hybrid decoder in which the conventional Viterbi decoder is combined with a likelihood ratio decoder based on a confidence score representing the confidence level for utterance verification. We overcome the dynamic range problem encountered in the likelihood ratio decoder by use of a sigmoid limiter. We also improve its robustness by selective use of the anti-phone models in calculating the phone likelihood ratios. The proposed strategy is applied to a 1000-word Car Reservation task. Experimental results show that the proposed hybrid decoder gives better results than the convention likelihood decoder or the likelihood ratio decoder, particularly in dealing with out-of-vocabulary words or out-of-task sentences.

引用

页码：303 / 310

页数：8

共 50 条

[21] A Study of Speech Emotion Recognition Based on Hybrid Algorithm
Zhu Ju-xia
Zhang Chao
Lv Zhao
Rao Yao-quan
Wu Xiao-pei
INTERNATIONAL CONFERENCE ON GRAPHIC AND IMAGE PROCESSING (ICGIP 2011), 2011, 8285
[22] DISCRETE UTTERANCE SPEECH RECOGNITION WITHOUT TIME ALIGNMENT
SHORE, JE
BURTON, DK
IEEE TRANSACTIONS ON INFORMATION THEORY, 1983, 29 (04) : 473 - 491
[23] HISTORY UTTERANCE EMBEDDING TRANSFORMER LM FOR SPEECH RECOGNITION
Deng, Keqi
Cheng, Gaofeng
Miao, Haoran
Zhang, Pengyuan
Yan, Yonghong
2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 5914 - 5918
[24] A New Hybrid Algorithm for Speech Recognition Based on HMM Segmentation and Learning Vector Quantization
Katagiri, Shigeru
Lee, Chin-Hui
IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 1993, 1 (04): : 421 - 430
[25] Response Probability Based Decoding Algorithm for Large Vocabulary Continuous Speech Recognition
Yang, Zhanlei
Chao, Hao
Liu, Wenju
12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5, 2011, : 1940 - 1943
[26] Continuous speech recognition using dynamic Bayesian networks: A fast decoding algorithm
Deviren, M
Daoudi, K
ADVANCES IN BAYESIAN NETWORKS, 2004, 146 : 289 - 308
[27] Dynamic classifier combination in hybrid speech recognition systems using utterance-level confidence values
Kirchhoff, K
Bilmes, JA
ICASSP '99: 1999 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, PROCEEDINGS VOLS I-VI, 1999, : 693 - 696
[28] Utterance Confidence Measure for End-to-End Speech Recognition with Applications to Distributed Speech Recognition Scenarios
Kumar, Ankur
Singh, Sachin
Gowda, Dhananjaya
Garg, Abhinav
Singh, Shatrughan
Kim, Chanwoo
INTERSPEECH 2020, 2020, : 4357 - 4361
[29] Label Synchronous Decoding for Speech Recognition
Chen Z.-H.
Zheng W.-L.
You Y.-B.
Qian Y.-M.
Yu K.
Jisuanji Xuebao/Chinese Journal of Computers, 2019, 42 (07): : 1511 - 1523
[30] A new robust hybrid speech recognition algorithm based on FVQ/HMM and neural nets classification
Asghar, S
Cong, L
INTERNATIONAL CONFERENCE ON PARALLEL AND DISTRIBUTED PROCESSING TECHNIQUES AND APPLICATIONS, VOLS I-IV, PROCEEDINGS, 1998, : 1810 - 1816

← 1 2 3 4 5 →