A new hybrid decoding algorithm for speech recognition and utterance verification

Times cited: 0
Authors
Koo, MW [1]
Lee, CH [1]
Juang, BH [1]
Affiliations
[1] AT&T Bell Labs, Lucent Technol, Multimedia Commun Res Lab, Murray Hill, NJ 07974 USA
DOI
10.1109/ASRU.1997.659104
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification code
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
We propose a new search algorithm for one-pass utterance verification in which verification and recognition are performed simultaneously. The basic idea is to build a hybrid decoder that combines the conventional Viterbi decoder with a likelihood ratio decoder based on a confidence score, which represents the confidence level for utterance verification. We overcome the dynamic range problem encountered in the likelihood ratio decoder by using a sigmoid limiter. We also improve its robustness by selectively using the anti-phone models when calculating the phone likelihood ratios. The proposed strategy is applied to a 1000-word Car Reservation task. Experimental results show that the proposed hybrid decoder gives better results than either the conventional likelihood decoder or the likelihood ratio decoder, particularly in dealing with out-of-vocabulary words or out-of-task sentences.
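The abstract does not give formulas, so the following is only a minimal sketch of the two ideas it names: passing a phone log-likelihood ratio through a sigmoid limiter to bound its dynamic range, and using the anti-phone model selectively. The function names, the `slope`, `offset` and `weight` parameters, and the linear way the two scores are mixed are all illustrative assumptions, not the authors' formulation.

```python
import math


def sigmoid_limiter(llr, slope=1.0, offset=0.0):
    """Map an unbounded log-likelihood ratio into the range [0, 1].

    `slope` and `offset` are hypothetical tuning parameters.
    """
    z = slope * (llr - offset)
    # Numerically stable logistic function for both signs of z.
    if z >= 0:
        return 1.0 / (1.0 + math.exp(-z))
    e = math.exp(z)
    return e / (1.0 + e)


def hybrid_frame_score(log_lik_phone, log_lik_anti, weight=0.5, use_anti=True):
    """Mix a likelihood score with a limited likelihood-ratio confidence.

    Assumption: a simple linear interpolation with a hypothetical `weight`.
    When `use_anti` is False the anti-phone model is skipped and the plain
    likelihood score is returned (a stand-in for "selective use").
    """
    if not use_anti:
        return log_lik_phone
    llr = log_lik_phone - log_lik_anti      # phone log-likelihood ratio
    confidence = sigmoid_limiter(llr)       # bounded confidence score
    return (1.0 - weight) * log_lik_phone + weight * confidence


if __name__ == "__main__":
    # A very large ratio no longer dominates the score after limiting.
    print(sigmoid_limiter(0.0), sigmoid_limiter(1000.0), sigmoid_limiter(-1000.0))
    print(hybrid_frame_score(-10.0, -10.0))
```

However large the raw ratio becomes, the limited value stays between 0 and 1, which is the dynamic range property the abstract attributes to the sigmoid limiter. How the paper actually integrates this score into the Viterbi search is not stated in this record.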
Pages: 303-310
Number of pages: 8
Related papers
50 items in total
  • [21] A Study of Speech Emotion Recognition Based on Hybrid Algorithm
    Zhu Ju-xia
    Zhang Chao
    Lv Zhao
    Rao Yao-quan
    Wu Xiao-pei
    INTERNATIONAL CONFERENCE ON GRAPHIC AND IMAGE PROCESSING (ICGIP 2011), 2011, 8285
  • [22] DISCRETE UTTERANCE SPEECH RECOGNITION WITHOUT TIME ALIGNMENT
    SHORE, JE
    BURTON, DK
    IEEE TRANSACTIONS ON INFORMATION THEORY, 1983, 29 (04) : 473 - 491
  • [23] HISTORY UTTERANCE EMBEDDING TRANSFORMER LM FOR SPEECH RECOGNITION
    Deng, Keqi
    Cheng, Gaofeng
    Miao, Haoran
    Zhang, Pengyuan
    Yan, Yonghong
    2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 5914 - 5918
  • [24] A New Hybrid Algorithm for Speech Recognition Based on HMM Segmentation and Learning Vector Quantization
    Katagiri, Shigeru
    Lee, Chin-Hui
    IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 1993, 1 (04) : 421 - 430
  • [25] Response Probability Based Decoding Algorithm for Large Vocabulary Continuous Speech Recognition
    Yang, Zhanlei
    Chao, Hao
    Liu, Wenju
    12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5, 2011, : 1940 - 1943
  • [26] Continuous speech recognition using dynamic Bayesian networks: A fast decoding algorithm
    Deviren, M
    Daoudi, K
    ADVANCES IN BAYESIAN NETWORKS, 2004, 146 : 289 - 308
  • [27] Dynamic classifier combination in hybrid speech recognition systems using utterance-level confidence values
    Kirchhoff, K
    Bilmes, JA
    ICASSP '99: 1999 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, PROCEEDINGS VOLS I-VI, 1999, : 693 - 696
  • [28] Utterance Confidence Measure for End-to-End Speech Recognition with Applications to Distributed Speech Recognition Scenarios
    Kumar, Ankur
    Singh, Sachin
    Gowda, Dhananjaya
    Garg, Abhinav
    Singh, Shatrughan
    Kim, Chanwoo
    INTERSPEECH 2020, 2020, : 4357 - 4361
  • [29] Label Synchronous Decoding for Speech Recognition
    Chen Z.-H.
    Zheng W.-L.
    You Y.-B.
    Qian Y.-M.
    Yu K.
    Jisuanji Xuebao/Chinese Journal of Computers, 2019, 42 (07) : 1511 - 1523
  • [30] A new robust hybrid speech recognition algorithm based on FVQ/HMM and neural nets classification
    Asghar, S
    Cong, L
    INTERNATIONAL CONFERENCE ON PARALLEL AND DISTRIBUTED PROCESSING TECHNIQUES AND APPLICATIONS, VOLS I-IV, PROCEEDINGS, 1998, : 1810 - 1816