A Hybrid HMM/DNN Approach to Keyword Spotting of Short Words

Cited by: 0
Authors
Chen, I-Fan [1 ]
Lee, Chin-Hui [1 ]
Affiliations
[1] Georgia Inst Technol, Sch Elect & Comp Engn, Atlanta, GA 30332 USA
Source
14TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2013), VOLS 1-5 | 2013
Keywords
keyword and filler modeling; keyword detection; utterance verification; deep neural networks; knowledge-based; RECOGNITION; FEATURES;
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Subject classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
An HMM/DNN framework is proposed to address the issues of short-word detection. The first-stage keyword hypothesizer is redesigned with a context-aware keyword model and a 9-state filler model to reduce the miss rate from 80% to 6% and increase the figure-of-merit (FOM) from 6.08% to 21.88% for short words. The hypothesizer is followed by an MLP-based second-stage keyword verifier to further reduce its putative hits. To enhance short-word detection, three new techniques, including an HMM-based feature transformation for the MLPs, knowledge-based features, and deep neural networks, are incorporated into redesigning the verifier. With a set of nine short keywords from the TIMIT set, the best FOM we achieved for the proposed KWS system was 42.79%, which is comparable with that of 42.6% for long content words and much better than the FOM of 18.4% for short keywords reported in previous research [10].
Pages: 1573 - 1577
Number of pages: 5
Related papers
17 in total
  • [1] HYBRID CONTEXT DEPENDENT CD-DNN-HMM KEYWORD SPOTTING (KWS) IN SPEECH CONVERSATIONS
    Tyagi, Vivek
    2016 IEEE 26TH INTERNATIONAL WORKSHOP ON MACHINE LEARNING FOR SIGNAL PROCESSING (MLSP), 2016
  • [2] Hybrid HMM/BLSTM system for multi-script keyword spotting in printed and handwritten documents with identification stage
    Cheikhrouhou, Ahmed
    Kessentini, Yousri
    Kanoun, Slim
    NEURAL COMPUTING & APPLICATIONS, 2020, 32 (13) : 9201 - 9215
  • [3] Forward Hand Gesture Spotting and Prediction Using HMM-DNN Model
    Elmezain, Mahmoud
    Alwateer, Majed M. M.
    El-Agamy, Rasha
    Atlam, Elsayed
    Ibrahim, Hani M. M.
    INFORMATICS-BASEL, 2023, 10 (01)
  • [4] HMM word graph based keyword spotting in handwritten document images
    Toselli, Alejandro Hector
    Vidal, Enrique
    Romero, Veronica
    Frinken, Volkmar
    INFORMATION SCIENCES, 2016, 370 : 497 - 518
  • [5] Keyword spotting in handwritten documents based on a generic text line HMM and a SVM verification
    Kessentini, Yousri
    Paquet, Thierry
    2015 13TH IAPR INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR), 2015, : 41 - 45
  • [6] Two-Stage Approach to Keyword Spotting in Handwritten Documents
    Haji, Mehdi
    Ameri, Mohammad R.
    Bui, Tien D.
    Suen, Ching Y.
    Ponson, Dominique
    DOCUMENT RECOGNITION AND RETRIEVAL XXI, 2014, 9021
  • [7] A Resource-Dependent Approach to Word Modeling for Keyword Spotting
    Chen, I-Fan
    Lee, Chin-Hui
    14TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2013), VOLS 1-5, 2013, : 2543 - 2547
  • [8] On quantifying the quality of acoustic models in hybrid DNN-HMM ASR
    Dighe, Pranay
    Asaei, Afsaneh
    Bourlard, Herve
    SPEECH COMMUNICATION, 2020, 119 : 24 - 35
  • [9] Speaker Adaptive Training Localizing Speaker Modules in DNN for Hybrid DNN-HMM Speech Recognizers
    Ochiai, Tsubasa
    Matsuda, Shigeki
    Watanabe, Hideyuki
    Lu, Xugang
    Hori, Chiori
    Kawai, Hisashi
    Katagiri, Shigeru
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2016, E99D (10) : 2431 - 2443
  • [10] Custom Mandarin Keyword Spotting with Extended Long Short-Term Memory
    Cao, Haitao
    Liu, Xi
    Tan, Zhiguo
    Yang, Zhenlun
    Qin, Xin
    IAENG International Journal of Computer Science, 2024, 51 (12) : 1933 - 1942