Signal modeling for high-performance robust isolated word recognition

Cited by: 10
Authors
Karnjanadecha, M [1]
Zahorian, SA [1]
Affiliation
[1] Old Dominion Univ, Dept Elect & Comp Engn, Norfolk, VA 23529 USA
Source
IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING | 2001 / Vol. 9 / No. 06
Funding
U.S. National Science Foundation;
Keywords
cepstral analysis; discrete cosine transforms; feature extraction; speech analysis; speech recognition;
DOI
10.1109/89.943342
Chinese Library Classification number
O42 [Acoustics];
Discipline classification codes
070206 ; 082403 ;
Abstract
This paper describes speech signal modeling techniques which are well-suited to high-performance and robust isolated word recognition. We present new techniques for incorporating spectral/temporal information as a function of temporal position within each word. In particular, spectral/temporal parameters are computed using both variable-length blocks and variable spacing between blocks. We tested features computed with these methods using an alphabet recognition task based on the ISOLET database. The hidden Markov model toolkit (HTK) was used to implement the isolated word recognizer with whole-word HMM models. The best accuracy achieved for speaker-independent alphabet recognition, using 50 features, was 97.9%, which represents a new benchmark for this task. We also tested these methods with deliberate signal degradation using additive Gaussian noise and telephone band limiting and found that recognition accuracy degrades gracefully and to a smaller degree than for control cases based on MFCC coefficients and delta cepstra terms.
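The abstract does not give the block layout or the transform, so the following is only a minimal sketch of what "variable-length blocks with variable spacing" could look like, not the authors' implementation. It assumes frame-level spectral features are already available as a `(frames, dims)` array, that block length grows linearly across the word, and that each feature trajectory inside a block is summarized by a few cosine-transform terms over time. The function names, the linear growth rule, and all parameter values are hypothetical.

```python
import numpy as np


def dct_basis(length, n_terms):
    """First n_terms DCT-II cosine basis vectors over `length` points."""
    n = np.arange(length)
    k = np.arange(n_terms)[:, None]
    return np.cos(np.pi * k * (2 * n + 1) / (2 * length))  # (n_terms, length)


def block_layout(n_frames, n_blocks, min_len, max_len):
    """Hypothetical layout: block length grows linearly from min_len to
    max_len across the word. The first block starts at frame 0 and the
    last block ends at the final frame, so the spacing between block
    starts varies along with the block length."""
    lengths = np.linspace(min_len, max_len, n_blocks).round().astype(int)
    lengths = np.minimum(lengths, n_frames)
    position = np.linspace(0.0, 1.0, n_blocks)  # relative position in the word
    starts = (position * (n_frames - lengths)).round().astype(int)
    return list(zip(starts, lengths))


def block_features(frames, layout, n_terms):
    """Summarize each block by n_terms temporal cosine-transform terms
    per feature dimension. Returns shape (n_blocks, n_terms * dims)."""
    feats = []
    for start, length in layout:
        block = frames[start:start + length]       # (length, dims)
        basis = dct_basis(length, n_terms)         # (n_terms, length)
        feats.append((basis @ block / length).ravel())
    return np.stack(feats)


# Usage with synthetic data: 60 frames of 10-dimensional features.
rng = np.random.default_rng(0)
frames = rng.standard_normal((60, 10))
layout = block_layout(n_frames=60, n_blocks=5, min_len=6, max_len=20)
features = block_features(frames, layout, n_terms=3)
print(features.shape)
```

Each row of `features` would then serve as one observation vector for the recognizer. Tying block length to position within the word is the part taken from the abstract; the specific growth rule here is an illustrative choice.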
Pages: 647 - 654
Page count: 8