Signal modeling for high-performance robust isolated word recognition

被引:10
|
作者
Karnjanadecha, M [1 ]
Zahorian, SA [1 ]
机构
[1] Old Dominion Univ, Dept Elect & Comp Engn, Norfolk, VA 23529 USA
来源
IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING | 2001年 / 9卷 / 06期
基金
美国国家科学基金会;
关键词
cepstral analysis; discrete cosine transforms; feature extraction; speech analysis; speech recognition;
D O I
10.1109/89.943342
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
This paper describes speech signal modeling techniques which are well-suited to high performance and robust isolated word recognition. We present new techniques for incorporating spectral/temporal information as a function of temporal position within each word. In particular, spectral/temporal parameters are computed using both variable length blocks with a variable spacing between blocks. We tested features computed with these methods using an alphabet recognition task based on the ISOLET database. The hidden Markov model toolkit (HTK) was used to implement the isolated word recognizer with whole word HMM models. The best accuracy achieved for speaker independent alphabet recognition, using 50 features, was 97.9%, which represents a new benchmark for this task. We also tested these methods with deliberate signal degradation using additive Gaussian noise and telephone band limiting and found that the recognition degrades gracefully and to a smaller degree than for control cases based on MFCC coefficients and delta cepstra terms.
引用
收藏
页码:647 / 654
页数:8
相关论文
共 50 条
  • [1] A novel approach to isolated word recognition
    Gülmezoglu, MB
    Dzhafarov, V
    Keskin, M
    Barkana, A
    IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 1999, 7 (06): : 620 - 628
  • [2] An Isolated Word Speaker Recognition System
    Ozaydin, Selma
    2017 INTERNATIONAL CONFERENCE ON ELECTRICAL AND COMPUTING TECHNOLOGIES AND APPLICATIONS (ICECTA), 2017, : 70 - 74
  • [3] BANGLA ISOLATED WORD SPEECH RECOGNITION
    Firoze, Adnan
    Arifin, M. Shamsul
    Quadir, Ryana
    Rahman, Rashedur M.
    ICEIS 2011: PROCEEDINGS OF THE 13TH INTERNATIONAL CONFERENCE ON ENTERPRISE INFORMATION SYSTEMS, VOL 2, 2011, : 73 - 82
  • [4] COMPARISON OF PERFORMANCE BETWEEN NORMAL AND WHISPERED SPEECH IN CHINESE ISOLATED WORD RECOGNITION
    Sha, Jun
    Chen, Xueqin
    Yu, Yibiao
    2014 12TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING (ICSP), 2014, : 545 - 548
  • [5] Performance evaluation of time-delay fuzzy neural networks for isolated word recognition
    Oweiss, K
    Alim, OA
    1ST INTERNATIONAL SYMPOSIUM ON NEURO-FUZZY SYSTEMS - AT'96, CONFERENCE REPORT, 1996, : 203 - 211
  • [6] Genetic time warping for isolated word recognition
    Kwong, S
    He, QH
    Man, KF
    INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 1996, 10 (07) : 849 - 865
  • [7] Isolated word recognition in the Sigma cognitive architecture
    Joshi, Himanshu
    Rosenbloom, Paul S.
    Ustun, Volkan
    BIOLOGICALLY INSPIRED COGNITIVE ARCHITECTURES, 2014, 10 : 1 - 9
  • [8] Development of isolated word speech recognition system
    Lipeika, A
    Lipeikiene, J
    Telksnys, L
    INFORMATICA, 2002, 13 (01) : 37 - 46
  • [9] Isolated Word Recognition using Polynomial Classifier
    Nehe, N. S.
    Holambe, R. S.
    PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON CONTROL AUTOMATION, COMMUNICATION AND ENERGY CONSERVATION INCACEC 2009 VOLUME II, 2009, : 965 - +
  • [10] Isolated word recognition based on PNCC with different classifiers in a noisy environment
    Safi, Mohammed Ehsan
    Abbas, Eyad Ibrahim
    APPLIED ACOUSTICS, 2022, 195