High resolution speech feature parametrization for monophone-based stressed speech recognition

被引:43
|
作者
Sarikaya, R [1 ]
Hansen, JHL [1 ]
机构
[1] Univ Colorado, Ctr Spoken Language Res, Robust Speech Proc Lab, Boulder, CO 80309 USA
关键词
feature extraction; speech recognition; speech under stress; wavelet analysis;
D O I
10.1109/97.847363
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
This letter investigates the impact of stress on monophone speech recognition accuracy and proposes a new set of acoustic parameters based on high resolution wavelet analysis. The two parameter schemes are entitled wavelet packet parameters (WPP) and subband-based cepstral parameters (SBC). The performance of these features is compared to traditional Mel-frequency cepstral coefficients (MFCC) for stressed speech monophone recognition. The stressed speaking styles considered areneutral, angry, loud, and Lombard effect(1) speech from the SUSAS database. An overall monophone recognition improvement of 20.4% and 17.2% is achieved for loud and angry stressed speech, with a corresponding increase in the neutral monophone rate of 9.9% over MFCC parameters.
引用
收藏
页码:182 / 185
页数:4
相关论文
共 50 条
  • [21] On the use of kernel PCA for feature extraction in speech recognition
    Lima, A
    Zen, H
    Nankaku, Y
    Miyajima, C
    Tokuda, K
    Kitamura, T
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2004, E87D (12) : 2802 - 2811
  • [22] HMM-based stressed speech modeling with application to improved synthesis and recognition of isolated speech under stress
    Bou-Ghazale, SE
    Hansen, JHL
    IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 1998, 6 (03): : 201 - 216
  • [23] Robust Feature Extraction for Speech Recognition Based on Perceptually Motivated MUSIC and CCBC
    Han Zhiyan
    Wang Jian
    Wang Xu
    Lun Shuxian
    CHINESE JOURNAL OF ELECTRONICS, 2011, 20 (01): : 105 - 110
  • [24] Speech enhancement method based on feature compensation gain for effective speech recognition in noisy environments
    Bae, Ara
    Kim, Wooil
    JOURNAL OF THE ACOUSTICAL SOCIETY OF KOREA, 2019, 38 (01): : 51 - 55
  • [25] A feature-based hierarchical speech recognition system for Hindi
    K Samudravijaya
    R Ahuja
    N Bondale
    T Jose
    S Krishnan
    P Poddar
    xxPVS Rao
    R Raveendran
    Sadhana, 1998, 23 : 313 - 340
  • [26] Speech Recognition Based on Concatenated Acoustic Feature and LightGBM Model
    Yu, Jiali
    Qu, Yuanyuan
    Zhang, Zhongkai
    Lu, Qidong
    Qin, Zhiliang
    Liu, Xiaowei
    TWELFTH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING SYSTEMS, 2021, 11719
  • [27] Applying sparse KPCA for feature extraction in speech recognition
    Lima, A
    Zen, H
    Nankaku, Y
    Tokuda, K
    Kitamura, T
    Resende, FG
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2005, E88D (03): : 401 - 409
  • [28] Discriminative temporal feature extraction for robust speech recognition
    Shen, JL
    ELECTRONICS LETTERS, 1997, 33 (19) : 1598 - 1600
  • [29] Word graph based feature enhancement for noisy speech recognition
    Yan, Zhi-Jie
    Soong, Frank K.
    Wang, Ren-Hua
    2007 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL IV, PTS 1-3, 2007, : 373 - +
  • [30] A new weighted feature approach based on GA for speech recognition
    Ongkowijaya, BT
    Zhu, XY
    2004 7TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING PROCEEDINGS, VOLS 1-3, 2004, : 663 - 666