High resolution speech feature parametrization for monophone-based stressed speech recognition

被引：43

作者：

Sarikaya, R ^{[1
]}

Hansen, JHL ^{[1
]}

机构：

[1] Univ Colorado, Ctr Spoken Language Res, Robust Speech Proc Lab, Boulder, CO 80309 USA

来源：

IEEE SIGNAL PROCESSING LETTERS | 2000年 / 7卷 / 07期

关键词：

feature extraction; speech recognition; speech under stress; wavelet analysis;

D O I：

10.1109/97.847363

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

This letter investigates the impact of stress on monophone speech recognition accuracy and proposes a new set of acoustic parameters based on high resolution wavelet analysis. The two parameter schemes are entitled wavelet packet parameters (WPP) and subband-based cepstral parameters (SBC). The performance of these features is compared to traditional Mel-frequency cepstral coefficients (MFCC) for stressed speech monophone recognition. The stressed speaking styles considered areneutral, angry, loud, and Lombard effect(1) speech from the SUSAS database. An overall monophone recognition improvement of 20.4% and 17.2% is achieved for loud and angry stressed speech, with a corresponding increase in the neutral monophone rate of 9.9% over MFCC parameters.

引用

页码：182 / 185

页数：4

共 50 条

[21] On the use of kernel PCA for feature extraction in speech recognition
Lima, A
Zen, H
Nankaku, Y
Miyajima, C
Tokuda, K
Kitamura, T
IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2004, E87D (12) : 2802 - 2811
[22] HMM-based stressed speech modeling with application to improved synthesis and recognition of isolated speech under stress
Bou-Ghazale, SE
Hansen, JHL
IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 1998, 6 (03): : 201 - 216
[23] Robust Feature Extraction for Speech Recognition Based on Perceptually Motivated MUSIC and CCBC
Han Zhiyan
Wang Jian
Wang Xu
Lun Shuxian
CHINESE JOURNAL OF ELECTRONICS, 2011, 20 (01): : 105 - 110
[24] Speech enhancement method based on feature compensation gain for effective speech recognition in noisy environments
Bae, Ara
Kim, Wooil
JOURNAL OF THE ACOUSTICAL SOCIETY OF KOREA, 2019, 38 (01): : 51 - 55
[25] A feature-based hierarchical speech recognition system for Hindi
K Samudravijaya
R Ahuja
N Bondale
T Jose
S Krishnan
P Poddar
xxPVS Rao
R Raveendran
Sadhana, 1998, 23 : 313 - 340
[26] Speech Recognition Based on Concatenated Acoustic Feature and LightGBM Model
Yu, Jiali
Qu, Yuanyuan
Zhang, Zhongkai
Lu, Qidong
Qin, Zhiliang
Liu, Xiaowei
TWELFTH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING SYSTEMS, 2021, 11719
[27] Applying sparse KPCA for feature extraction in speech recognition
Lima, A
Zen, H
Nankaku, Y
Tokuda, K
Kitamura, T
Resende, FG
IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2005, E88D (03): : 401 - 409
[28] Discriminative temporal feature extraction for robust speech recognition
Shen, JL
ELECTRONICS LETTERS, 1997, 33 (19) : 1598 - 1600
[29] Word graph based feature enhancement for noisy speech recognition
Yan, Zhi-Jie
Soong, Frank K.
Wang, Ren-Hua
2007 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL IV, PTS 1-3, 2007, : 373 - +
[30] A new weighted feature approach based on GA for speech recognition
Ongkowijaya, BT
Zhu, XY
2004 7TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING PROCEEDINGS, VOLS 1-3, 2004, : 663 - 666

← 1 2 3 4 5 →