Robust Voiced/Unvoiced Speech Classification using Empirical Mode Decomposition and Periodic Correlation Model

Citations: 0
Authors
Molla, Md. Khademul Islam [1 ]
Hirose, Keikichi [1 ]
Minematsu, Nobuaki [2 ]
Affiliations
[1] Univ Tokyo, Grad Sch Informat Sci & Technol, Bunkyo Ku, 7-3-1 Hongo, Tokyo 1130033, Japan
[2] Univ Tokyo, Grad Sch Engn, Bunkyo Ku, Tokyo 1130033, Japan
Source
INTERSPEECH 2008: 9TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2008, VOLS 1-5 | 2008
Keywords
empirical mode decomposition; normalized autocorrelation; periodic correlation; voiced/unvoiced speech;
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
This paper presents a method for voiced/unvoiced (V/Uv) classification of noisy speech signals. Empirical mode decomposition (EMD), a recently developed tool for analyzing nonlinear and non-stationary signals, is used to filter the additive noise from the speech signal. The normalized autocorrelation of the filtered speech signal is then computed to enhance the periodicity, if any. The voiced speech signal is assumed to be periodically correlated and the unvoiced signal is not. A statistical model for detecting periodic correlation is used to differentiate voiced from unvoiced speech at low SNR. The experimental results show that the use of EMD improves the classification performance and that the overall efficiency is notable compared with other existing algorithms.
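The normalized autocorrelation step mentioned in the abstract can be illustrated with a minimal Python sketch. This is not the authors' implementation: the EMD filtering stage is omitted, and the simple peak-based `periodicity_score`, the pitch range of 60 to 400 Hz, the 8 kHz sampling rate, and the synthetic test frames are all assumptions made for illustration. In particular, the peak score stands in for the paper's statistical periodic correlation model, which the abstract does not specify.

```python
import numpy as np

def normalized_autocorrelation(frame):
    """Autocorrelation of `frame` for lags 0..N-1, scaled so that lag 0 equals 1."""
    x = np.asarray(frame, dtype=float)
    x = x - x.mean()                        # remove DC offset before correlating
    full = np.correlate(x, x, mode="full")  # lags -(N-1)..(N-1)
    acf = full[len(x) - 1:]                 # keep non-negative lags only
    if acf[0] == 0.0:                       # all-zero (silent) frame
        return np.zeros_like(acf)
    return acf / acf[0]

def periodicity_score(frame, fs, f_min=60.0, f_max=400.0):
    """Largest normalized autocorrelation value inside an assumed pitch-lag range.

    Hypothetical helper: a crude stand-in for a statistical test of
    periodic correlation, not the model used in the paper.
    """
    acf = normalized_autocorrelation(frame)
    lo = int(fs / f_max)
    hi = min(int(fs / f_min), len(acf) - 1)
    return float(acf[lo:hi + 1].max())

# Synthetic 40 ms frames at an assumed 8 kHz sampling rate.
fs = 8000
t = np.arange(int(0.04 * fs)) / fs
rng = np.random.default_rng(0)
voiced_like = np.sin(2 * np.pi * 120 * t) + 0.3 * rng.standard_normal(t.size)
unvoiced_like = rng.standard_normal(t.size)

voiced_score = periodicity_score(voiced_like, fs)
unvoiced_score = periodicity_score(unvoiced_like, fs)
print(f"voiced-like: {voiced_score:.2f}, unvoiced-like: {unvoiced_score:.2f}")
```

A periodic frame produces a strong autocorrelation peak near its pitch lag, while a noise-like frame does not, which is the property the abstract relies on when it treats voiced speech as periodically correlated.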
Pages: 2530 / +
Page count: 2
Related papers
15 in total
[1]   Cepstrum-based pitch detection using a new statistical V/UV classification algorithm [J].
Ahmadi, S ;
Spanias, AS .
IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 1999, 7 (03) :333-338
[2]   Fast HOS based simultaneous voiced/unvoiced detection and pitch estimation using 3-level binary speech signals [J].
Alkulaibi, A ;
Soraghan, JJ ;
Durrani, TS .
8TH IEEE SIGNAL PROCESSING WORKSHOP ON STATISTICAL SIGNAL AND ARRAY PROCESSING, PROCEEDINGS, 1996, :194-197
[3]   PATTERN-RECOGNITION APPROACH TO VOICED UNVOICED SILENCE CLASSIFICATION WITH APPLICATIONS TO SPEECH RECOGNITION [J].
ATAL, BS ;
RABINER, LR .
IEEE TRANSACTIONS ON ACOUSTICS SPEECH AND SIGNAL PROCESSING, 1976, 24 (03) :201-212
[4]  
BROSZKIEWICZSUW.E, 2003, HSC0302
[5]   Empirical mode decomposition as a filter bank [J].
Flandrin, P ;
Rilling, G ;
Gonçalvés, P .
IEEE SIGNAL PROCESSING LETTERS, 2004, 11 (02) :112-114
[6]  
Gardner WA, 1994, CYCLOSTATIONARITY CO
[7]  
Giridharan K., 2004, P ISPACS
[8]   The empirical mode decomposition and the Hilbert spectrum for nonlinear and non-stationary time series analysis [J].
Huang, NE ;
Shen, Z ;
Long, SR ;
Wu, MLC ;
Shih, HH ;
Zheng, QN ;
Yen, NC ;
Tung, CC ;
Liu, HH .
PROCEEDINGS OF THE ROYAL SOCIETY A-MATHEMATICAL PHYSICAL AND ENGINEERING SCIENCES, 1998, 454 (1971) :903-995
[9]  
HURD HL, 1991, J TIME SER ANAL, V15, P337
[10]  
Janer L., 1996, P ICSLP