A SHORT-TIME OBJECTIVE INTELLIGIBILITY MEASURE FOR TIME-FREQUENCY WEIGHTED NOISY SPEECH

被引:766
作者
Taal, Cees H. [1 ]
Hendriks, Richard C. [1 ]
Heusdens, Richard [1 ]
Jensen, Jesper [2 ]
机构
[1] Delft Univ Technol, Signal Informat & Proc Lab, NL-2628 CD Delft, Netherlands
[2] Oticon A S, DK-2765 Smorum, Denmark
来源
2010 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING | 2010年
关键词
intelligibility prediction; speech enhancement; noisy speech;
D O I
10.1109/ICASSP.2010.5495701
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Existing objective speech-intelligibility measures are suitable for several types of degradation, however, it turns out that they are less appropriate for methods where noisy speech is processed by a time-frequency (TF) weighting, e. g., noise reduction and speech separation. In this paper, we present an objective intelligibility measure, which shows high correlation (rho=0.95) with the intelligibility of both noisy, and TF-weighted noisy speech. The proposed method shows significantly better performance than three other, more sophisticated, objective measures. Furthermore, it is based on an intermediate intelligibility measure for short-time (approximately 400 ms) TF-regions, and uses a simple DFT-based TF-decomposition. In addition, a free Matlab implementation is provided.
引用
收藏
页码:4214 / 4217
页数:4
相关论文
共 18 条
[1]  
[Anonymous], P INTERSPEECH
[2]  
ANSI (American National Standards Institute), 1997, S351997 ANSI
[3]  
Boldt J. B., 2009, P EUSIPCO, P1849
[4]   Isolating the energetic com ponent of speech-on-speech masking with ideal time-frequency segregation [J].
Brungart, Douglas S. ;
Chang, Peter S. ;
Simpson, Brian D. ;
Wang, DeLiang .
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2006, 120 (06) :4007-4018
[5]   A quantitative model of the ''effective'' signal processing in the auditory system .1. Model structure [J].
Dau, T ;
Puschel, D ;
Kohlrausch, A .
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1996, 99 (06) :3615-3622
[6]   SPEECH ENHANCEMENT USING A MINIMUM MEAN-SQUARE ERROR SHORT-TIME SPECTRAL AMPLITUDE ESTIMATOR [J].
EPHRAIM, Y ;
MALAH, D .
IEEE TRANSACTIONS ON ACOUSTICS SPEECH AND SIGNAL PROCESSING, 1984, 32 (06) :1109-1121
[7]   FACTORS GOVERNING THE INTELLIGIBILITY OF SPEECH SOUNDS [J].
FRENCH, NR ;
STEINBERG, JC .
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1947, 19 (01) :90-119
[8]   Analysis of speech-based speech transmission index methods with implications for nonlinear operations [J].
Goldsworthy, RL ;
Greenberg, JE .
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2004, 116 (06) :3679-3689
[9]   Coherence and the speech intelligibility index [J].
Kates, JM ;
Arehart, KH .
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2005, 117 (04) :2224-2237
[10]   Role of mask pattern in intelligibility of ideal binary-masked noisy speech [J].
Kjems, Ulrik ;
Boldt, Jesper B. ;
Pedersen, Michael S. ;
Lunner, Thomas ;
Wang, DeLiang .
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2009, 126 (03) :1415-1426