Application of a short-time version of the Equalization-Cancellation model to speech intelligibility experiments with speech maskers

被引:33
作者
Wan, Rui
Durlach, Nathaniel I.
Colburn, H. Steven [1 ]
机构
[1] Boston Univ, Hearing Res Ctr, Boston, MA 02215 USA
关键词
RECEPTION THRESHOLD; INFORMATIONAL MASKING; FLUCTUATING NOISE; SPATIAL RELEASE; NORMAL-HEARING; PREDICTION; INDEX; GAIN;
D O I
10.1121/1.4884767
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
A short-time-processing version of the Equalization-Cancellation (EC) model of binaural processing is described and applied to speech intelligibility tasks in the presence of multiple maskers, including multiple speech maskers. This short-time EC model, called the STEC model, extends the model described by Wan et al. [J. Acoust. Soc. Am. 128, 3678-3690 (2010)] to allow the EC model's equalization parameters s and a to be adjusted as a function of time, resulting in improved masker cancellation when the dominant masker location varies in time. Using the Speech Intelligibility Index, the STEC model is applied to speech intelligibility with maskers that vary in number, type, and spatial arrangements. Most notably, when maskers are located on opposite sides of the target, this STEC model predicts improved thresholds when the maskers are modulated independently with speech-envelope modulators; this includes the most relevant case of independent speech maskers. The STEC model describes the spatial dependence of the speech reception threshold with speech maskers better than the steady-state model. Predictions are also improved for independently speech-modulated noise maskers but are poorer for reversed-speech maskers. In general, short-term processing is useful, but much remains to be done in the complex task of understanding speech in speech maskers. (C) 2014 Acoustical Society of America.
引用
收藏
页码:768 / 776
页数:9
相关论文
共 33 条
[1]  
[Anonymous], 1969, IEEE T ACOUST SPEECH, VAU17, P225
[2]  
ANSI, 1997, S3 5 AM NAT STAND ME
[3]  
ANSI, 1969, S3 5 AM NAT STAND ME
[4]   The influence of non-spatial factors on measures of spatial release from masking [J].
Best, Virginia ;
Marrone, Nicole ;
Mason, Christine R. ;
Kidd, Gerald, Jr. .
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2012, 131 (04) :3103-3110
[5]   Prediction of speech intelligibility in spatial noise and reverberation for normal-hearing and hearing-impaired listeners [J].
Beutelmann, Rainer ;
Brand, Thomas .
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2006, 120 (01) :331-342
[6]   Revision, extension, and evaluation of a binaural speech intelligibility model [J].
Beutelmann, Rainer ;
Brand, Thomas ;
Kollmeier, Birger .
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2010, 127 (04) :2479-2497
[7]   Prediction of binaural speech intelligibility with frequency-dependent interaural phase differences [J].
Beutelmann, Rainer ;
Brand, Thomas ;
Kollmeier, Birger .
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2009, 126 (03) :1359-1368
[8]   A speech corpus for multitalker communications research [J].
Bolia, RS ;
Nelson, WT ;
Ericson, MA ;
Simpson, BD .
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2000, 107 (02) :1065-1066
[9]   Better-ear glimpsing efficiency with symmetrically-placed interfering talkers [J].
Brungart, Douglas S. ;
Iyer, Nandini .
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2012, 132 (04) :2545-2556
[10]   Informational and energetic masking effects in the perception of multiple simultaneous talkers [J].
Brungart, DS ;
Simpson, BD ;
Ericson, MA ;
Scott, KR .
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2001, 110 (05) :2527-2538