Modeling Sluggishness in Binaural Unmasking of Speech for Maskers With Time-Varying Interaural Phase Differences

被引:23
作者
Hauth, Christopher F. [1 ,2 ]
Brand, Thomas [1 ,2 ]
机构
[1] Carl von Ossietzky Univ Oldenburg, Med Phys, Oldenburg, Germany
[2] Carl von Ossietzky Univ Oldenburg, Cluster Excellence Hearing4All, Oldenburg, Germany
关键词
speech reception thresholds; binaural; auditory model; interaural phase difference; binaural sluggishness; MASKING-LEVEL DIFFERENCES; NORMAL-HEARING; NOISE SOURCES; INTELLIGIBILITY; PREDICTION; SIGNAL; EQUALIZATION; INTENSITY; THRESHOLD; LISTENERS;
D O I
10.1177/2331216517753547
中图分类号
R36 [病理学]; R76 [耳鼻咽喉科学];
学科分类号
100104 ; 100213 ;
摘要
In studies investigating binaural processing in human listeners, relatively long and task-dependent time constants of a binaural window ranging from 10 ms to 250 ms have been observed. Such time constants are often thought to reflect "binaural sluggishness.'' In this study, the effect of binaural sluggishness on binaural unmasking of speech in stationary speech-shaped noise is investigated in 10 listeners with normal hearing. In order to design a masking signal with temporally varying binaural cues, the interaural phase difference of the noise was modulated sinusoidally with frequencies ranging from 0.25 Hz to 64 Hz. The lowest, that is the best, speech reception thresholds (SRTs) were observed for the lowest modulation frequency. SRTs increased with increasing modulation frequency up to 4 Hz. For higher modulation frequencies, SRTs remained constant in the range of 1 dB to 1.5 dB below the SRT determined in the diotic situation. The outcome of the experiment was simulated using a short-term binaural speech intelligibility model, which combines an equalization-cancellation (EC) model with the speech intelligibility index. This model segments the incoming signal into 23.2-ms time frames in order to predict release from masking in modulated noises. In order to predict the results from this study, the model required a further time constant applied to the EC mechanism representing binaural sluggishness. The best agreement with perceptual data was achieved using a temporal window of 200 ms in the EC mechanism.
引用
收藏
页数:10
相关论文
共 32 条
[1]   The variation across time of sensitivity to interaural disparities: Behavioral measurements and quantitative analyses [J].
Akeroyd, MA ;
Bernstein, LR .
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2001, 110 (05) :2516-2526
[2]   A binaural analog of gap detection [J].
Akeroyd, MA ;
Summerfield, AQ .
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1999, 105 (05) :2807-2820
[3]   Predicting the Intelligibility of Noisy and Nonlinearly Processed Binaural Speech [J].
Andersen, Asger Heidemann ;
de Haan, Jan Mark ;
Tan, Zheng-Hua ;
Jensen, Jesper .
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2016, 24 (11) :1908-1920
[4]  
[Anonymous], 1997, Methods for Calculation of the Speech Intelligibility Index
[5]   Sensitivity to brief changes of interaural time and interaural intensity [J].
Bernstein, LR ;
Trahiotis, C ;
Akeroyd, MA ;
Hartung, K .
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2001, 109 (04) :1604-1615
[6]   Prediction of speech intelligibility in spatial noise and reverberation for normal-hearing and hearing-impaired listeners [J].
Beutelmann, Rainer ;
Brand, Thomas .
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2006, 120 (01) :331-342
[7]   Revision, extension, and evaluation of a binaural speech intelligibility model [J].
Beutelmann, Rainer ;
Brand, Thomas ;
Kollmeier, Birger .
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2010, 127 (04) :2479-2497
[8]   Prediction of binaural speech intelligibility with frequency-dependent interaural phase differences [J].
Beutelmann, Rainer ;
Brand, Thomas ;
Kollmeier, Birger .
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2009, 126 (03) :1359-1368
[9]   Efficient adaptive procedures for threshold and concurrent slope estimates for psychophysics and speech intelligibility tests [J].
Brand, T ;
Kollmeier, B .
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2002, 111 (06) :2801-2810
[10]  
Bronkhorst AW, 2000, ACUSTICA, V86, P117