A computer model of auditory efferent suppression: Implications for the recognition of speech in noise

被引:59
作者
Brown, Guy J. [1 ]
Ferry, Robert T. [2 ]
Meddis, Ray [2 ]
机构
[1] Univ Sheffield, Dept Comp Sci, Sheffield S1 4DP, S Yorkshire, England
[2] Univ Essex, Dept Psychol, Colchester CO4 3SQ, Essex, England
基金
英国工程与自然科学研究理事会;
关键词
acoustic noise; biomembranes; ear; hearing; neurophysiology; physiological models; speech intelligibility; speech recognition; HIDDEN MARKOV-MODELS; INNER-HAIR CELL; OLIVOCOCHLEAR BUNDLE; ELECTRICAL-STIMULATION; NERVE FIBERS; INTELLIGIBILITY; DISCRIMINATION; MODERATE; PERCEPTION; RESPONSES;
D O I
10.1121/1.3273893
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
The neural mechanisms underlying the ability of human listeners to recognize speech in the presence of background noise are still imperfectly understood. However, there is mounting evidence that the medial olivocochlear system plays an important role, via efferents that exert a suppressive effect on the response of the basilar membrane. The current paper presents a computer modeling study that investigates the possible role of this activity on speech intelligibility in noise. A model of auditory efferent processing [Ferry, R. T., and Meddis, R. (2007). J. Acoust. Soc. Am. 122, 3519-3526] is used to provide acoustic features for a statistical automatic speech recognition system, thus allowing the effects of efferent activity on speech intelligibility to be quantified. Performance of the "basic" model (without efferent activity) on a connected digit recognition task is good when the speech is uncorrupted by noise but falls when noise is present. However, recognition performance is much improved when efferent activity is applied. Furthermore, optimal performance is obtained when the amount of efferent activity is proportional to the noise level. The results obtained are consistent with the suggestion that efferent suppression causes a "release from adaptation" in the auditory-nerve response to noisy speech, which enhances its intelligibility.
引用
收藏
页码:943 / 954
页数:12
相关论文
共 50 条
[1]  
[Anonymous], 1977, DISCRETE TIME SIGNAL
[2]   A glimpsing model of speech perception in noise [J].
Cooke, M .
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2006, 119 (03) :1562-1573
[3]   Medial olivocochlear efferent effects on basilar membrane responses to sound [J].
Cooper, N. P. ;
Guinan, J. J., Jr. .
AUDITORY MECHANISMS: PROCESSES AND MODELS, 2006, :86-92
[4]   Noise robust speech recognition using feature compensation based on polynomial fly regression of utterance SNR [J].
Cui, XD ;
Alwan, A .
IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 2005, 13 (06) :1161-1172
[5]  
DALLOS P, 1992, J NEUROSCI, V12, P4575
[7]   MASKED COCHLEAR WHOLE-NERVE RESPONSE INTENSITY FUNCTIONS ALTERED BY ELECTRICAL-STIMULATION OF THE CROSSED OLIVOCOCHLEAR BUNDLE [J].
DOLAN, DF ;
NUTTALL, AL .
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1988, 83 (03) :1081-1086
[8]   Frequency-dependent enhancement of basilar membrane velocity during olivocochlear bundle stimulation [J].
Dolan, DF ;
Guo, MH ;
Nuttall, AL .
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1997, 102 (06) :3587-3596
[9]   A computer model of medial efferent suppression in the mammalian auditory system [J].
Ferry, Robert T. ;
Meddis, Ray .
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2007, 122 (06) :3519-3526
[10]  
FERRY RT, 2008, THESIS U ESSEX COLCH