Level-Dependent Changes in Perception of Speech Envelope Cues

被引:7
作者
Dubno, Judy R. [1 ]
Ahlstrom, Jayne B. [1 ]
Wang, Xin [1 ]
Horwitz, Amy R. [1 ]
机构
[1] Med Univ S Carolina, Dept Otolaryngol Head & Neck Surg, Charleston, SC 29425 USA
来源
JARO-JOURNAL OF THE ASSOCIATION FOR RESEARCH IN OTOLARYNGOLOGY | 2012年 / 13卷 / 06期
基金
美国国家卫生研究院;
关键词
basilar-membrane responses; compression; human; speech envelope; vocoder; STEADY-STATE VOWELS; THAN-NORMAL LEVELS; NORMAL-HEARING; AUDITORY-NERVE; FREQUENCY-SELECTIVITY; TEMPORAL INTEGRATION; OLIVOCOCHLEAR REFLEX; CONSONANT CONFUSIONS; DISCHARGE PATTERNS; WORD RECOGNITION;
D O I
10.1007/s10162-012-0343-2
中图分类号
Q189 [神经科学];
学科分类号
071006 ;
摘要
Level-dependent changes in temporal envelope fluctuations in speech and related changes in speech recognition may reveal effects of basilar-membrane nonlinearities. As a result of compression in the basilar-membrane response, the "effective" magnitude of envelope fluctuations may be reduced as speech level increases from lower level (more linear) to mid-level (more compressive) regions. With further increases to a more linear region, speech envelope fluctuations may become more pronounced. To assess these effects, recognition of consonants and key words in sentences was measured as a function of speech level for younger adults with normal hearing. Consonant-vowel syllables and sentences were spectrally degraded using "noise vocoder" processing to maximize perceptual effects of changes to the speech envelope. Broadband noise at a fixed signal-to-noise ratio maintained constant audibility as speech level increased. Results revealed significant increases in scores and envelope-dependent feature transmission from 45 to 60 dB SPL and decreasing scores and feature transmission from 60 to 85 dB SPL. This quadratic pattern, with speech recognition maximized at mid levels and poorer at lower and higher levels, is consistent with a role of cochlear nonlinearities in perception of speech envelope cues.
引用
收藏
页码:835 / 852
页数:18
相关论文
共 64 条
[1]   Detection of high-frequency spectral notches as a function of level [J].
Alves-Pinto, A ;
Lopez-Poveda, EA .
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2005, 118 (04) :2458-2469
[2]  
[Anonymous], 2004, AM NAT STAND I S3 6
[3]  
[Anonymous], 2005, GUIDELINES MANUAL PU
[4]   Speech recognition in normal hearing and sensorineural hearing loss as a function of the number of spectral channels [J].
Baskent, Deniz .
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2006, 120 (05) :2908-2925
[5]  
Bess F H, 1979, Am J Otol, V1, P27
[6]   Modeling the Anti-masking Effects of the Olivocochlear Reflex in Auditory Nerve Responses to Tones in Sustained Noise [J].
Chintanpalli, Ananthakrishna ;
Jennings, Skyler G. ;
Heinz, Michael G. ;
Strickland, Elizabeth A. .
JARO-JOURNAL OF THE ASSOCIATION FOR RESEARCH IN OTOLARYNGOLOGY, 2012, 13 (02) :219-235
[7]   Lexical information drives; Perceptual learning of distorted speech: Evidence from the comprehension of noise-vocoded sentences [J].
Davis, MH ;
Johnsrude, IS ;
Hervais-Adelman, A ;
Taylor, K ;
McGettigan, C .
JOURNAL OF EXPERIMENTAL PSYCHOLOGY-GENERAL, 2005, 134 (02) :222-241
[8]   REPRESENTATION OF SPEECH-LIKE SOUNDS IN THE DISCHARGE PATTERNS OF AUDITORY-NERVE FIBERS [J].
DELGUTTE, B .
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1980, 68 (03) :843-857
[9]  
DELGUTTE B, 1995, AUDITORY COMPUTATION, P157
[10]   USE OF PERFORMANCE-INTENSITY FUNCTIONS FOR DIAGNOSIS [J].
DIRKS, DD ;
KAMM, C ;
BOWER, D ;
BETSWORTH, A .
JOURNAL OF SPEECH AND HEARING DISORDERS, 1977, 42 (03) :408-415