Adaptive Temporal Encoding Leads to a Background-Insensitive Cortical Representation of Speech

被引:242
作者
Ding, Nai [1 ]
Simon, Jonathan Z. [1 ,2 ]
机构
[1] Univ Maryland, Dept Elect & Comp Engn, College Pk, MD 20742 USA
[2] Univ Maryland, Dept Biol, College Pk, MD 20742 USA
关键词
SYLLABLE-CENTRIC PERSPECTIVE; HUMAN AUDITORY-CORTEX; NEURONAL OSCILLATIONS; NATURAL SOUNDS; GAIN-CONTROL; NOISE; RESPONSES; PERCEPTION; ADAPTATION; INTELLIGIBILITY;
D O I
10.1523/JNEUROSCI.5297-12.2013
中图分类号
Q189 [神经科学];
学科分类号
071006 ;
摘要
Speech recognition is remarkably robust to the listening background, even when the energy of background sounds strongly overlaps with that of speech. How the brain transforms the corrupted acoustic signal into a reliable neural representation suitable for speech recognition, however, remains elusive. Here, we hypothesize that this transformation is performed at the level of auditory cortex through adaptive neural encoding, and we test the hypothesis by recording, using MEG, the neural responses of human subjects listening to a narrated story. Spectrally matched stationary noise, which has maximal acoustic overlap with the speech, is mixed in at various intensity levels. Despite the severe acoustic interference caused by this noise, it is here demonstrated that low-frequency auditory cortical activity is reliably synchronized to the slow temporal modulations of speech, even when the noise is twice as strong as the speech. Such a reliable neural representation is maintained by intensity contrast gain control and by adaptive processing of temporal modulations at different time scales, corresponding to the neural delta and theta bands. Critically, the precision of this neural synchronization predicts how well a listener can recognize speech in noise, indicating that the precision of the auditory cortical representation limits the performance of speech recognition in noise. Together, these results suggest that, in a complex listening environment, auditory cortex can selectively encode a speech stream in a background insensitive manner, and this stable neural representation of speech provides a plausible basis for background-invariant recognition of speech.
引用
收藏
页码:5728 / 5735
页数:8
相关论文
共 52 条
[1]   Neural Timing Is Linked to Speech Perception in Noise [J].
Anderson, Samira ;
Skoe, Erika ;
Chandrasekaran, Bharath ;
Kraus, Nina .
JOURNAL OF NEUROSCIENCE, 2010, 30 (14) :4922-4926
[2]  
Bar-Yosef O, 2007, FRONT COMPUT NEUROSC, V1, DOI [10.3389/neuro.10/003.2007, 10.3389/neuro.10.003.2007]
[3]   Human evoked cortical activity to signal-to-noise ratio and absolute signal level [J].
Billings, Curtis J. ;
Tremblay, Kelly L. ;
Stecker, G. Christopher ;
Tolin, Wendy M. .
HEARING RESEARCH, 2009, 254 (1-2) :15-24
[4]   Informational and energetic masking effects in the perception of two simultaneous talkers [J].
Brungart, DS .
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2001, 109 (03) :1101-1109
[5]   Neurobiologic responses to speech in noise in children with learning problems: deficits and strategies for improvement [J].
Cunningham, J ;
Nicol, T ;
Zecker, SG ;
Bradlow, A ;
Kraus, N .
CLINICAL NEUROPHYSIOLOGY, 2001, 112 (05) :758-767
[6]   Estimating sparse spectro-temporal receptive fields with natural stimuli [J].
David, Stephen V. ;
Mesgarani, Nima ;
Shamma, Shihab A. .
NETWORK-COMPUTATION IN NEURAL SYSTEMS, 2007, 18 (03) :191-212
[7]   Denoising based on spatial filtering [J].
de Cheveigne, Alain ;
Simon, Jonathan Z. .
JOURNAL OF NEUROSCIENCE METHODS, 2008, 171 (02) :331-339
[8]   Neural population coding of sound level adapts to stimulus statistics [J].
Dean, I ;
Harper, NS ;
McAlpine, D .
NATURE NEUROSCIENCE, 2005, 8 (12) :1684-1689
[9]   REPRESENTATION OF SPEECH-LIKE SOUNDS IN THE DISCHARGE PATTERNS OF AUDITORY-NERVE FIBERS [J].
DELGUTTE, B .
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1980, 68 (03) :843-857
[10]   Emergence of neural encoding of auditory objects while listening to competing speakers [J].
Ding, Nai ;
Simon, Jonathan Z. .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2012, 109 (29) :11854-11859