Emergence of neural encoding of auditory objects while listening to competing speakers

被引:571
作者
Ding, Nai [1 ]
Simon, Jonathan Z. [1 ,2 ]
机构
[1] Univ Maryland, Dept Elect & Comp Engn, College Pk, MD 20742 USA
[2] Univ Maryland, Dept Biol, College Pk, MD 20742 USA
关键词
spectrotemporal response function; reverse correlation; phase locking; selective attention; CORTICAL REPRESENTATION; SPEECH-PERCEPTION; PLANUM TEMPORALE; COCKTAIL PARTY; CORTEX; ATTENTION; ORGANIZATION; MECHANISMS; PATTERNS; MASKING;
D O I
10.1073/pnas.1205381109
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
A visual scene is perceived in terms of visual objects. Similar ideas have been proposed for the analogous case of auditory scene analysis, although their hypothesized neural underpinnings have not yet been established. Here, we address this question by recording from subjects selectively listening to one of two competing speakers, either of different or the same sex, using magnetoencephalography. Individual neural representations are seen for the speech of the two speakers, with each being selectively phase locked to the rhythm of the corresponding speech stream and from which can be exclusively reconstructed the temporal envelope of that speech stream. The neural representation of the attended speech dominates responses (with latency near 100 ms) in posterior auditory cortex. Furthermore, when the intensity of the attended and background speakers is separately varied over an 8-dB range, the neural representation of the attended speech adapts only to the intensity of that speaker but not to the intensity of the background speaker, suggesting an object-level intensity gain control. In summary, these results indicate that concurrent auditory objects, even if spectrotemporally overlapping and not resolvable at the auditory periphery, are neurally encoded individually in auditory cortex and emerge as fundamental representational units for top-down attentional modulation and bottom-up neural adaptation.
引用
收藏
页码:11854 / 11859
页数:6
相关论文
共 46 条
[1]   Right-hemisphere auditory cortex is dominant for coding syllable patterns in speech [J].
Abrams, Daniel A. ;
Nicol, Trent ;
Zecker, Steven ;
Kraus, Nina .
JOURNAL OF NEUROSCIENCE, 2008, 28 (15) :3958-3965
[2]   Speech comprehension is correlated with temporal response patterns recorded from auditory cortex [J].
Ahissar, E ;
Nagarajan, S ;
Ahissar, M ;
Protopapas, A ;
Mahncke, H ;
Merzenich, MM .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2001, 98 (23) :13367-13372
[3]   Attention-driven auditory cortex short-term plasticity helps segregate relevant sounds from noise [J].
Ahveninen, Jyrki ;
Haemaelaeinen, Matti ;
Jaaskelainen, Iiro P. ;
Ahlfors, Seppo P. ;
Huang, Samantha ;
Lin, Fa-Hsuan ;
Raij, Tommi ;
Sams, Mikko ;
Vasios, Christos E. ;
Belliveau, John W. .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2011, 108 (10) :4182-4187
[4]  
Bregman A., 1990, Auditory Scene Analysis: The Perceptual Organization of Sound, DOI DOI 10.7551/MITPRESS/1486.001.0001
[5]   Informational and energetic masking effects in the perception of two simultaneous talkers [J].
Brungart, DS .
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2001, 109 (03) :1101-1109
[6]   Normalization as a canonical neural computation [J].
Carandini, Matteo ;
Heeger, David J. .
NATURE REVIEWS NEUROSCIENCE, 2012, 13 (01) :51-62
[8]   Optimizing sound features for cortical neurons [J].
deCharms, RC ;
Blake, DT ;
Merzenich, MM .
SCIENCE, 1998, 280 (5368) :1439-1443
[9]   Spectro-temporal response field characterization with dynamic ripples in ferret primary auditory cortex [J].
Depireux, DA ;
Simon, JZ ;
Klein, DJ ;
Shamma, SA .
JOURNAL OF NEUROPHYSIOLOGY, 2001, 85 (03) :1220-1234
[10]   Neural coding of continuous speech in auditory cortex during monaural and dichotic listening [J].
Ding, Nai ;
Simon, Jonathan Z. .
JOURNAL OF NEUROPHYSIOLOGY, 2012, 107 (01) :78-89