Emergence of neural encoding of auditory objects while listening to competing speakers

被引:541
作者
Ding, Nai [1 ]
Simon, Jonathan Z. [1 ,2 ]
机构
[1] Univ Maryland, Dept Elect & Comp Engn, College Pk, MD 20742 USA
[2] Univ Maryland, Dept Biol, College Pk, MD 20742 USA
关键词
spectrotemporal response function; reverse correlation; phase locking; selective attention; CORTICAL REPRESENTATION; SPEECH-PERCEPTION; PLANUM TEMPORALE; COCKTAIL PARTY; CORTEX; ATTENTION; ORGANIZATION; MECHANISMS; PATTERNS; MASKING;
D O I
10.1073/pnas.1205381109
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
A visual scene is perceived in terms of visual objects. Similar ideas have been proposed for the analogous case of auditory scene analysis, although their hypothesized neural underpinnings have not yet been established. Here, we address this question by recording from subjects selectively listening to one of two competing speakers, either of different or the same sex, using magnetoencephalography. Individual neural representations are seen for the speech of the two speakers, with each being selectively phase locked to the rhythm of the corresponding speech stream and from which can be exclusively reconstructed the temporal envelope of that speech stream. The neural representation of the attended speech dominates responses (with latency near 100 ms) in posterior auditory cortex. Furthermore, when the intensity of the attended and background speakers is separately varied over an 8-dB range, the neural representation of the attended speech adapts only to the intensity of that speaker but not to the intensity of the background speaker, suggesting an object-level intensity gain control. In summary, these results indicate that concurrent auditory objects, even if spectrotemporally overlapping and not resolvable at the auditory periphery, are neurally encoded individually in auditory cortex and emerge as fundamental representational units for top-down attentional modulation and bottom-up neural adaptation.
引用
收藏
页码:11854 / 11859
页数:6
相关论文
共 46 条
  • [1] Right-hemisphere auditory cortex is dominant for coding syllable patterns in speech
    Abrams, Daniel A.
    Nicol, Trent
    Zecker, Steven
    Kraus, Nina
    [J]. JOURNAL OF NEUROSCIENCE, 2008, 28 (15) : 3958 - 3965
  • [2] Speech comprehension is correlated with temporal response patterns recorded from auditory cortex
    Ahissar, E
    Nagarajan, S
    Ahissar, M
    Protopapas, A
    Mahncke, H
    Merzenich, MM
    [J]. PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2001, 98 (23) : 13367 - 13372
  • [3] Attention-driven auditory cortex short-term plasticity helps segregate relevant sounds from noise
    Ahveninen, Jyrki
    Haemaelaeinen, Matti
    Jaaskelainen, Iiro P.
    Ahlfors, Seppo P.
    Huang, Samantha
    Lin, Fa-Hsuan
    Raij, Tommi
    Sams, Mikko
    Vasios, Christos E.
    Belliveau, John W.
    [J]. PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2011, 108 (10) : 4182 - 4187
  • [4] Bregman A., 1990, Auditory Scene Analysis: The Perceptual Organization of Sound, DOI DOI 10.7551/MITPRESS/1486.001.0001
  • [5] Informational and energetic masking effects in the perception of two simultaneous talkers
    Brungart, DS
    [J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2001, 109 (03) : 1101 - 1109
  • [6] Normalization as a canonical neural computation
    Carandini, Matteo
    Heeger, David J.
    [J]. NATURE REVIEWS NEUROSCIENCE, 2012, 13 (01) : 51 - 62
  • [8] Optimizing sound features for cortical neurons
    deCharms, RC
    Blake, DT
    Merzenich, MM
    [J]. SCIENCE, 1998, 280 (5368) : 1439 - 1443
  • [9] Spectro-temporal response field characterization with dynamic ripples in ferret primary auditory cortex
    Depireux, DA
    Simon, JZ
    Klein, DJ
    Shamma, SA
    [J]. JOURNAL OF NEUROPHYSIOLOGY, 2001, 85 (03) : 1220 - 1234
  • [10] Neural coding of continuous speech in auditory cortex during monaural and dichotic listening
    Ding, Nai
    Simon, Jonathan Z.
    [J]. JOURNAL OF NEUROPHYSIOLOGY, 2012, 107 (01) : 78 - 89