Neural speech restoration at the cocktail party: Auditory cortex recovers masked speech of both attended and ignored speakers

Cited by: 58
Authors
Brodbeck, Christian [1]
Jiao, Alex [2]
Hong, L. Elliot [3]
Simon, Jonathan Z. [1, 2, 4]
Affiliations
[1] Univ Maryland, Inst Syst Res, College Pk, MD 20742 USA
[2] Univ Maryland, Dept Elect & Comp Engn, College Pk, MD USA
[3] Univ Maryland, Maryland Psychiat Res Ctr, Dept Psychiat, Sch Med, Baltimore, MD USA
[4] Univ Maryland, Dept Biol, College Pk, MD USA
Funding
National Institutes of Health (USA);
Keywords
SPECTROTEMPORAL RECEPTIVE-FIELDS; PITCH-ANALYSIS SYSTEM; TO-NOISE RATIO; CORTICAL REPRESENTATION; ENERGETIC MASKING; BRAIN; EEG; MEG; PERCEPTION; SEPARATION;
DOI
10.1371/journal.pbio.3000883
Chinese Library Classification (CLC) number
Q5 [Biochemistry]; Q7 [Molecular Biology];
Subject classification code
071010 ; 081704 ;
Abstract
Humans are remarkably skilled at listening to one speaker out of an acoustic mixture of several speech sources. Two speakers are easily segregated, even without binaural cues, but the neural mechanisms underlying this ability are not well understood. One possibility is that early cortical processing performs a spectrotemporal decomposition of the acoustic mixture, allowing the attended speech to be reconstructed via optimally weighted recombinations that discount spectrotemporal regions where sources heavily overlap. Using human magnetoencephalography (MEG) responses to a 2-talker mixture, we show evidence for an alternative possibility, in which early, active segregation occurs even for strongly spectrotemporally overlapping regions. Early (approximately 70-millisecond) responses to nonoverlapping spectrotemporal features are seen for both talkers. When competing talkers' spectrotemporal features mask each other, the individual representations persist, but they occur with an approximately 20-millisecond delay. This suggests that the auditory cortex recovers acoustic features that are masked in the mixture, even if they occurred in the ignored speech. The existence of such noise-robust cortical representations, of features present in attended as well as ignored speech, suggests an active cortical stream segregation process, which could explain a range of behavioral effects of ignored background speech.
Pages: 22
Related papers
71 in total
[41]   Human Superior Temporal Gyrus Organization of Spectrotemporal Modulation Tuning Derived from Speech Stimuli [J].
Hullett, Patrick W. ;
Hamilton, Liberty S. ;
Mesgarani, Nima ;
Schreiner, Christoph E. ;
Chang, Edward F. .
JOURNAL OF NEUROSCIENCE, 2016, 36 (06) :2014-2026
[42]   The advantage of knowing where to listen [J].
Kidd, G ;
Arbogast, TL ;
Mason, CR ;
Gallun, FJ .
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2005, 118 (06) :3804-3815
[43]   Determining the energetic and informational components of speech-on-speech masking [J].
Kidd, Gerald, Jr. ;
Mason, Christine R. ;
Swaminathan, Jayaganesh ;
Roverud, Elin ;
Clayton, Kameron K. ;
Best, Virginia .
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2016, 140 (01) :132-144
[44]   The potential of onset enhancement for increased speech intelligibility in auditory prostheses [J].
Koning, Raphael ;
Wouters, Jan .
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2012, 132 (04) :2569-2581
[45]   Forty-five years after Broadbent (1958): Still no identification without attention [J].
Lachter, J ;
Forster, KI ;
Ruthruff, E .
PSYCHOLOGICAL REVIEW, 2004, 111 (04) :880-913
[46]   The Spectrotemporal Filter Mechanism of Auditory Selective Attention [J].
Lakatos, Peter ;
Musacchia, Gabriella ;
O'Connell, Monica N. ;
Falchier, Arnaud Y. ;
Javitt, Daniel C. ;
Schroeder, Charles E. .
NEURON, 2013, 77 (04) :750-761
[47]   Neural responses to uninterrupted natural speech can be extracted with precise temporal resolution [J].
Lalor, Edmund C. ;
Foxe, John J. .
EUROPEAN JOURNAL OF NEUROSCIENCE, 2010, 31 (01) :189-193
[48]   Perceptual restoration of masked speech in human cortex [J].
Leonard, Matthew K. ;
Baud, Maxime O. ;
Sjerps, Matthias J. ;
Chang, Edward F. .
NATURE COMMUNICATIONS, 2016, 7
[49]   USING CONFIDENCE-INTERVALS IN WITHIN-SUBJECT DESIGNS [J].
LOFTUS, GR ;
MASSON, MEJ .
PSYCHONOMIC BULLETIN & REVIEW, 1994, 1 (04) :476-490
[50]   Background noise exerts diverse effects on the cortical encoding of foreground sounds [J].
Malone, B. J. ;
Heiser, Marc A. ;
Beitel, Ralph E. ;
Schreiner, Christoph E. .
JOURNAL OF NEUROPHYSIOLOGY, 2017, 118 (02) :1034-1054