Neural speech restoration at the cocktail party: Auditory cortex recovers masked speech of both attended and ignored speakers

被引：58

作者：

Brodbeck, Christian ^{[1
]}

Jiao, Alex ^{[2
]}

Hong, L. Elliot ^{[3
]}

Simon, Jonathan Z. ^{[1
,2
,4
]}

机构：

[1] Univ Maryland, Inst Syst Res, College Pk, MD 20742 USA

[2] Univ Maryland, Dept Elect & Comp Engn, College Pk, MD USA

[3] Univ Maryland, Maryland Psychiat Res Ctr, Dept Psychiat, Sch Med, Baltimore, MD USA

[4] Univ Maryland, Dept Biol, College Pk, MD USA

来源：

PLOS BIOLOGY | 2020年 / 18卷 / 10期

基金：

美国国家卫生研究院;

关键词：

SPECTROTEMPORAL RECEPTIVE-FIELDS; PITCH-ANALYSIS SYSTEM; TO-NOISE RATIO; CORTICAL REPRESENTATION; ENERGETIC MASKING; BRAIN; EEG; MEG; PERCEPTION; SEPARATION;

D O I：

10.1371/journal.pbio.3000883

中图分类号：

Q5 [生物化学]; Q7 [分子生物学];

学科分类号：

071010 ; 081704 ;

摘要：

Humans are remarkably skilled at listening to one speaker out of an acoustic mixture of several speech sources. Two speakers are easily segregated, even without binaural cues, but the neural mechanisms underlying this ability are not well understood. One possibility is that early cortical processing performs a spectrotemporal decomposition of the acoustic mixture, allowing the attended speech to be reconstructed via optimally weighted recombinations that discount spectrotemporal regions where sources heavily overlap. Using human magnetoencephalography (MEG) responses to a 2-talker mixture, we show evidence for an alternative possibility, in which early, active segregation occurs even for strongly spectrotemporally overlapping regions. Early (approximately 70-millisecond) responses to nonoverlapping spectrotemporal features are seen for both talkers. When competing talkers' spectrotemporal features mask each other, the individual representations persist, but they occur with an approximately 20-millisecond delay. This suggests that the auditory cortex recovers acoustic features that are masked in the mixture, even if they occurred in the ignored speech. The existence of such noise-robust cortical representations, of features present in attended as well as ignored speech, suggests an active cortical stream segregation process, which could explain a range of behavioral effects of ignored background speech.

引用

页数：22

共 71 条

[1]

[Anonymous], 1990, AUDITORY SCENE ANAL

[2] Task Difficulty and Performance Induce Diverse Adaptive Patterns in Gain and Shape of Primary Auditory Cortical Receptive Fields [J].

Atiani, Serin ;

Elhilali, Mounya ;

David, Stephen V. ;

Fritz, Jonathan B. ;

Shamma, Shihab A. .

NEURON, 2009, 61 (03) :467-480

[3] AN INFORMATION MAXIMIZATION APPROACH TO BLIND SEPARATION AND BLIND DECONVOLUTION [J].

BELL, AJ ;

SEJNOWSKI, TJ .

NEURAL COMPUTATION, 1995, 7 (06) :1129-1159

[4] Auditory-Inspired Speech Envelope Extraction Methods for Improved EEG-Based Auditory Attention Detection in a Cocktail Party Scenario [J].

Biesmans, Wouter ;

Das, Neetha ;

Francart, Tom ;

Bertrand, Alexander .

IEEE TRANSACTIONS ON NEURAL SYSTEMS AND REHABILITATION ENGINEERING, 2017, 25 (05) :402-412

[5] Predicting Perception in Noise Using Cortical Auditory Evoked Potentials [J].

Billings, Curtis J. ;

McMillan, Garnett P. ;

Penman, Tina M. ;

Gille, Sun Mi .

JARO-JOURNAL OF THE ASSOCIATION FOR RESEARCH IN OTOLARYNGOLOGY, 2013, 14 (06) :891-903

[6] Human evoked cortical activity to signal-to-noise ratio and absolute signal level [J].

Billings, Curtis J. ;

Tremblay, Kelly L. ;

Stecker, G. Christopher ;

Tolin, Wendy M. .

HEARING RESEARCH, 2009, 254 (1-2) :15-24

[7] RESETTING THE PITCH-ANALYSIS SYSTEM .2. ROLE OF SUDDEN ONSETS AND OFFSETS IN THE PERCEPTION OF INDIVIDUAL COMPONENTS IN A CLUSTER OF OVERLAPPING TONES [J].

BREGMAN, AS ;

AHAD, PA ;

KIM, J .

JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1994, 96 (05) :2694-2703

[8] RESETTING THE PITCH-ANALYSIS SYSTEM .1. EFFECTS OF RISE TIMES OF TONES IN NOISE BACKGROUNDS OR OF HARMONICS IN A COMPLEX TONE [J].

BREGMAN, AS ;

AHAD, P ;

KIM, J ;

MELNERICH, L .

PERCEPTION & PSYCHOPHYSICS, 1994, 56 (02) :155-162

[9]

Broadbent D. E., 1958, PERCEPTION COMMUNICA

[10] Rapid Transformation from Auditory to Linguistic Representations of Continuous Speech [J].

Brodbeck, Christian ;

Hong, L. Elliot ;

Simon, Jonathan Z. .

CURRENT BIOLOGY, 2018, 28 (24) :3976-+

← 1 2 3 4 5 6 7 8 →