Speaker-normalized sound representations in the human auditory cortex

被引:41
作者
Sjerps, Matthias J. [1 ,2 ]
Fox, Neal P. [3 ]
Johnson, Keith [4 ]
Chang, Edward F. [3 ,5 ]
机构
[1] Radboud Univ Nijmegen, Donders Inst Brain Cognit & Behav, Ctr Cognit Neuroimaging, Kapittelweg 29, NL-6525 EN Nijmegen, Netherlands
[2] Max Planck Inst Psycholinguist, Wundtlaan 1, NL-6525 XD Nijmegen, Netherlands
[3] Univ Calif San Francisco, Dept Neurol Surg, 675 Nelson Rising Lane, San Francisco, CA 94158 USA
[4] Univ Calif Berkeley, Dept Linguist, 1203 Dwinelle Hall 2650, Berkeley, CA 94720 USA
[5] Univ Calif San Francisco, Weill Inst Neurosci, 675 Nelson Rising Lane, San Francisco, CA 94158 USA
关键词
RELIABLE SPECTRAL PROPERTIES; NONSPEECH CONTEXT; TEMPORAL-LOBE; SPEECH; VOICE; COMPENSATION; PERCEPTION; RESPONSES; ADAPTATION; MASKING;
D O I
10.1038/s41467-019-10365-z
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
The acoustic dimensions that distinguish speech sounds (like the vowel differences in "boot" and "boat") also differentiate speakers' voices. Therefore, listeners must normalize across speakers without losing linguistic information. Past behavioral work suggests an important role for auditory contrast enhancement in normalization: preceding context affects listeners' perception of subsequent speech sounds. Here, using intracranial electrocorticography in humans, we investigate whether and how such context effects arise in auditory cortex. Participants identified speech sounds that were preceded by phrases from two different speakers whose voices differed along the same acoustic dimension as target words (the lowest resonance of the vocal tract). In every participant, target vowels evoke a speaker-dependent neural response that is consistent with the listener's perception, and which follows from a contrast enhancement model. Auditory cortex processing thus displays a critical feature of normalization, allowing listeners to extract meaningful content from the voices of diverse speakers.
引用
收藏
页数:9
相关论文
共 76 条
[1]   Neuromagnetic correlates of voice pitch, vowel type, and speaker size in auditory cortex [J].
Andermann, Martin ;
Patterson, Roy D. ;
Vogt, Carolin ;
Winterstetter, Lisa ;
Rupp, Andre .
NEUROIMAGE, 2017, 158 :79-89
[2]   Mean-based neural coding of voices [J].
Andics, Attila ;
McQueen, James M. ;
Petersson, Karl Magnus .
NEUROIMAGE, 2013, 79 :351-360
[3]   Neural mechanisms for voice recognition [J].
Andics, Attila ;
McQueen, James M. ;
Petersson, Karl Magnus ;
Gal, Viktor ;
Rudas, Gabor ;
Vidnyanszky, Zoltan .
NEUROIMAGE, 2010, 52 (04) :1528-1540
[4]   Adaptation to speaker's voice in right anterior temporal lobe [J].
Belin, P ;
Zatorre, RJ .
NEUROREPORT, 2003, 14 (16) :2105-2109
[5]   Voice-selective areas in human auditory cortex [J].
Belin, P ;
Zatorre, RJ ;
Lafaille, P ;
Ahad, P ;
Pike, B .
NATURE, 2000, 403 (6767) :309-312
[6]   Auditory Cortex Represents Both Pitch Judgments and the Corresponding Acoustic Cues [J].
Bizley, Jennifer K. ;
Walker, Kerry M. M. ;
Nodal, Fernando R. ;
King, Andrew J. ;
Schnupp, Jan W. H. .
CURRENT BIOLOGY, 2013, 23 (07) :620-625
[7]   AUDITORY SPEECH PROCESSING IN THE LEFT TEMPORAL-LOBE - AN ELECTRICAL INTERFERENCE STUDY [J].
BOATMAN, D ;
LESSER, RP ;
GORDON, B .
BRAIN AND LANGUAGE, 1995, 51 (02) :269-290
[8]   Time course of forward masking tuning curves in cat primary auditory cortex [J].
Brosch, M ;
Schreiner, CE .
JOURNAL OF NEUROPHYSIOLOGY, 1997, 77 (02) :923-943
[9]   Speech-Specific Tuning of Neurons in Human Superior Temporal Gyrus [J].
Chan, Alexander M. ;
Dykstra, Andrew R. ;
Jayaram, Vinay ;
Leonard, Matthew K. ;
Travis, Katherine E. ;
Gygi, Brian ;
Baker, Janet M. ;
Eskandar, Emad ;
Hochberg, Leigh R. ;
Halgren, Eric ;
Cash, Sydney S. .
CEREBRAL CORTEX, 2014, 24 (10) :2679-2693
[10]   Categorical speech representation in human superior temporal gyrus [J].
Chang, Edward F. ;
Rieger, Jochem W. ;
Johnson, Keith ;
Berger, Mitchel S. ;
Barbaro, Nicholas M. ;
Knight, Robert T. .
NATURE NEUROSCIENCE, 2010, 13 (11) :1428-U169