Speaker-normalized sound representations in the human auditory cortex

被引:41
作者
Sjerps, Matthias J. [1 ,2 ]
Fox, Neal P. [3 ]
Johnson, Keith [4 ]
Chang, Edward F. [3 ,5 ]
机构
[1] Radboud Univ Nijmegen, Donders Inst Brain Cognit & Behav, Ctr Cognit Neuroimaging, Kapittelweg 29, NL-6525 EN Nijmegen, Netherlands
[2] Max Planck Inst Psycholinguist, Wundtlaan 1, NL-6525 XD Nijmegen, Netherlands
[3] Univ Calif San Francisco, Dept Neurol Surg, 675 Nelson Rising Lane, San Francisco, CA 94158 USA
[4] Univ Calif Berkeley, Dept Linguist, 1203 Dwinelle Hall 2650, Berkeley, CA 94720 USA
[5] Univ Calif San Francisco, Weill Inst Neurosci, 675 Nelson Rising Lane, San Francisco, CA 94158 USA
关键词
RELIABLE SPECTRAL PROPERTIES; NONSPEECH CONTEXT; TEMPORAL-LOBE; SPEECH; VOICE; COMPENSATION; PERCEPTION; RESPONSES; ADAPTATION; MASKING;
D O I
10.1038/s41467-019-10365-z
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
The acoustic dimensions that distinguish speech sounds (like the vowel differences in "boot" and "boat") also differentiate speakers' voices. Therefore, listeners must normalize across speakers without losing linguistic information. Past behavioral work suggests an important role for auditory contrast enhancement in normalization: preceding context affects listeners' perception of subsequent speech sounds. Here, using intracranial electrocorticography in humans, we investigate whether and how such context effects arise in auditory cortex. Participants identified speech sounds that were preceded by phrases from two different speakers whose voices differed along the same acoustic dimension as target words (the lowest resonance of the vocal tract). In every participant, target vowels evoke a speaker-dependent neural response that is consistent with the listener's perception, and which follows from a contrast enhancement model. Auditory cortex processing thus displays a critical feature of normalization, allowing listeners to extract meaningful content from the voices of diverse speakers.
引用
收藏
页数:9
相关论文
共 76 条
[61]   SHORT-TERM ADAPTATION IN SINGLE AUDITORY-NERVE FIBERS - SOME POST-STIMULATORY EFFECTS [J].
SMITH, RL .
JOURNAL OF NEUROPHYSIOLOGY, 1977, 40 (05) :1098-1112
[62]   Spectrotemporal analysis of evoked and induced electroencephalographic responses in primary auditory cortex (A1) of the awake monkey [J].
Steinschneider, Mitchell ;
Fishman, Yonatan I. ;
Arezzo, Joseph C. .
CEREBRAL CORTEX, 2008, 18 (03) :610-625
[63]   Intracranial Study of Speech-Elicited Activity on the Human Posterolateral Superior Temporal Gyrus [J].
Steinschneider, Mitchell ;
Nourski, Kirill V. ;
Kawasaki, Hiroto ;
Oya, Hiroyuki ;
Brugge, John F. ;
Howard, Matthew A., III .
CEREBRAL CORTEX, 2011, 21 (10) :2332-2347
[64]   Toward a model for lexical access based on acoustic landmarks and distinctive features [J].
Stevens, KN .
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2002, 111 (04) :1872-1891
[65]   Perceptual sensitivity to spectral properties of earlier sounds during speech categorization [J].
Stilp, Christian E. ;
Assgari, Ashley A. .
ATTENTION PERCEPTION & PSYCHOPHYSICS, 2018, 80 (05) :1300-1310
[66]   Predicting contrast effects following reliable spectral properties in speech perception [J].
Stilp, Christian E. ;
Anderson, Paul W. ;
Winn, Matthew B. .
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2015, 137 (06) :3466-3476
[67]   Auditory color constancy: Calibration to reliable spectral properties across nonspeech context and targets [J].
Stilp, Christian E. ;
Alexander, Joshua M. ;
Kiefte, Michael ;
Kluender, Keith R. .
ATTENTION PERCEPTION & PSYCHOPHYSICS, 2010, 72 (02) :470-480
[68]   Intonational speech prosody encoding in the human auditory cortex [J].
Tang, C. ;
Hamilton, L. S. ;
Chang, E. F. .
SCIENCE, 2017, 357 (6353) :797-801
[69]   Multiple time scales of adaptation in auditory cortex neurons [J].
Ulanovsky, N ;
Las, L ;
Farkas, D ;
Nelken, I .
JOURNAL OF NEUROSCIENCE, 2004, 24 (46) :10440-10453
[70]   Similar Response Patterns Do Not Imply Identical Origins: An Energetic Masking Account of Nonspeech Effects in Compensation for Coarticulation [J].
Viswanathan, Navin ;
Magnuson, James S. ;
Fowler, Carol A. .
JOURNAL OF EXPERIMENTAL PSYCHOLOGY-HUMAN PERCEPTION AND PERFORMANCE, 2013, 39 (04) :1181-1192