Task-Dependent Decoding of Speaker and Vowel Identity from Auditory Cortical Response Patterns

被引:73
作者
Bonte, Milene
Hausfeld, Lars
Scharke, Wolfgang
Valente, Giancarlo
Formisano, Elia
机构
[1] Maastricht Univ, Fac Psychol & Neurosci, Dept Cognit Neurosci, NL-6200 MD Maastricht, Netherlands
[2] Maastricht Univ, Fac Psychol & Neurosci, Maastricht Brain Imaging Ctr, NL-6200 MD Maastricht, Netherlands
关键词
auditory cortex; fMRI decoding; speech; voice; vowels; SPEECH; VOICE; CORTEX; ACTIVATION; PERCEPTION; REPRESENTATION; VARIABILITY; PLASTICITY; REGIONS; SOUNDS;
D O I
10.1523/JNEUROSCI.4339-13.2014
中图分类号
Q189 [神经科学];
学科分类号
071006 ;
摘要
Selective attention to relevant sound properties is essential for everyday listening situations. It enables the formation of different perceptual representations of the same acoustic input and is at the basis of flexible and goal-dependent behavior. Here, we investigated the role of the human auditory cortex in forming behavior-dependent representations of sounds. We used single-trial fMRI and analyzed cortical responses collected while subjects listened to the same speech sounds (vowels /a/,/i/, and /u/) spoken by different speakers (boy, girl, male) and performed a delayed-match-to-sample task on either speech sound or speaker identity. Univariate analyses showed a task-specific activation increase in the right superior temporal gyrus/sulcus (STG/STS) during speaker categorization and in the right posterior temporal cortex during vowel categorization. Beyond regional differences in activation levels, multivariate classification of single trial responses demonstrated that the success with which single speakers and vowels can be decoded from auditory cortical activation patterns depends on task demands and subject's behavioral performance. Speaker/vowel classification relied on distinct but overlapping regions across the (right) mid-anterior STG/STS (speakers) and bilateral mid-posterior STG/STS (vowels), as well as the superior temporal plane including Heschl's gyrus/sulcus. The task dependency of speaker/vowel classification demonstrates that the informative fMRI response patterns reflect the top-down enhancement of behaviorally relevant sound representations. Furthermore, our findings suggest that successful selection, processing, and retention of task-relevant sound properties relies on the joint encoding of information across early and higher-order regions of the auditory cortex.
引用
收藏
页码:4548 / 4557
页数:10
相关论文
共 48 条
[1]   Neural mechanisms for voice recognition [J].
Andics, Attila ;
McQueen, James M. ;
Petersson, Karl Magnus ;
Gal, Viktor ;
Rudas, Gabor ;
Vidnyanszky, Zoltan .
NEUROIMAGE, 2010, 52 (04) :1528-1540
[2]   FAMILY HANDEDNESS IN 3 GENERATIONS PREDICTED BY THE RIGHT SHIFT THEORY [J].
ANNETT, M .
ANNALS OF HUMAN GENETICS, 1979, 42 (MAY) :479-491
[3]  
[Anonymous], 2000, Permutation Tests: A Practical Guide to Resampling Methods for Testing Hypotheses
[4]   Task Difficulty and Performance Induce Diverse Adaptive Patterns in Gain and Shape of Primary Auditory Cortical Receptive Fields [J].
Atiani, Serin ;
Elhilali, Mounya ;
David, Stephen V. ;
Fritz, Jonathan B. ;
Shamma, Shihab A. .
NEURON, 2009, 61 (03) :467-480
[5]   Perceptual scaling of voice identity: common dimensions for different vowels and speakers [J].
Baumann, Oliver ;
Belin, Pascal .
PSYCHOLOGICAL RESEARCH-PSYCHOLOGISCHE FORSCHUNG, 2010, 74 (01) :110-120
[6]   Adaptation to speaker's voice in right anterior temporal lobe [J].
Belin, P ;
Zatorre, RJ .
NEUROREPORT, 2003, 14 (16) :2105-2109
[7]   Voice-selective areas in human auditory cortex [J].
Belin, P ;
Zatorre, RJ ;
Lafaille, P ;
Ahad, P ;
Pike, B .
NATURE, 2000, 403 (6767) :309-312
[8]   ACOUSTIC CORRELATES OF PERCEIVED SEXUAL IDENTITY IN PRE-ADOLESCENT CHILDRENS VOICES [J].
BENNETT, S ;
WEINBERG, B .
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1979, 66 (04) :989-1000
[9]   Human temporal lobe activation by speech and nonspeech sounds [J].
Binder, JR ;
Frost, JA ;
Hammeke, TA ;
Bellgowan, PSF ;
Springer, JA ;
Kaufman, JN ;
Possing, ET .
CEREBRAL CORTEX, 2000, 10 (05) :512-528
[10]  
Boersma P., 2002, Praat 4.0: a system for doing phonetics with the computer [Computer software]