Task-Dependent Decoding of Speaker and Vowel Identity from Auditory Cortical Response Patterns

Cited by: 73

Authors:

Bonte, Milene

Hausfeld, Lars

Scharke, Wolfgang

Valente, Giancarlo

Formisano, Elia

Affiliations:

[1] Maastricht Univ, Fac Psychol & Neurosci, Dept Cognit Neurosci, NL-6200 MD Maastricht, Netherlands

[2] Maastricht Univ, Fac Psychol & Neurosci, Maastricht Brain Imaging Ctr, NL-6200 MD Maastricht, Netherlands

Source:

JOURNAL OF NEUROSCIENCE | 2014 / Vol. 34 / Issue 13

Keywords:

auditory cortex; fMRI decoding; speech; voice; vowels; SPEECH; VOICE; CORTEX; ACTIVATION; PERCEPTION; REPRESENTATION; VARIABILITY; PLASTICITY; REGIONS; SOUNDS;

DOI:

10.1523/JNEUROSCI.4339-13.2014

Chinese Library Classification (CLC) number:

Q189 [Neuroscience];

Discipline classification code:

071006;

Abstract:

Selective attention to relevant sound properties is essential for everyday listening situations. It enables the formation of different perceptual representations of the same acoustic input and is at the basis of flexible and goal-dependent behavior. Here, we investigated the role of the human auditory cortex in forming behavior-dependent representations of sounds. We used single-trial fMRI and analyzed cortical responses collected while subjects listened to the same speech sounds (vowels /a/, /i/, and /u/) spoken by different speakers (boy, girl, male) and performed a delayed-match-to-sample task on either speech sound or speaker identity. Univariate analyses showed a task-specific activation increase in the right superior temporal gyrus/sulcus (STG/STS) during speaker categorization and in the right posterior temporal cortex during vowel categorization. Beyond regional differences in activation levels, multivariate classification of single-trial responses demonstrated that the success with which single speakers and vowels can be decoded from auditory cortical activation patterns depends on task demands and subjects' behavioral performance. Speaker/vowel classification relied on distinct but overlapping regions across the (right) mid-anterior STG/STS (speakers) and bilateral mid-posterior STG/STS (vowels), as well as the superior temporal plane including Heschl's gyrus/sulcus. The task dependency of speaker/vowel classification demonstrates that the informative fMRI response patterns reflect the top-down enhancement of behaviorally relevant sound representations. Furthermore, our findings suggest that successful selection, processing, and retention of task-relevant sound properties rely on the joint encoding of information across early and higher-order regions of the auditory cortex.

Citation

Pages: 4548-4557

Page count: 10
