Early auditory sensory processing of voices is facilitated by visual mechanisms

被引:35
作者
Schall, Sonja [1 ]
Kiebel, Stefan J. [2 ,3 ]
Maess, Burkhard [1 ]
von Kriegstein, Katharina [1 ,4 ]
机构
[1] Max Planck Inst Human Cognit & Brain Sci, D-04103 Leipzig, Germany
[2] Max Planck Inst Human Cognit & Brain Sci, Dept Neurol, D-04103 Leipzig, Germany
[3] Univ Clin Jena, D-07747 Jena, Germany
[4] Humboldt Univ, D-12489 Berlin, Germany
关键词
Multisensory; Audiovisual; Speaker recognition; FFA; Human; MULTISENSORY INTERACTIONS; TEMPORAL DYNAMICS; FACE; SPEECH; CORTEX; ACTIVATION; INTEGRATION; DISCRIMINATION; RECORDINGS; LATENCY;
D O I
10.1016/j.neuroimage.2013.03.043
中图分类号
Q189 [神经科学];
学科分类号
071006 ;
摘要
How do we recognize people that are familiar to us? There is overwhelming evidence that our brains process voice and face in a combined fashion to optimally recognize both who is speaking and what is said. Surprisingly, this combined processing of voice and face seems to occur even if one stream of information is missing. For example, if subjects only hear someone who is familiar to them talking, without seeing their face, visual face-processing areas are active. One reason for this crossmodal activation might be that it is instrumental for early sensory processing of voices a hypothesis that is contrary to current models of unisensory perception. Here, we test this hypothesis by harnessing a temporally highly resolved method, i.e., magnetoencephalography (MEG), to identify the temporal response profile of the fusiform face area in response to auditory-only voice recognition. Participants briefly learned a set of voices audio-visually, i.e., together with a talking face. After learning, we measured subjects' MEG signals in response to the auditory-only, now familiar, voices. The results revealed three key mechanisms that characterize the sensory processing of familiar speakers' voices: (i) activation in the face-sensitive fusiform gyms at very early auditory processing stages, i.e., only 100 ms after auditory onset, (ii) a temporal facilitation of auditory processing (M200), and (iii) a correlation of this temporal facilitation with recognition performance. These findings suggest that a neural representation of face information is evoked before the identity of the voice is even recognized and that the brain uses this visual representation to facilitate early sensory processing of auditory-only voices. (C) 2013 Elsevier Inc. All rights reserved.
引用
收藏
页码:237 / 245
页数:9
相关论文
共 57 条
[1]   Temporal dynamics of adaptation to natural sounds in the human auditory cortex [J].
Altmann, Christian F. ;
Nakata, Hiroki ;
Noguchi, Yasuki ;
Inui, Koji ;
Hoshiyama, Minoru ;
Kaneoke, Yoshiki ;
Kakigi, Ryusuke .
CEREBRAL CORTEX, 2008, 18 (06) :1350-1360
[2]   Spatio temporal dynamics of face recognition [J].
Barbeau, Emmanuel J. ;
Taylor, Margot J. ;
Regis, Jean ;
Marquis, Patrick ;
Chauvel, Patrick ;
Liegeois-Chauvel, Catherine .
CEREBRAL CORTEX, 2008, 18 (05) :997-1009
[3]   Visual Activation and Audiovisual Interactions in the Auditory Cortex during Speech Perception: Intracranial Recordings in Humans [J].
Besle, Julien ;
Fischer, Catherine ;
Bidet-Caulet, Aurelie ;
Lecaignard, Francoise ;
Bertrand, Olivier ;
Giard, Marie-Helene .
JOURNAL OF NEUROSCIENCE, 2008, 28 (52) :14301-14310
[4]   Physiological and anatomical evidence for multisensory interactions in auditory cortex [J].
Bizley, Jennifer K. ;
Nodal, Fernando R. ;
Bajo, Victoria M. ;
Nelken, Israel ;
King, Andrew J. .
CEREBRAL CORTEX, 2007, 17 (09) :2172-2189
[5]   Direct Structural Connections between Voice- and Face-Recognition Areas [J].
Blank, Helen ;
Anwander, Alfred ;
von Kriegstein, Katharina .
JOURNAL OF NEUROSCIENCE, 2011, 31 (36) :12906-12915
[6]   EFFECTS OF STIMULUS CONTENT AND DURATION ON TALKER IDENTIFICATION [J].
BRICKER, PD ;
PRUZANSKY, S .
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1966, 40 (06) :1441-+
[7]   Activation of auditory cortex during silent lipreading [J].
Calvert, GA ;
Bullmore, ET ;
Brammer, MJ ;
Campbell, R ;
Williams, SCR ;
McGuire, PK ;
Woodruff, PWR ;
Iverson, SD ;
David, AS .
SCIENCE, 1997, 276 (5312) :593-596
[8]   The Early Spatio-Temporal Correlates and Task Independence of Cerebral Voice Processing Studied with MEG [J].
Capilla, Almudena ;
Belin, Pascal ;
Gross, Joachim .
CEREBRAL CORTEX, 2013, 23 (06) :1388-1395
[9]   The Natural Statistics of Audiovisual Speech [J].
Chandrasekaran, Chandramouli ;
Trubanova, Andrea ;
Stillittano, Sebastien ;
Caplier, Alice ;
Ghazanfar, Asif A. .
PLOS COMPUTATIONAL BIOLOGY, 2009, 5 (07)
[10]   Electrophysiological evidence for an early processing of human voices [J].
Charest, Ian ;
Pernet, Cyril R. ;
Rousselet, Guillaume A. ;
Quinones, Ileana ;
Latinus, Marianne ;
Fillion-Bilodeau, Sarah ;
Chartrand, Jean-Pierre ;
Belin, Pascal .
BMC NEUROSCIENCE, 2009, 10 :127