Electrophysiological evidence for speech-specific audiovisual integration

被引:75
作者
Baart, Martijn [1 ,2 ]
Stekelenburg, Jeroen J. [2 ]
Vroomen, Jean [2 ]
机构
[1] Basque Ctr Cognit Brain & Language, Donostia San Sebastian 20009, Spain
[2] Tilburg Univ, Dept Cognit Neuropsychol, POB 90153,Warandelaan 2, NL-5000 LE Tilburg, Netherlands
关键词
N1; P2; Audiovisual speech; Sine-wave speech; Audiovisual integration; AUDITORY-VISUAL INTERACTIONS; MULTISENSORY INTEGRATION; SELECTIVE-ATTENTION; MISMATCH NEGATIVITY; PERCEPTION; INFORMATION; HUMANS; WINDOW; MODE;
D O I
10.1016/j.neuropsychologia.2013.11.011
中图分类号
B84 [心理学]; C [社会科学总论]; Q98 [人类学];
学科分类号
03 ; 0303 ; 030303 ; 04 ; 0402 ;
摘要
Lip-read speech is integrated with heard speech at various neural levels. Here, we investigated the extent to which lip-read induced modulations of the auditory N1 and P2 (measured with EEG) are indicative of speech-specific audiovisual integration; and we explored to what extent the ERPs were modulated by phonetic audiovisual congruency. In order to disentangle speech-specific (phonetic) integration from non-speech integration, we used Sine-Wave Speech (SWS) that was perceived as speech by half of the participants (they were in speech-mode), while the other half was in non-speech mode. Results showed that the N1 obtained with audiovisual stimuli peaked earlier than the N1 evoked by auditory-only stimuli. This lip-read induced speeding up of the N1 occurred for listeners in speech and non-speech mode. In contrast, if listeners were in speech-mode, lip-read speech also modulated the auditory P2, but not if listeners were in non-speech mode, thus revealing speech-specific audiovisual binding. Comparing ERPs for phonetically congruent audiovisual stimuli with ERPs for incongruent stimuli revealed an effect of phonetic stimulus congruency that started at similar to 200 ms after (in)congruence became apparent. Critically, akin to the P2 suppression, congruency effects were only observed if listeners were in speech mode, and not if they were in non-speech mode. Using identical stimuli, we thus confirm that audiovisual binding involves (partially) different neural mechanisms for sound processing in speech and non-speech mode. (C) 2013 Elsevier Ltd. All rights reserved.
引用
收藏
页码:115 / 121
页数:7
相关论文
共 53 条
[1]  
[Anonymous], 1998, Perceiving talking faces: From speech perception to a behavioral principle, MIT Press/Bradford Books series in cognitive psychology
[2]   Dual Neural Routing of Visual Facilitation in Speech Processing [J].
Arnal, Luc H. ;
Morillon, Benjamin ;
Kell, Christian A. ;
Giraud, Anne-Lise .
JOURNAL OF NEUROSCIENCE, 2009, 29 (43) :13445-13453
[3]   Phonetic processing areas revealed by sinewave speech and acoustically similar non-speech [J].
Benson, Randall R. ;
Richardson, Matthew ;
Whalen, D. H. ;
Lai, Song .
NEUROIMAGE, 2006, 31 (01) :342-353
[4]   Parametrically dissociating speech and nonspeech perception in the brain using fMRI [J].
Benson, RR ;
Whalen, DH ;
Richardson, M ;
Swainson, B ;
Clark, VP ;
Lai, S ;
Liberman, AM .
BRAIN AND LANGUAGE, 2001, 78 (03) :364-396
[5]  
Besle J., 2004, EUROPEAN J NEUROSCIE, V20
[6]  
Boersma P., 2020, Praat: doing phonetics by computer (Version 5.3.82) Computer software
[7]   The Natural Statistics of Audiovisual Speech [J].
Chandrasekaran, Chandramouli ;
Trubanova, Andrea ;
Stillittano, Sebastien ;
Caplier, Alice ;
Ghazanfar, Asif A. .
PLOS COMPUTATIONAL BIOLOGY, 2009, 5 (07)
[8]   Generalization of the generation of an MMN by illusory McGurk percepts: voiceless consonants [J].
Colin, C ;
Radeau, M ;
Soquet, A ;
Deltenre, P .
CLINICAL NEUROPHYSIOLOGY, 2004, 115 (09) :1989-2000
[9]   Mismatch negativity evoked by the McGurk-MacDonald effect: a phonetic representation within short-term memory [J].
Colin, C ;
Radeau, M ;
Soquet, A ;
Demolin, D ;
Colin, F ;
Deltenre, P .
CLINICAL NEUROPHYSIOLOGY, 2002, 113 (04) :495-506
[10]   A review of the evidence for P2 being an independent component process: age, sleep and modality [J].
Crowley, KE ;
Colrain, IM .
CLINICAL NEUROPHYSIOLOGY, 2004, 115 (04) :732-744