Top-Down Predictions of Familiarity and Congruency in Audio-Visual Speech Perception at Neural Level

被引:2
作者
Kolozsvari, Orsolya B. [1 ,2 ]
Xu, Weiyong [1 ,2 ]
Leppanen, Paavo H. T. [1 ,2 ]
Hamalainen, Jarmo A. [1 ,2 ]
机构
[1] Univ Jyvaskyla, Dept Psychol, Jyvaskyla, Finland
[2] Univ Jyvaskyla, Jyvaskyla Ctr Interdisciplinary Brain Res CIBR, Jyvaskyla, Finland
基金
芬兰科学院;
关键词
speech perception; magnetoencephalography; audio-visual stimuli; audio-visual integration; familiarity; MULTISENSORY INTERACTIONS; VISUAL SPEECH; INTEGRATION; BRAIN; ACTIVATION; RESPONSES; FMRI; MEG; LOCALIZATION; INFORMATION;
D O I
10.3389/fnhum.2019.00243
中图分类号
Q189 [神经科学];
学科分类号
071006 ;
摘要
During speech perception, listeners rely on multimodal input and make use of both auditory and visual information. When presented with speech, for example syllables, the differences in brain responses to distinct stimuli are not, however, caused merely by the acoustic or visual features of the stimuli. The congruency of the auditory and visual information and the familiarity of a syllable, that is, whether it appears in the listener's native language or not, also modulates brain responses. We investigated how the congruency and familiarity of the presented stimuli affect brain responses to audio-visual (AV) speech in 12 adult Finnish native speakers and 12 adult Chinese native speakers. They watched videos of a Chinese speaker pronouncing syllables (/pa/, /pha/, /ta/, /tha/, /fa/) during a magnetoencephalography (MEG) measurement where only /pa/ and /ta/ were part of Finnish phonology while all the stimuli were part of Chinese phonology. The stimuli were presented in audio-visual (congruent or incongruent), audio only, or visual only conditions. The brain responses were examined in five time-windows: 75-125, 150-200, 200-300, 300-400, and 400-600 ms. We found significant differences for the congruency comparison in the fourth time-window (300-400 ms) in both sensor and source level analysis. Larger responses were observed for the incongruent stimuli than for the congruent stimuli. For the familiarity comparisons no significant differences were found. The results are in line with earlier studies reporting on the modulation of brain responses for audio-visual congruency around 250-500 ms. This suggests a much stronger process for the general detection of a mismatch between predictions based on lip movements and the auditory signal than for the top-down modulation of brain responses based on phonological information.
引用
收藏
页数:11
相关论文
共 48 条
[1]  
[Anonymous], 2018, PRAAT DOING PHONETIC
[2]   Dual Neural Routing of Visual Facilitation in Speech Processing [J].
Arnal, Luc H. ;
Morillon, Benjamin ;
Kell, Christian A. ;
Giraud, Anne-Lise .
JOURNAL OF NEUROSCIENCE, 2009, 29 (43) :13445-13453
[3]   Electrophysiological evidence for speech-specific audiovisual integration [J].
Baart, Martijn ;
Stekelenburg, Jeroen J. ;
Vroomen, Jean .
NEUROPSYCHOLOGIA, 2014, 53 :115-121
[4]   fMRI-Guided Transcranial Magnetic Stimulation Reveals That the Superior Temporal Sulcus Is a Cortical Locus of the McGurk Effect [J].
Beauchamp, Michael S. ;
Nath, Audrey R. ;
Pasalar, Siavash .
JOURNAL OF NEUROSCIENCE, 2010, 30 (07) :2414-2417
[5]   Neural pathways for visual speech perception [J].
Bernstein, Lynne E. ;
Liebenthal, Einat .
FRONTIERS IN NEUROSCIENCE, 2014, 8
[6]   Bimodal speech: early suppressive visual effects in human auditory cortex [J].
Besle, J ;
Fort, A ;
Delpuech, C ;
Giard, MH .
EUROPEAN JOURNAL OF NEUROSCIENCE, 2004, 20 (08) :2225-2234
[7]   Multisensory integration sites identified by perception of spatial wavelet filtered visual speech gesture information [J].
Callan, DE ;
Jones, JA ;
Munhall, K ;
Kroos, C ;
Callan, AM ;
Vatikiotis-Bateson, E .
JOURNAL OF COGNITIVE NEUROSCIENCE, 2004, 16 (05) :805-816
[8]   The processing of audio-visual speech: empirical and neural bases [J].
Campbell, Ruth .
PHILOSOPHICAL TRANSACTIONS OF THE ROYAL SOCIETY B-BIOLOGICAL SCIENCES, 2008, 363 (1493) :1001-1010
[9]   The cortical organization of audio-visual sentence comprehension: an fMRI study at 4 Tesla [J].
Capek, CM ;
Bavelier, D ;
Corina, D ;
Newman, AJ ;
Jezzard, P ;
Neville, HJ .
COGNITIVE BRAIN RESEARCH, 2004, 20 (02) :111-119
[10]   Dynamic statistical parametric mapping: Combining fMRI and MEG for high-resolution imaging of cortical activity [J].
Dale, AM ;
Liu, AK ;
Fischl, BR ;
Buckner, RL ;
Belliveau, JW ;
Lewine, JD ;
Halgren, E .
NEURON, 2000, 26 (01) :55-67