Audio-visual speech scene analysis: Characterization of the dynamics of unbinding and rebinding the McGurk effect

被引:25
|
作者
Nahorna, Olha [1 ]
Berthommier, Frederic [1 ]
Schwartz, Jean-Luc [1 ]
机构
[1] Grenoble Univ, CNRS, Speech & Cognit Dept, GIPSA Lab,UMR 5216, Grenoble, France
基金
欧洲研究理事会;
关键词
VISUAL SPEECH; SPATIAL ATTENTION; AUDITORY SPEECH; BIMODAL SPEECH; PERCEPTION; INTEGRATION; INFORMATION; DECISIONS; VOICES; INTELLIGIBILITY;
D O I
10.1121/1.4904536
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
While audiovisual interactions in speech perception have long been considered as automatic, recent data suggest that this is not the case. In a previous study, Nahorna et al. [(2012). J. Acoust. Soc. Am. 132, 1061-1077] showed that the McGurk effect is reduced by a previous incoherent audiovisual context. This was interpreted as showing the existence of an audiovisual binding stage controlling the fusion process. Incoherence would produce unbinding and decrease the weight of the visual input in fusion. The present paper explores the audiovisual binding system to characterize its dynamics. A first experiment assesses the dynamics of unbinding, and shows that it is rapid: An incoherent context less than 0.5 s long (typically one syllable) suffices to produce a maximal reduction in the McGurk effect. A second experiment tests the rebinding process, by presenting a short period of either coherent material or silence after the incoherent unbinding context. Coherence provides rebinding, with a recovery of the McGurk effect, while silence provides no rebinding and hence freezes the unbinding process. These experiments are interpreted in the framework of an audiovisual speech scene analysis process assessing the perceptual organization of an audiovisual speech input before decision takes place at a higher processing stage. (C) 2015 Acoustical Society of America.
引用
收藏
页码:362 / 377
页数:16
相关论文
共 50 条
  • [21] Effects of audio-visual integration on the detection of masked speech and non-speech sounds
    Eramudugolla, Ranmalee
    Henderson, Rachel
    Mattingley, Jason B.
    BRAIN AND COGNITION, 2011, 75 (01) : 60 - 66
  • [22] Atypical audio-visual neural synchrony and speech processing in early autism
    Wang, Xiaoyue
    Bouton, Sophie
    Kojovic, Nada
    Giraud, Anne-Lise
    Schaer, Marie
    JOURNAL OF NEURODEVELOPMENTAL DISORDERS, 2025, 17 (01)
  • [23] The processing of audio-visual speech: empirical and neural bases
    Campbell, Ruth
    PHILOSOPHICAL TRANSACTIONS OF THE ROYAL SOCIETY B-BIOLOGICAL SCIENCES, 2008, 363 (1493) : 1001 - 1010
  • [24] Effects of audio-visual information on the intelligibility of alaryngeal speech
    Evitts, Paul M.
    Portugal, Lindsay
    Van Dine, Ami
    Holler, Aline
    JOURNAL OF COMMUNICATION DISORDERS, 2010, 43 (02) : 92 - 104
  • [25] Audio-visual speech recognition using an infrared headset
    Huang, J
    Potamianos, G
    Connell, J
    Neti, C
    SPEECH COMMUNICATION, 2004, 44 (1-4) : 83 - 96
  • [26] Cortical integration of audio-visual speech and non-speech stimuli
    Wyk, Brent C. Vander
    Ramsay, Gordon J.
    Hudac, Caitlin M.
    Jones, Warren
    Lin, David
    Klin, Ami
    Lee, Su Mei
    Pelphrey, Kevin A.
    BRAIN AND COGNITION, 2010, 74 (02) : 97 - 106
  • [27] Top-Down Predictions of Familiarity and Congruency in Audio-Visual Speech Perception at Neural Level
    Kolozsvari, Orsolya B.
    Xu, Weiyong
    Leppanen, Paavo H. T.
    Hamalainen, Jarmo A.
    FRONTIERS IN HUMAN NEUROSCIENCE, 2019, 13
  • [28] Objectivization of Audio-Visual Correlation Analysis
    Kunka, Bartosz
    Kostek, Bozena
    ARCHIVES OF ACOUSTICS, 2012, 37 (01) : 63 - 72
  • [29] The impact of the Lombard effect on audio and visual speech recognition systems
    Marxer, Ricard
    Barker, Jon
    Alghamdi, Najwa
    Maddock, Steve
    SPEECH COMMUNICATION, 2018, 100 : 58 - 68
  • [30] Contributions of local speech encoding and functional connectivity to audio-visual speech perception
    Giordano, Bruno L.
    Ince, Robin A. A.
    Gross, Joachim
    Schyns, Philippe G.
    Panzeri, Stefano
    Kayser, Christoph
    ELIFE, 2017, 6