Timing in audiovisual speech perception: A mini review and new psychophysical data

Cited by: 17
Authors
Venezia, Jonathan H. [1 ]
Thurman, Steven M. [2 ]
Matchin, William [3 ]
George, Sahara E. [4 ]
Hickok, Gregory [1 ]
Affiliations
[1] Univ Calif Irvine, Dept Cognit Sci, Irvine, CA 92697 USA
[2] Univ Calif Los Angeles, Dept Psychol, Los Angeles, CA USA
[3] Univ Maryland, Dept Linguist, College Park, MD 20742 USA
[4] Univ Calif Irvine, Dept Anat & Neurobiol, Irvine, CA 92717 USA
Keywords
Audiovisual speech; Multisensory integration; Prediction; Classification image; Timing; McGurk; Speech kinematics
Indexed keywords
MULTISENSORY INTEGRATION; VISUAL SPEECH; SPATIOTEMPORAL DYNAMICS; SUPERIOR COLLICULUS; MOVEMENT VELOCITY; TEMPORAL WINDOW; RECOGNITION; INFORMATION; TIME; SYNCHRONY
DOI
10.3758/s13414-015-1026-y
Chinese Library Classification (CLC)
B84 [Psychology]
Discipline code
04; 0402
Abstract
Recent influential models of audiovisual speech perception suggest that visual speech aids perception by generating predictions about the identity of upcoming speech sounds. These models rest on the assumption that visual speech leads auditory speech in time. However, it is unclear whether and to what extent temporally leading visual speech information contributes to perception. Previous studies of audiovisual-speech timing have relied on psychophysical procedures that require artificial manipulation of cross-modal alignment or stimulus duration. We introduce a classification procedure that tracks perceptually relevant visual speech information in time without requiring such manipulations. Participants were shown videos of a McGurk syllable (auditory /apa/ + visual /aka/ = perceptual /ata/) and asked to perform phoneme identification (/apa/ yes-no). The mouth region of the visual stimulus was overlaid with a dynamic transparency mask that obscured visual speech in some frames but not others, randomized across trials. Variability in participants' responses (~35% identification of /apa/, compared to ~5% in the absence of the masker) served as the basis for classification analysis. The outcome was a high-resolution spatiotemporal map of perceptually relevant visual features. We produced these maps for McGurk stimuli at different audiovisual temporal offsets (natural timing, 50-ms visual lead, and 100-ms visual lead). Briefly, temporally leading (~130 ms) visual information did influence auditory perception. Moreover, several visual features influenced perception of a single speech sound, with the relative influence of each feature depending on both its temporal relation to the auditory signal and its informational content.
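To make the classification analysis concrete, below is a minimal reverse-correlation sketch in Python/NumPy. It is not the authors' code: their masks were spatiotemporal (varying over both pixels and frames), whereas this toy collapses each video frame to a single transparency value, and the simulated observer, trial count, and frame indices are invented for illustration. The logic, however, is the standard classification-image recipe: compare the average mask on trials where the percept occurred against the average mask on trials where it did not.

import numpy as np

rng = np.random.default_rng(0)

n_trials, n_frames = 2000, 45              # hypothetical trial/frame counts
masks = rng.random((n_trials, n_frames))   # per-frame transparency, 0 = fully obscured

# Hypothetical observer: reporting /apa/ is more likely when frames near the
# consonant (frames 20-25 here) are visible; the 0.05 floor and 0.35 ceiling
# mirror the ~5% and ~35% identification rates quoted in the abstract.
weights = np.zeros(n_frames)
weights[20:26] = 1.0
p_apa = 0.05 + 0.30 * (masks @ weights) / weights.sum()
responses = rng.random(n_trials) < p_apa   # True = identified /apa/

# Classification image: mean mask on /apa/ trials minus mean mask on the
# remaining trials. Frames that drove the percept deviate from zero.
ci = masks[responses].mean(axis=0) - masks[~responses].mean(axis=0)

# z-score each frame against a permutation null (response labels shuffled),
# which preserves the number of /apa/ responses in every null sample.
null = np.stack([
    masks[perm].mean(axis=0) - masks[~perm].mean(axis=0)
    for perm in (rng.permutation(responses) for _ in range(1000))
])
z = (ci - null.mean(axis=0)) / null.std(axis=0)
print("informative frames (|z| > 3):", np.flatnonzero(np.abs(z) > 3))

In the paper's actual procedure the same contrast is computed per pixel and per frame, yielding the spatiotemporal maps described above; shifting the audio track relative to the video (the 50-ms and 100-ms visual leads) shifts which frames carry significant weight.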
Pages: 583-601
Page count: 19
Related papers
50 records in total
  • [1] Timing in audiovisual speech perception: A mini review and new psychophysical data
    Venezia, Jonathan H.
    Thurman, Steven M.
    Matchin, William
    George, Sahara E.
    Hickok, Gregory
    ATTENTION PERCEPTION & PSYCHOPHYSICS, 2016, 78 : 583 - 601
  • [2] Neural processing of asynchronous audiovisual speech perception
    Stevenson, Ryan A.
    Altieri, Nicholas A.
    Kim, Sunah
    Pisoni, David B.
    James, Thomas W.
    NEUROIMAGE, 2010, 49 (04) : 3308 - 3318
  • [3] Causal inference and temporal predictions in audiovisual perception of speech and music
    Noppeney, Uta
    Lee, Hwee Ling
    ANNALS OF THE NEW YORK ACADEMY OF SCIENCES, 2018, 1423 (01) : 102 - 116
  • [4] Prediction and constraint in audiovisual speech perception
    Peelle, Jonathan E.
    Sommers, Mitchell S.
    CORTEX, 2015, 68 : 169 - 181
  • [5] The Principle of Inverse Effectiveness in Audiovisual Speech Perception
    van de Rijt, Luuk P. H.
    Roye, Anja
    Mylanus, Emmanuel A. M.
    van Opstal, A. John
    van Wanrooij, Marc M.
    FRONTIERS IN HUMAN NEUROSCIENCE, 2019, 13
  • [6] Crossmodal and incremental perception of audiovisual cues to emotional speech
    Barkhuysen, Pashiera
    Krahmer, Emiel
    Swerts, Marc
    LANGUAGE AND SPEECH, 2010, 53 : 3 - 30
  • [7] Sources of Confusion in Infant Audiovisual Speech Perception Research
    Shaw, Kathleen E.
    Bortfeld, Heather
    FRONTIERS IN PSYCHOLOGY, 2015, 6
  • [8] Audiovisual speech perception: Moving beyond McGurk
    Van Engen, Kristin J.
    Dey, Avanti
    Sommers, Mitchell S.
    Peelle, Jonathan E.
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2022, 152 (06) : 3216 - 3225
  • [9] Audiovisual Binding for Speech Perception in Noise and in Aging
    Ganesh, Attigodu Chandrashekara
    Berthommier, Frederic
    Schwartz, Jean-Luc
    LANGUAGE LEARNING, 2018, 68 : 193 - 220
  • [10] Audiovisual Speech Perception and Eye Gaze Behavior of Adults with Asperger Syndrome
    Saalasti, Satu
    Kätsyri, Jari
    Tiippana, Kaisa
    Laine-Hernandez, Mari
    von Wendt, Lennart
    Sams, Mikko
    JOURNAL OF AUTISM AND DEVELOPMENTAL DISORDERS, 2012, 42 (08) : 1606 - 1615