Audio-visual onset differences are used to determine syllable identity for ambiguous audio-visual stimulus pairs

Cited by: 30
Authors
ten Oever, Sanne [1]
Sack, Alexander T. [1]
Wheat, Katherine L. [1]
Bien, Nina [1,2]
van Atteveldt, Nienke [1,3]
Affiliations
[1] Maastricht Univ, Fac Psychol & Neurosci, NL-6200 MD Maastricht, Netherlands
[2] Univ Luxembourg, EMACS Res Unit, Luxembourg, Luxembourg
[3] Netherlands Inst Neurosci, Neuroimaging & Neuromodeling Grp, Amsterdam, Netherlands
Keywords
audiovisual; temporal cues; audio-visual onset differences; content cues; predictability; detection; MULTISENSORY INTEGRATION; VISUAL SPEECH; CROSSMODAL BINDING; NEURONAL OSCILLATIONS; AUDITORY-CORTEX; PERCEPTION; SOUNDS; MODULATION; SYNCHRONY; VOICES
DOI
10.3389/fpsyg.2013.00331
Chinese Library Classification (CLC) number
B84 [Psychology]
Discipline classification code
04; 0402
Abstract
Content and temporal cues have been shown to interact during audio-visual (AV) speech identification. Typically, the most reliable unimodal cue is used more strongly to identify specific speech features; however, visual cues are only used if the AV stimuli are presented within a certain temporal window of integration (TWI). This suggests that temporal cues denote whether unimodal stimuli belong together, that is, whether they should be integrated. It is not known whether temporal cues also provide information about the identity of a syllable. Since spoken syllables have naturally varying AV onset asynchronies, we hypothesize that for suboptimal AV cues presented within the TWI, information about the natural AV onset differences can aid in speech identification. To test this, we presented low-intensity auditory syllables concurrently with visual speech signals, and varied the stimulus onset asynchronies (SOA) of the AV pair, while participants were instructed to identify the auditory syllables. We revealed that specific speech features (e.g., voicing) were identified by relying primarily on one modality (e.g., auditory). Additionally, we showed a wide window in which visual information influenced auditory perception, that seemed even wider for congruent stimulus pairs. Finally, we found a specific response pattern across the SOA range for syllables that were not reliably identified by the unimodal cues, which we explained as the result of the use of natural onset differences between AV speech signals. This indicates that temporal cues not only provide information about the temporal integration of AV stimuli, but additionally convey information about the identity of AV pairs. These results provide a detailed behavioral basis for further neuro-imaging and stimulation studies to unravel the neurofunctional mechanisms of the audio-visual-temporal interplay within speech perception.
Pages: 13