Audiovisual Asynchrony Detection in Human Speech

被引:48
作者
Maier, Joost X. [1 ]
Di Luca, Massimiliano [1 ]
Noppeney, Uta [1 ]
机构
[1] Max Planck Inst Biol Cybernet, Cognit Neuroimaging Grp, D-72076 Tubingen, Germany
关键词
speech; multisensory integration; synchrony judgment; temporal order judgment; spectral rotation; TEMPORAL-ORDER; SYNCHRONY PERCEPTION; VOWEL; RECALIBRATION; INTEGRATION; ADAPTATION; EXPOSURE; WINDOW; MUSIC;
D O I
10.1037/a0019952
中图分类号
B84 [心理学];
学科分类号
04 ; 0402 ;
摘要
Combining information from the visual and auditory senses can greatly enhance intelligibility of natural speech. Integration of audiovisual speech signals is robust even when temporal offsets are present between the component signals. In the present study, we characterized the temporal integration window for speech and nonspeech stimuli with similar spectrotemporal structure to investigate to what extent humans have adapted to the specific characteristics of natural audiovisual speech. We manipulated spectrotemporal structure of the auditory signal, stimulus length, and task context. Results indicate that the temporal integration window is narrower and more asymmetric for speech than for nonspeech signals. When perceiving audiovisual speech, subjects tolerate visual leading asynchronies, but are nevertheless very sensitive to auditory leading asynchronies that are less likely to occur in natural speech. Thus, speech perception may be fine-tuned to the natural statistics of audiovisual speech, where facial movements always occur before acoustic speech articulation.
引用
收藏
页码:245 / 256
页数:12
相关论文
共 45 条
[1]   A "voice inversion effect?" [J].
Bédard, C ;
Belin, P .
BRAIN AND COGNITION, 2004, 55 (02) :247-249
[2]   SPEECH PERCEPTION UNDER CONDITIONS OF SPECTRAL TRANSFORMATION .1. PHONETIC CHARACTERISTICS [J].
BLESSER, B .
JOURNAL OF SPEECH AND HEARING RESEARCH, 1972, 15 (01) :5-&
[3]   HEARING BY EYE [J].
CAMPBELL, R ;
DODD, B .
QUARTERLY JOURNAL OF EXPERIMENTAL PSYCHOLOGY, 1980, 32 (FEB) :85-99
[4]   Auditory-visual speech perception and synchrony detection for speech and nonspeech signals [J].
Conrey, B ;
Pisoni, DB .
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2006, 119 (06) :4065-4073
[5]  
De la Vaux S.K., 2004, Cognitive Processing, V5, P106
[6]   THE DETECTION OF AUDITORY VISUAL DESYNCHRONY [J].
DIXON, NF ;
SPITZ, L .
PERCEPTION, 1980, 9 (06) :719-721
[7]   A PHYSIOLOGICAL CORRELATIVE OF VOWEL INTENSITY [J].
Fairbanks, Grant .
SPEECH MONOGRAPHS, 1950, 17 (04) :390-395
[8]   Recalibration of audiovisual simultaneity [J].
Fujisaki, W ;
Shimojo, S ;
Kashino, M ;
Nishida, S .
NATURE NEUROSCIENCE, 2004, 7 (07) :773-778
[9]   Detection of auditory (cross-spectral) and auditory-visual (cross-modal) synchrony [J].
Grant, KW ;
van Wassenhove, V ;
Poeppel, D .
SPEECH COMMUNICATION, 2004, 44 (1-4) :43-53
[10]   The use of visible speech cues for improving auditory detection of spoken sentences [J].
Grant, KW ;
Seitz, PF .
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2000, 108 (03) :1197-1208