Learning to recognize unfamiliar faces from fine-phonetic detail in visual speech

被引:0
|
作者
Jesse, Alexandra [1 ]
机构
[1] Univ Massachusetts, Dept Psychol & Brain Sci, 135 Hicks Way, Amherst, MA 01003 USA
关键词
Speech; Talker variability; Talker identification; Learning; Face perception; TALKER VARIABILITY; FACIAL MOTION; BIOLOGICAL MOTION; SPEAKING RATE; PERCEPTION; INFORMATION; MOVEMENT; DISPLAYS; IDENTIFICATION; FAMILIARITY;
D O I
10.3758/s13414-025-03049-y
中图分类号
B84 [心理学];
学科分类号
04 ; 0402 ;
摘要
How speech is realized varies across talkers but can be somewhat consistent within a talker. Humans are sensitive to these idiosyncrasies when perceiving auditory speech, but also, in face-to-face communications, when perceiving their visual speech. Our recent work has shown that humans can also use talker idiosyncrasies seen in how talkers produce sentences to rapidly learn to recognize unfamiliar talkers, suggesting that visual speech information can be used for speech perception and talker recognition. However, in learning from sentences, learners may focus only on global information about the talker, such as talker-specific realizations of prosody and rate. The present study tested whether human perceivers can learn the identity of the talker based solely on fine-phonetic detail in the dynamic realization of visual speech alone. Participants learned to identify talkers from point-light displays showing them uttering isolated words. These point-light displays isolated the dynamic speech information, while discarding static information about the talker's face. No sound was presented. Feedback was given only during training. Test included point-light displays of familiar words from training and of novel words. Participants learned to recognize two and four talkers from the word-level dynamics of visual speech from very little exposure. The established representations allowed talker recognition independent of linguistic content-that is, even from novel words. Spoken words therefore contain sufficient indexical information in their fine-phonetic detail for perceivers to acquire dynamic facial representations for unfamiliar talkers that allows generalization across words. Dynamic representations of talking faces are formed for the recognition of unfamiliar faces.
引用
收藏
页码:936 / 951
页数:16
相关论文
共 37 条