Multimodal Integration of Dynamic Audio-Visual Cues in the Communication of Agreement and Disagreement

被引:13
|
作者
Mehu, Marc [1 ,2 ]
van der Maaten, Laurens [3 ]
机构
[1] Univ Geneva, Swiss Ctr Affect Sci, CH-1205 Geneva, Switzerland
[2] Webster Vienna Private Univ, Dept Psychol, A-1020 Vienna, Austria
[3] Delft Univ Technol, Dept Comp Sci, NL-2628 CD Delft, Netherlands
关键词
Multimodal communication; Agreement; Disagreement; Social perception; Optical flow; FACIAL EXPRESSIONS; NONVERBAL BEHAVIOR; EMPATHIC ACCURACY; CODING SYSTEM; EMOTION; PERSONALITY; PERCEPTION; EVOLUTION; JUDGMENTS; DOMINANCE;
D O I
10.1007/s10919-014-0192-2
中图分类号
B84 [心理学];
学科分类号
04 ; 0402 ;
摘要
Recent research has stressed the importance of using multimodal and dynamic features to investigate the role of nonverbal behavior in social perception. This paper examines the influence of low-level visual and auditory cues on the communication of agreement, disagreement, and the ability to convince others. In addition, we investigate whether the judgment of these attitudes depends on ratings of socio-emotional dimensions such as dominance, arousal, and valence. The material we used consisted of audio-video excerpts that represent statements of agreement and disagreement, as well as neutral utterances taken from political discussions. Each excerpt was rated on a number of dimensions: agreement, disagreement, dominance, valence, arousal, and convincing power in three rating conditions: audio-only, video-only, and audio-video. We extracted low-level dynamic visual features using optical flow. Auditory features consisted of pitch measurements, vocal intensity, and articulation rate. Results show that judges were able to distinguish statements of disagreement from agreement and neutral utterances on the basis of nonverbal cues alone, in particular, when both auditory and visual information were present. Visual features were more influential when presented along with auditory features. Perceivers mainly used changes in pitch and the maximum speed of vertical movements to infer agreement and disagreement, and targets appeared more convincing when they showed consistent and rapid movements on the vertical plane. The effect of nonverbal features on ratings of agreement and disagreement was completely mediated by ratings of dominance, valence, and arousal, indicating that the impact of low-level audio-visual features on the perception of agreement and disagreement depends on the perception of fundamental socio-emotional dimensions.
引用
收藏
页码:569 / 597
页数:29
相关论文
共 50 条
  • [1] Multimodal Integration of Dynamic Audio–Visual Cues in the Communication of Agreement and Disagreement
    Marc Mehu
    Laurens van der Maaten
    Journal of Nonverbal Behavior, 2014, 38 : 569 - 597
  • [2] Audio-visual integration in multimodal communication
    Chen, T
    Rao, RR
    PROCEEDINGS OF THE IEEE, 1998, 86 (05) : 837 - 852
  • [3] Audio-visual interaction in multimodal communication
    Chellappa, R
    Chen, TH
    Katsaggelos, A
    IEEE SIGNAL PROCESSING MAGAZINE, 1997, 14 (04) : 37 - 38
  • [4] Audio-visual integration of emotional cues in song
    Thompson, William Forde
    Russo, Frank A.
    Quinto, Lena
    COGNITION & EMOTION, 2008, 22 (08) : 1457 - 1470
  • [5] Multimodal integration: Audio-visual integration by swarming mosquitoes
    Bomphrey, Richard J.
    CURRENT BIOLOGY, 2024, 34 (18) : R866 - R868
  • [6] Multimodal and Temporal Perception of Audio-visual Cues for Emotion Recognition
    Ghaleb, Esam
    Popa, Mirela
    Asteriadis, Stylianos
    2019 8TH INTERNATIONAL CONFERENCE ON AFFECTIVE COMPUTING AND INTELLIGENT INTERACTION (ACII), 2019,
  • [7] EEG Guided Multimodal Lie Detection with Audio-Visual Cues
    Javaid, Hamza
    Dilawari, Aniqa
    Khan, Usman Ghani
    Wajid, Bilal
    PROCEEDINGS OF 2ND IEEE INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE (ICAI 2022), 2022, : 71 - 78
  • [8] Effects of spatial congruity on audio-visual multimodal integration
    Teder-Sälejärvi, WA
    Di Russo, F
    McDonald, JJ
    Hillyard, SA
    JOURNAL OF COGNITIVE NEUROSCIENCE, 2005, 17 (09) : 1396 - 1409
  • [9] Statistical multimodal integration for audio-visual speech processing
    Nakamura, S
    IEEE TRANSACTIONS ON NEURAL NETWORKS, 2002, 13 (04): : 854 - 866
  • [10] Non-invasive extraction of audio-visual cues for multimodal applications
    Kabre, H
    HYBRID IMAGE AND SIGNAL PROCESSING VI, 1998, 3389 : 133 - 138