Contributions of audio and visual modalities to perception of Mandarin Chinese emotions in valence-arousal space

被引:1
|
作者
Li, Yongwei [1 ]
Li, Aijun [2 ]
Tao, Jianhua [3 ,4 ]
Li, Feng [5 ]
Erickson, Donna [6 ]
Akagi, Masato [7 ]
机构
[1] Chinese Acad Sci, Inst Psychol, CAS Key Lab Behav Sci, Beijing 100045, Peoples R China
[2] Chinese Acad Social Sci, Inst Linguist, Corpus & Computat Linguist Ctr, Beijing 100732, Peoples R China
[3] Tsinghua Univ, Dept Automat, Beijing 100084, Peoples R China
[4] Tsinghua Univ, Beijing Natl Res Ctr Informat Sci & Technol, Beijing 100190, Peoples R China
[5] Anhui Univ Finance & Econ, Sch Management Sci & Engn, Bengbu 233030, Anhui, Peoples R China
[6] Haskins Labs Inc, New Haven, CT 06511 USA
[7] Japan Adv Inst Sci & Technol, Grad Sch Adv Sci & Technol, Nomi, Ishikawa 9231292, Japan
基金
中国国家自然科学基金;
关键词
Dimensional emotion; Valence-arousal; Emotion perception; Audio-visual; AUDIOVISUAL INTEGRATION; EXPRESSION; SIGNALS; MODEL; FACE; CUES;
D O I
10.1250/ast.e24.41
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Emotions are usually perceived by multimodal cues for human communications; in recent years, emotions have been studied from the perspective of dimensional approaches. Investigation of audio and video cues to emotion perception in terms of categories of emotion has been relatively extensively conducted, but the contribution of audio and video cues to emotion perception in dimensional space is relatively under-investigated, especially in Mandarin Chinese. In this present study, three psychoacoustic experiments were conducted to investigate the contributions of audio, visual, and audio-visual modalities to emotional perception in the valence and arousal space. Audioonly, video-only, and audio-video modalities were presented to native Chinese subjects with normal hearing and vision for perceptual ratings of emotion in the valence and arousal dimensions. Results suggested that (1) different modalities contribute differently to perceiving valence and arousal dimensions; (2) compared to video-only modality, audio-only modality generally decreases arousal and valence at lower levels, and increases arousal and valence at higher levels; (3) the video-only modality plays an important role in separating anger and happiness emotions in the valence space.
引用
收藏
页码:55 / 63
页数:9
相关论文
共 45 条
  • [31] Stress in interactive applications: analysis of the valence-arousal space based on physiological signals and self-reported data
    Alexandros Liapis
    Christos Katsanos
    Dimitris G. Sotiropoulos
    Nikos Karousos
    Michalis Xenos
    Multimedia Tools and Applications, 2017, 76 : 5051 - 5071
  • [32] Does restriction of pitch variation affect the perception of vocal emotions in Mandarin Chinese?
    Wang, Ting
    Lee, Yong-cheol
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2015, 137 (01): : EL117 - EL123
  • [33] Deep Learning-Based Approach for Continuous Affect Prediction From Facial Expression Images in Valence-Arousal Space
    Hwooi, Stephen Khor Wen
    Othmani, Alice
    Sabri, Aznul Qalid Md
    IEEE ACCESS, 2022, 10 : 96053 - 96065
  • [34] The development of visual speech perception in Mandarin Chinese-speaking children
    Chen, Liang
    Lei, Jianghua
    CLINICAL LINGUISTICS & PHONETICS, 2017, 31 (7-9) : 514 - 525
  • [35] How fleeting emotions affect hazard perception and steering while driving: The impact of image arousal and valence
    Trick, Lana M.
    Brandigampola, Seneca
    Enns, James T.
    ACCIDENT ANALYSIS AND PREVENTION, 2012, 45 : 222 - 229
  • [36] Contributions of emotional valence and arousal to visual word processing in sentences: Central and peripheral psychophysiological indicators
    Bayer, Mareike
    Schacht, Annekathrin
    Sommer, Werner
    INTERNATIONAL JOURNAL OF PSYCHOLOGY, 2008, 43 (3-4) : 208 - 208
  • [37] Faciliation of Mandarin tone perception by visual speech in clear and degraded audio: Implications for cochlear implants
    Smith, D. (da.smith@uws.edu.au), 1600, Acoustical Society of America (131):
  • [38] Faciliation of Mandarin tone perception by visual speech in clear and degraded audio: Implications for cochlear implants
    Smith, Damien
    Burnham, Denis
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2012, 131 (02): : 1480 - 1489
  • [39] English vowel perception by non-native speakers: impact of audio and visual training modalities
    Pereira Reyes, Yasna
    Hazan, Valerie
    ONOMAZEIN, 2021, (51): : 111 - 136
  • [40] Contributions of local speech encoding and functional connectivity to audio-visual speech perception
    Giordano, Bruno L.
    Ince, Robin A. A.
    Gross, Joachim
    Schyns, Philippe G.
    Panzeri, Stefano
    Kayser, Christoph
    ELIFE, 2017, 6