Multimodal emotion recognition based on peak frame selection from video

被引:24
|
作者
Zhalehpour, Sara [1 ]
Akhtar, Zahid [2 ]
Erdem, Cigdem Eroglu [3 ]
机构
[1] INRS EMT, Montreal, PQ, Canada
[2] Univ Udine, I-33100 Udine, Italy
[3] Bahcesehir Univ, Istanbul, Turkey
关键词
Affective computing; Facial expression recognition; Apex frame; Audio-visual emotion recognition; FUSION;
D O I
10.1007/s11760-015-0822-0
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
We present a fully automatic multimodal emotion recognition system based on three novel peak frame selection approaches using the video channel. Selection of peak frames (i.e., apex frames) is an important preprocessing step for facial expression recognition as they contain the most relevant information for classification. Two of the three proposed peak frame selection methods (i.e., MAXDIST and DEND-CLUSTER) do not employ any training or prior learning. The third method proposed for peak frame selection (i.e., EIFS) is based on measuring the "distance" of the expressive face from the subspace of neutral facial expression, which requires a prior learning step to model the subspace of neutral face shapes. The audio and video modalities are fused at the decision level. The subject-independent audio-visual emotion recognition system has shown promising results on two databases in two different languages (eNTERFACE and BAUM-1a).
引用
收藏
页码:827 / 834
页数:8
相关论文
共 50 条
  • [31] Character emotion recognition algorithm in small sample video based on multimodal feature fusion
    Xie, Jian
    Chu, Dan
    INTERNATIONAL JOURNAL OF BIOMETRICS, 2025, 17 (1-2) : 1 - 14
  • [32] A multimodal emotion recognition model integrating speech, video and MoCAP
    Jia, Ning
    Zheng, Chunjun
    Sun, Wei
    MULTIMEDIA TOOLS AND APPLICATIONS, 2022, 81 (22) : 32265 - 32286
  • [33] A Dynamic Frame Selection Framework for Fast Video Recognition
    Wu, Zuxuan
    Li, Hengduo
    Xiong, Caiming
    Jiang, Yu-Gang
    Davis, Larry Steven
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2022, 44 (04) : 1699 - 1711
  • [34] AdaFrame: Adaptive Frame Selection for Fast Video Recognition
    Wu, Zuxuan
    Xiong, Caiming
    Ma, Chih-Yao
    Socher, Richard
    Davis, Larry S.
    2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 1278 - 1287
  • [35] Multimodal Emotion Recognition Based on the Decoupling of Emotion and Speaker Information
    Gajsek, Rok
    Struc, Vitomir
    Mihelic, France
    TEXT, SPEECH AND DIALOGUE, 2010, 6231 : 275 - 282
  • [36] Multimodal Emotion Recognition Based on Feature Fusion
    Xu, Yurui
    Wu, Xiao
    Su, Hang
    Liu, Xiaorui
    2022 INTERNATIONAL CONFERENCE ON ADVANCED ROBOTICS AND MECHATRONICS (ICARM 2022), 2022, : 7 - 11
  • [37] Multimodal Emotion Recognition in Conversation Based on Hypergraphs
    Li, Jiaze
    Mei, Hongyan
    Jia, Liyun
    Zhang, Xing
    ELECTRONICS, 2023, 12 (22)
  • [38] Video-based multimodal spontaneous emotion recognition using facial expressions and physiological signals
    Ouzar, Yassine
    Bousefsaf, Frederic
    Djeldjli, Djamaleddine
    Maaoui, Choubeila
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS, CVPRW 2022, 2022, : 2459 - 2468
  • [39] Multimodal Emotion Recognition and State Analysis of Classroom Video and Audio Based on Deep Neural Network
    Li, Mingyong
    Liu, Mingyue
    Jiang, Zheng
    Zhao, Zongwei
    Zhang, Jiayan
    Ge, Mingyuan
    Duan, Huiming
    Wang, Yanxia
    JOURNAL OF INTERCONNECTION NETWORKS, 2022, 22 (SUPP04)
  • [40] A Multimodal Driver Emotion Recognition Algorithm Based on the Audio and Video Signals in Internet of Vehicles Platform
    Ying, Na
    Jiang, Yinhe
    Guo, Chunsheng
    Zhou, Di
    Zhao, Jian
    IEEE INTERNET OF THINGS JOURNAL, 2024, 11 (22): : 35812 - 35824