Multimodal emotion recognition based on peak frame selection from video

被引：24

作者：

Zhalehpour, Sara ^{[1
]}

Akhtar, Zahid ^{[2
]}

Erdem, Cigdem Eroglu ^{[3
]}

机构：

[1] INRS EMT, Montreal, PQ, Canada

[2] Univ Udine, I-33100 Udine, Italy

[3] Bahcesehir Univ, Istanbul, Turkey

来源：

SIGNAL IMAGE AND VIDEO PROCESSING | 2016年 / 10卷 / 05期

关键词：

Affective computing; Facial expression recognition; Apex frame; Audio-visual emotion recognition; FUSION;

D O I：

10.1007/s11760-015-0822-0

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

We present a fully automatic multimodal emotion recognition system based on three novel peak frame selection approaches using the video channel. Selection of peak frames (i.e., apex frames) is an important preprocessing step for facial expression recognition as they contain the most relevant information for classification. Two of the three proposed peak frame selection methods (i.e., MAXDIST and DEND-CLUSTER) do not employ any training or prior learning. The third method proposed for peak frame selection (i.e., EIFS) is based on measuring the "distance" of the expressive face from the subspace of neutral facial expression, which requires a prior learning step to model the subspace of neutral face shapes. The audio and video modalities are fused at the decision level. The subject-independent audio-visual emotion recognition system has shown promising results on two databases in two different languages (eNTERFACE and BAUM-1a).

引用

页码：827 / 834

页数：8

共 50 条

[31] Character emotion recognition algorithm in small sample video based on multimodal feature fusion
Xie, Jian
Chu, Dan
INTERNATIONAL JOURNAL OF BIOMETRICS, 2025, 17 (1-2) : 1 - 14
[32] A multimodal emotion recognition model integrating speech, video and MoCAP
Jia, Ning
Zheng, Chunjun
Sun, Wei
MULTIMEDIA TOOLS AND APPLICATIONS, 2022, 81 (22) : 32265 - 32286
[33] A Dynamic Frame Selection Framework for Fast Video Recognition
Wu, Zuxuan
Li, Hengduo
Xiong, Caiming
Jiang, Yu-Gang
Davis, Larry Steven
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2022, 44 (04) : 1699 - 1711
[34] AdaFrame: Adaptive Frame Selection for Fast Video Recognition
Wu, Zuxuan
Xiong, Caiming
Ma, Chih-Yao
Socher, Richard
Davis, Larry S.
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 1278 - 1287
[35] Multimodal Emotion Recognition Based on the Decoupling of Emotion and Speaker Information
Gajsek, Rok
Struc, Vitomir
Mihelic, France
TEXT, SPEECH AND DIALOGUE, 2010, 6231 : 275 - 282
[36] Multimodal Emotion Recognition Based on Feature Fusion
Xu, Yurui
Wu, Xiao
Su, Hang
Liu, Xiaorui
2022 INTERNATIONAL CONFERENCE ON ADVANCED ROBOTICS AND MECHATRONICS (ICARM 2022), 2022, : 7 - 11
[37] Multimodal Emotion Recognition in Conversation Based on Hypergraphs
Li, Jiaze
Mei, Hongyan
Jia, Liyun
Zhang, Xing
ELECTRONICS, 2023, 12 (22)
[38] Video-based multimodal spontaneous emotion recognition using facial expressions and physiological signals
Ouzar, Yassine
Bousefsaf, Frederic
Djeldjli, Djamaleddine
Maaoui, Choubeila
2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS, CVPRW 2022, 2022, : 2459 - 2468
[39] Multimodal Emotion Recognition and State Analysis of Classroom Video and Audio Based on Deep Neural Network
Li, Mingyong
Liu, Mingyue
Jiang, Zheng
Zhao, Zongwei
Zhang, Jiayan
Ge, Mingyuan
Duan, Huiming
Wang, Yanxia
JOURNAL OF INTERCONNECTION NETWORKS, 2022, 22 (SUPP04)
[40] A Multimodal Driver Emotion Recognition Algorithm Based on the Audio and Video Signals in Internet of Vehicles Platform
Ying, Na
Jiang, Yinhe
Guo, Chunsheng
Zhou, Di
Zhao, Jian
IEEE INTERNET OF THINGS JOURNAL, 2024, 11 (22): : 35812 - 35824

← 1 2 3 4 5 →