Context-aware Cascade Attention-based RNN for Video Emotion Recognition

Cited by: 0
Authors
Sun, Man-Chin [1]
Hsu, Shih-Huan [1]
Yang, Min-Chun [1]
Chien, Jen-Hsien [1]
Affiliations
[1] Emotibot Inc, Taipei, Taiwan
Keywords
emotion recognition; video classification; action recognition; spatiotemporal model; human-computer interaction; HCI;
DOI
Not available
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Emotion recognition can provide crucial information about the user in many applications of human-computer interaction (HCI) systems. Most current research on visual emotion recognition focuses on facial features. However, context information, including the surrounding environment and the human body, can provide extra clues that help recognize emotion more accurately. Inspired by the "sequence to sequence" model for neural machine translation, which models input and output sequences with an encoder and a decoder, respectively, in a recurrent neural network (RNN) architecture, this work proposes a novel architecture, "CACA-RNN". The proposed network consists of two RNNs in a cascaded architecture that process both context and facial information to perform video emotion classification. Results of the model were submitted to the video emotion recognition sub-challenge of the Multimodal Emotion Recognition Challenge (MEC2017). CACA-RNN outperforms the MEC2017 baseline (mAP of 21.7%): it achieved an mAP of 45.51% on the testing set in the video-only challenge.
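The abstract only states that two RNNs are cascaded to process context and facial information; it does not say how they are connected. The following is a minimal sketch of one possible reading, assuming that the final hidden state of a context RNN initializes a face RNN, in the spirit of an encoder handing its state to a decoder. Everything in it is hypothetical and not taken from the paper: the names `SimpleRNN` and `cascade_classify`, the plain tanh recurrence, the random untrained weights, the feature dimensions, and the number of classes. The attention mechanism implied by the model's name is omitted.

```python
import math
import random


def make_matrix(rows, cols, rng):
    """Random weight matrix as a list of rows (untrained, illustration only)."""
    return [[rng.uniform(-0.1, 0.1) for _ in range(cols)] for _ in range(rows)]


def matvec(matrix, vector):
    """Matrix-vector product for list-based matrices."""
    return [sum(w * x for w, x in zip(row, vector)) for row in matrix]


def softmax(scores):
    """Numerically stable softmax over a list of scores."""
    top = max(scores)
    exps = [math.exp(s - top) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


class SimpleRNN:
    """Plain tanh recurrence; a stand-in for whatever RNN cell the paper uses."""

    def __init__(self, input_size, hidden_size, rng):
        self.hidden_size = hidden_size
        self.w_in = make_matrix(hidden_size, input_size, rng)
        self.w_h = make_matrix(hidden_size, hidden_size, rng)

    def run(self, sequence, h0=None):
        """Consume a sequence of feature vectors and return the last hidden state."""
        h = list(h0) if h0 is not None else [0.0] * self.hidden_size
        for x in sequence:
            from_input = matvec(self.w_in, x)
            from_state = matvec(self.w_h, h)
            h = [math.tanh(a + b) for a, b in zip(from_input, from_state)]
        return h


def cascade_classify(context_seq, face_seq, context_rnn, face_rnn, w_out):
    """Assumed cascade: the context summary seeds the face RNN's initial state."""
    h_context = context_rnn.run(context_seq)
    h_face = face_rnn.run(face_seq, h0=h_context)
    return softmax(matvec(w_out, h_face))


# Placeholder sizes, chosen arbitrarily for the sketch.
HIDDEN, CTX_DIM, FACE_DIM, NUM_CLASSES, NUM_FRAMES = 8, 6, 4, 5, 10

rng = random.Random(0)
context_rnn = SimpleRNN(CTX_DIM, HIDDEN, rng)
face_rnn = SimpleRNN(FACE_DIM, HIDDEN, rng)
w_out = make_matrix(NUM_CLASSES, HIDDEN, rng)

# Random per-frame features standing in for real context and face descriptors.
context_seq = [[rng.uniform(-1, 1) for _ in range(CTX_DIM)] for _ in range(NUM_FRAMES)]
face_seq = [[rng.uniform(-1, 1) for _ in range(FACE_DIM)] for _ in range(NUM_FRAMES)]

probs = cascade_classify(context_seq, face_seq, context_rnn, face_rnn, w_out)
print(probs)
```

Both RNNs share one hidden size here so that the context state can be passed directly as the face RNN's initial state; a learned projection between the two would be an equally plausible design.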
Pages: 6
Related papers
50 in total
  • [31] Sequential Interactive Biased Network for Context-Aware Emotion Recognition
    Li, Xinpeng
    Peng, Xiaojiang
    Ding, Changxing
    2021 INTERNATIONAL JOINT CONFERENCE ON BIOMETRICS (IJCB 2021), 2021,
  • [32] Brain Activity Recognition Method Based on Attention-Based RNN Mode
    Zhou, Song
    Gao, Tianhan
    APPLIED SCIENCES-BASEL, 2021, 11 (21):
  • [33] Context-aware emotion cause analysis with multi-attention-based neural network
    Li, Xiangju
    Feng, Shi
    Wang, Daling
    Zhang, Yifei
    KNOWLEDGE-BASED SYSTEMS, 2019, 174 : 205 - 218
  • [34] Stacked Multimodal Attention Network for Context-Aware Video Captioning
    Zheng, Yi
    Zhang, Yuejie
    Feng, Rui
    Zhang, Tao
    Fan, Weiguo
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2022, 32 (01) : 31 - 42
  • [35] An Attention-based Activity Recognition for Egocentric Video
    Matsuo, Kenji
    Yamada, Kentaro
    Ueno, Satoshi
    Naito, Sei
    2014 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW), 2014, : 565 - +
  • [36] Context-Aware EEG-Based Perceived Stress Recognition based on Emotion Transition Paradigm
    Liu, Jiyao
    He, Lang
    Chen, Zhiwei
    Chen, Ziyi
    Hao, Yu
    Jiang, Dongmei
    2023 11TH INTERNATIONAL CONFERENCE ON AFFECTIVE COMPUTING AND INTELLIGENT INTERACTION WORKSHOPS AND DEMOS, ACIIW, 2023,
  • [37] MultiEMO: An Attention-Based Correlation-Aware Multimodal Fusion Framework for Emotion Recognition in Conversations
    Shi, Tao
    Huang, Shao-Lun
    PROCEEDINGS OF THE 61ST ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2023): LONG PAPERS, VOL 1, 2023, : 14752 - 14766
  • [38] Siamese Attention-Based LSTM for Speech Emotion Recognition
    Nizamidin, Tashpolat
    Zhao, Li
    Liang, Ruiyu
    Xie, Yue
    Hamdulla, Askar
    IEICE TRANSACTIONS ON FUNDAMENTALS OF ELECTRONICS COMMUNICATIONS AND COMPUTER SCIENCES, 2020, E103A (07) : 937 - 941
  • [39] An Attention-based Ensemble Model for Emotion Recognition in Conversation
    Farooq, Misbah
    De Silva, Varuna
    Shi, Xiyu
    2024 14TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION SYSTEMS, ICPRS, 2024,
  • [40] Incorporating structured emotion commonsense knowledge and interpersonal relation into context-aware emotion recognition
    Chen, Jing
    Yang, Tao
    Huang, Ziqiang
    Wang, Kejun
    Liu, Meichen
    Lyu, Chunyan
    APPLIED INTELLIGENCE, 2023, 53 : 4201 - 4217