Context-aware Cascade Attention-based RNN for Video Emotion Recognition

被引:0
|
作者
Sun, Man-Chin [1 ]
Hsu, Shih-Huan [1 ]
Yang, Min-Chun [1 ]
Chien, Jen-Hsien [1 ]
机构
[1] Emotibot Inc, Taipei, Taiwan
关键词
emotion recognition; video classification; action recognition; spatiotemporal model; human-computer interaction; HCI;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Emotion recognition can provide crucial information about the user in many applications when building human-computer interaction (HCI) systems. Most of current researches on visual emotion recognition are focusing on exploring facial features. However, context information including surrounding environment and human body can also provide extra clues to recognize emotion more accurately. Inspired by "sequence to sequence model" for neural machine translation, which models input and output sequences by an encoder and a decoder in recurrent neural network (RNN) architecture respectively, a novel architecture, "CACA-RNN", is proposed in this work. The proposed network consists of two RNNs in a cascaded architecture to process both context and facial information to perform video emotion classification. Results of the model were submitted to video emotion recognition sub-challenge in Multimodal Emotion Recognition Challenge (MEC2017). CACA-RNN outperforms the MEC2017 baseline (mAP of 21.7%): it achieved mAP of 45.51% on the testing set in the video only challenge.
引用
收藏
页数:6
相关论文
共 50 条
  • [41] Incorporating structured emotion commonsense knowledge and interpersonal relation into context-aware emotion recognition
    Chen, Jing
    Yang, Tao
    Huang, Ziqiang
    Wang, Kejun
    Liu, Meichen
    Lyu, Chunyan
    APPLIED INTELLIGENCE, 2023, 53 (04) : 4201 - 4217
  • [42] Attention-Based Dense LSTM for Speech Emotion Recognition
    Xie, Yue
    Liang, Ruiyu
    Liang, Zhenlin
    Zhao, Li
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2019, E102D (07): : 1426 - 1429
  • [43] Speech Emotion Recognition using Context-Aware Dilated Convolution Network
    Kakuba, Samuel
    Han, Dong Seog
    2022 27TH ASIA PACIFIC CONFERENCE ON COMMUNICATIONS (APCC 2022): CREATING INNOVATIVE COMMUNICATION TECHNOLOGIES FOR POST-PANDEMIC ERA, 2022, : 601 - 604
  • [44] Context-Aware Cross-Attention for Skeleton-Based Human Action Recognition
    Fan, Yanbo
    Weng, Shuchen
    Zhang, Yong
    Shi, Boxin
    Zhang, Yi
    IEEE ACCESS, 2020, 8 (08): : 15280 - 15290
  • [45] An Efficient Context-Aware Music Recommendation Based on Emotion and Time Context
    Selvi, C.
    Sivasankar, E.
    DATA SCIENCE AND BIG DATA ANALYTICS, 2019, 16 : 215 - 228
  • [46] Context-aware Interactive Attention for Multi-modal Sentiment and Emotion Analysis
    Chauhan, Dushyant Singh
    Akhtar, Md Shad
    Ekbal, Asif
    Bhattacharyya, Pushpak
    2019 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING AND THE 9TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING (EMNLP-IJCNLP 2019): PROCEEDINGS OF THE CONFERENCE, 2019, : 5647 - 5657
  • [47] Context-Aware Multi-View Attention Networks for Emotion Cause Extraction
    Xiao, Xinglin
    Wei, Penghui
    Mao, Wenji
    Wang, Lei
    2019 IEEE INTERNATIONAL CONFERENCE ON INTELLIGENCE AND SECURITY INFORMATICS (ISI), 2019, : 128 - 133
  • [48] CONTEXT-AWARE GENERATION-BASED NET FOR MULTI-LABEL VISUAL EMOTION RECOGNITION
    Ruan, Shulan
    Zhang, Kun
    Wang, Yijun
    Tao, Hanqing
    He, Weidong
    Lv, Guangyi
    Chen, Enhong
    2020 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME), 2020,
  • [49] Context-Aware Contribution Estimation for Feature Aggregation in Video Face Recognition
    Zhang, Meng
    Liu, Rujie
    Deguchi, Daisuke
    Murase, Hiroshi
    IEEE ACCESS, 2022, 10 : 79301 - 79310
  • [50] Significance of handcrafted features in human activity recognition with attention-based RNN models
    Abraham, Sonia
    James, Rekha K.
    INTERNATIONAL JOURNAL OF ELECTRICAL AND COMPUTER ENGINEERING SYSTEMS, 2023, 14 (10) : 1151 - 1163