Enhancing Feature Correlation for Bi-Modal Group Emotion Recognition

被引:6
|
作者
Liu, Ningjie [1 ]
Fang, Yuchun [1 ]
Guo, Yike [1 ,2 ]
机构
[1] Shanghai Univ, Sch Comp Engn & Sci, Shanghai, Peoples R China
[2] Imperial Coll London, Dept Comp, London, England
基金
中国国家自然科学基金;
关键词
Group emotion recognition; B-CNN; Non-local block; AROUSAL;
D O I
10.1007/978-3-030-00767-6_3
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Group emotion recognition in the wild has received much attention in computer vision community. It is a very challenge issue, due to interactions taking place between various numbers of people, different occlusions. According to human cognitive and behavioral researches, background and facial expression play a dominating role in the perception of group's mood. Hence, in this paper, we propose a novel approach that combined these two features for image-based group emotion recognition with feature correlation enhancement. The feature enhancement is mainly reflected in two parts. For facial expression feature extraction, we plug non-local blocks into Xception network to enhance the feature correlation of different positions in low-level, which can avoid the fast loss of position information of the traditional CNNs and effectively enhance the network's feature representation capability. For global scene information, we build a bilinear convolutional neural network (B-CNN) consisting of VGG16 networks to model local pairwise feature interactions in a translationally invariant manner. The experimental results show that the fused feature could effectively improve the performance.
引用
收藏
页码:24 / 34
页数:11
相关论文
共 50 条
  • [1] Emotion Recognition Based on Meta Bi-Modal Learning Model
    Li Z.
    Sun Y.
    Zhang X.
    Zhou Y.
    Beijing Youdian Daxue Xuebao/Journal of Beijing University of Posts and Telecommunications, 2023, 46 (05): : 87 - 105
  • [2] Bi-Modal Bi-Task Emotion Recognition Based on Transformer Architecture
    Song, Yu
    Zhou, Qi
    APPLIED ARTIFICIAL INTELLIGENCE, 2024, 38 (01)
  • [3] Bi-modal emotion recognition from expressive face and body gestures
    Gunes, Hatice
    Piccardi, Massimo
    JOURNAL OF NETWORK AND COMPUTER APPLICATIONS, 2007, 30 (04) : 1334 - 1345
  • [4] Gait Emotion Recognition Using a Bi-modal Deep Neural Network
    Bhatia, Yajury
    Bari, A. S. M. Hossain
    Gavrilovn, Marina
    ADVANCES IN VISUAL COMPUTING, ISVC 2022, PT I, 2022, 13598 : 46 - 60
  • [5] BI-MODAL COMPOSITIONAL NETWORK FOR FEATURE DISENTANGLEMENT
    Panda, Aditya
    Santra, Bikash
    Mukherjee, Dipti Prasad
    2022 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2022, : 3051 - 3055
  • [6] Enhancing Visual Question Answering through Bi-Modal Feature Fusion: Performance Analysis
    Mao, Keyu
    6TH INTERNATIONAL CONFERENCE ON IMAGE PROCESSING AND MACHINE VISION, IPMV 2024, 2024, : 115 - 122
  • [7] Automatic bi-modal emotion recognition system based on fusion of facial expressions and emotion extraction from speech
    Datcu, Dragos
    Rothkrantz, Leon J. M.
    2008 8TH IEEE INTERNATIONAL CONFERENCE ON AUTOMATIC FACE & GESTURE RECOGNITION (FG 2008), VOLS 1 AND 2, 2008, : 606 - 607
  • [8] Bi-modal Regression for Apparent Personality Trait Recognition
    Rai, Nishant
    2016 23RD INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2016, : 55 - 60
  • [9] Audio-lingual and visual-facial emotion recognition: Towards a bi-modal interaction system
    Alepis, E.
    Stathopoulou, I. -O.
    Virvou, M.
    Tsihrintzis, G. A.
    Kabassi, K.
    22ND INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE (ICTAI 2010), PROCEEDINGS, VOL 2, 2010, : 274 - 281
  • [10] Bi-modal OnPLS
    Lofstedt, Tommy
    Eriksson, Lennart
    Wormbs, Gunilla
    Trygg, Johan
    JOURNAL OF CHEMOMETRICS, 2012, 26 (06) : 236 - 245