Context-Dependent Domain Adversarial Neural Network for Multimodal Emotion Recognition

被引:19
|
作者
Lian, Zheng [1 ,3 ]
Tao, Jianhua [1 ,2 ,3 ]
Liu, Bin [1 ]
Huang, Jian [1 ,3 ]
Yang, Zhanlei [4 ]
Li, Rongjun [4 ]
机构
[1] CASIA, Natl Lab Pattern Recognit, Beijing, Peoples R China
[2] CAS Ctr Excellence Brain Sci & Intelligence Techn, Beijing, Peoples R China
[3] Univ Chinese Acad Sci, Sch Artificial Intelligence, Beijing, Peoples R China
[4] Huawei Technol Co LTD, Beijing, Peoples R China
来源
INTERSPEECH 2020 | 2020年
基金
中国国家自然科学基金;
关键词
emotion recognition; domain adversarial learning; speaker-independent representations; contextual information; multimodal features; SPEECH;
D O I
10.21437/Interspeech.2020-1705
中图分类号
R36 [病理学]; R76 [耳鼻咽喉科学];
学科分类号
100104 ; 100213 ;
摘要
Emotion recognition remains a complex task due to speaker variations and low-resource training samples. To address these difficulties, we focus on the domain adversarial neural networks (DANN) for emotion recognition. The primary task is to predict emotion labels. The secondary task is to learn a common representation where speaker identities can not be distinguished. By using this approach, we bring the representations of different speakers closer. Meanwhile, through using the unlabeled data in the training process, we alleviate the impact of low-resource training samples. In the meantime, prior work found that contextual information and multimodal features are important for emotion recognition. However, previous DANN based approaches ignore these information, thus limiting their performance. In this paper, we propose the context-dependent domain adversarial neural network for multimodal emotion recognition. To verify the effectiveness of our proposed method, we conduct experiments on the benchmark dataset IEMOCAP. Experimental results demonstrate that the proposed method shows an absolute improvement of 3.48% over state-of-the-art strategies.
引用
收藏
页码:394 / 398
页数:5
相关论文
共 50 条
  • [1] Context-dependent emotion recognition
    Wang, Zili
    Lao, Lingjie
    Zhang, Xiaoya
    Li, Yong
    Zhang, Tong
    Cui, Zhen
    JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2022, 89
  • [2] A HIERARCHICAL, CONTEXT-DEPENDENT NEURAL NETWORK ARCHITECTURE FOR IMPROVED PHONE RECOGNITION
    Toth, Laszlo
    2011 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2011, : 5040 - 5043
  • [3] MALN: Multimodal Adversarial Learning Network for Conversational Emotion Recognition
    Ren, Minjie
    Huang, Xiangdong
    Liu, Jing
    Liu, Ming
    Li, Xuanya
    Liu, An-An
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2023, 33 (11) : 6965 - 6980
  • [4] A Bi-Hemisphere Domain Adversarial Neural Network Model for EEG Emotion Recognition
    Li, Yang
    Zheng, Wenming
    Zong, Yuan
    Cui, Zhen
    Zhang, Tong
    Zhou, Xiaoyan
    IEEE TRANSACTIONS ON AFFECTIVE COMPUTING, 2021, 12 (02) : 494 - 504
  • [5] Domain Adversarial Network for Cross-Domain Emotion Recognition in Conversation
    Ma, Hongchao
    Zhang, Chunyan
    Zhou, Xiabing
    Chen, Junyi
    Zhou, Qinglei
    APPLIED SCIENCES-BASEL, 2022, 12 (11):
  • [6] ENSEMBLE OF DOMAIN ADVERSARIAL NEURAL NETWORKS FOR SPEECH EMOTION RECOGNITION
    Lee, Shi-wook
    2021 IEEE AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING WORKSHOP (ASRU), 2021, : 374 - 379
  • [7] Unsupervised Cross-Lingual Speech Emotion Recognition Using Domain Adversarial Neural Network
    Cai, Xiong
    Wu, Zhiyong
    Zhong, Kuo
    Su, Bin
    Dai, Dongyang
    Meng, Helen
    2021 12TH INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING (ISCSLP), 2021,
  • [8] Emotion recognition specialization and context-dependent risk of anxiety and depression in adolescents
    Oldehinkel, Albertine J.
    Hartman, Catharina A.
    Van Oort, Floor V. A.
    Nederhof, Esther
    BRAIN AND BEHAVIOR, 2015, 5 (02): : 1 - 10
  • [9] Multimodal Emotion Recognition Based on Ensemble Convolutional Neural Network
    Huang, Haiping
    Hu, Zhenchao
    Wang, Wenming
    Wu, Min
    IEEE ACCESS, 2020, 8 : 3265 - 3271
  • [10] A Multimodal Low Complexity Neural Network Approach for Emotion Recognition
    Aguinaga, Adrian Rodriguez
    Ramirez, Margarita Ramirez
    Soto, Maria del Consuelo Salgado
    Cisnero, Maria de los Angeles Quezada
    HUMAN BEHAVIOR AND EMERGING TECHNOLOGIES, 2024, 2024