Generative technology for human emotion recognition: A scoping review

被引:0
作者
Ma, Fei [1 ]
Yuan, Yucheng
Xie, Yifan [3 ]
Ren, Hongwei [4 ]
Liu, Ivan [5 ]
He, Ying [6 ]
Ren, Fuji [7 ,8 ]
Yu, Fei Richard
Ni, Shiguang [2 ]
机构
[1] Guangdong Lab Artificial Intelligence & Digital Ec, Shenzhen, Peoples R China
[2] Tsinghua Univ, Tsinghua Shenzhen Int Grad Sch, Shenzhen, Peoples R China
[3] Xi An Jiao Tong Univ, Sch Software Engn, Xian, Peoples R China
[4] Hong Kong Univ Sci & Technol GZ, MICS Thrust, Guangzhou, Peoples R China
[5] Beijing Normal Univ Zhuhai, Fac Arts & Sci, Dept Psychol, Zhuhai, Peoples R China
[6] Shenzhen Univ, Coll Comp Sci & Software Engn, Shenzhen, Peoples R China
[7] Univ Elect Sci & Technol China, Sch Comp Sci & Engn, Chengdu, Peoples R China
[8] Univ Elect Sci & Technol China, Shenzhen Inst Adv Study, Shenzhen, Peoples R China
基金
中国国家自然科学基金;
关键词
Emotion recognition; Generative technology; Autoencoder; Generative Adversarial Network; Diffusion model; Large Language Model; FACIAL EXPRESSION RECOGNITION; LEARNING REPRESENTATIONS; ADVERSARIAL NETWORKS; DATA AUGMENTATION; SPEECH; EEG; AUTOENCODER; DATABASES; MODELS;
D O I
10.1016/j.inffus.2024.102753
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Affective computing stands at the forefront of artificial intelligence (AI), seeking to imbue machines with the ability to comprehend and respond to human emotions. Central to this field is emotion recognition, which endeavors to identify and interpret human emotional states from different modalities, such as speech, facial images, text, and physiological signals. In recent years, important progress has been made in generative models, including Autoencoder, Generative Adversarial Network, Diffusion Model, and Large Language Model. These models, with their powerful data generation capabilities, emerge as pivotal tools in advancing emotion recognition. However, up to now, there remains a paucity of systematic efforts that review generative technology for emotion recognition. This survey aims to bridge the gaps in the existing literature by conducting a comprehensive analysis of over 330 research papers until June 2024. Specifically, this survey will firstly introduce the mathematical principles of different generative models and the commonly used datasets. Subsequently, through a taxonomy, it will provide an in-depth analysis of how generative techniques address emotion recognition based on different modalities in several aspects, including data augmentation, feature extraction, semi-supervised learning, cross-domain, etc. Finally, the review will outline future research directions, emphasizing the potential of generative models to advance the field of emotion recognition and enhance the emotional intelligence of AI systems.
引用
收藏
页数:30
相关论文
共 336 条
[1]   Identity preserving multi-pose facial expression recognition using fine tuned VGG on the latent space vector of generative adversarial network [J].
Abiram, R. Nandhini ;
Vincent, P. M. Durai Raj .
MATHEMATICAL BIOSCIENCES AND ENGINEERING, 2021, 18 (04) :3699-3717
[2]   A systematic survey on multimodal emotion recognition using learning algorithms [J].
Ahmed, Naveed ;
Al Aghbari, Zaher ;
Girija, Shini .
INTELLIGENT SYSTEMS WITH APPLICATIONS, 2023, 17
[3]   Speech emotion recognition: Emotional models, databases, features, preprocessing methods, supporting modalities, and classifiers [J].
Akcay, Mehmet Berkehan ;
Oguz, Kaya .
SPEECH COMMUNICATION, 2020, 116 :56-76
[4]   Emotions Recognition Using EEG Signals: A Survey [J].
Alarcao, Soraia M. ;
Fonseca, Manuel J. .
IEEE TRANSACTIONS ON AFFECTIVE COMPUTING, 2019, 10 (03) :374-393
[5]  
Ali K, 2019, Arxiv, DOI arXiv:1909.13135
[6]   Variational Autoencoder with Global- and Medium Timescale Auxiliaries for Emotion Recognition from Speech [J].
Almotlak, Hussam ;
Weber, Cornelius ;
Qu, Leyuan ;
Wermter, Stefan .
ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING, ICANN 2020, PT I, 2020, 12396 :529-540
[7]   A survey of state-of-the-art approaches for emotion recognition in text [J].
Alswaidan, Nourah ;
Menai, Mohamed El Bachir .
KNOWLEDGE AND INFORMATION SYSTEMS, 2020, 62 (08) :2937-2987
[8]  
Arjovsky M, 2017, PR MACH LEARN RES, V70
[9]   Subject independent emotion recognition using EEG signals employing attention driven neural networks [J].
Arjun ;
Rajpoot, Aniket Singh ;
Panicker, Mahesh Raveendranatha .
BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2022, 75
[10]   Deep Reinforcement Learning A brief survey [J].
Arulkumaran, Kai ;
Deisenroth, Marc Peter ;
Brundage, Miles ;
Bharath, Anil Anthony .
IEEE SIGNAL PROCESSING MAGAZINE, 2017, 34 (06) :26-38