Understanding Naturalistic Facial Expressions with Deep Learning and Multimodal Large Language Models

被引:7
作者
Bian, Yifan [1 ]
Kuester, Dennis [2 ]
Liu, Hui [2 ]
Krumhuber, Eva G. [1 ]
机构
[1] UCL, Dept Expt Psychol, London WC1H 0AP, England
[2] Univ Bremen, Dept Math & Comp Sci, D-28359 Bremen, Germany
关键词
automatic facial expression recognition; naturalistic context; deep learning; multimodal large language model; RECOGNITION; EMOTION; CONTEXT; FACE; DATABASE;
D O I
10.3390/s24010126
中图分类号
O65 [分析化学];
学科分类号
070302 ; 081704 ;
摘要
This paper provides a comprehensive overview of affective computing systems for facial expression recognition (FER) research in naturalistic contexts. The first section presents an updated account of user-friendly FER toolboxes incorporating state-of-the-art deep learning models and elaborates on their neural architectures, datasets, and performances across domains. These sophisticated FER toolboxes can robustly address a variety of challenges encountered in the wild such as variations in illumination and head pose, which may otherwise impact recognition accuracy. The second section of this paper discusses multimodal large language models (MLLMs) and their potential applications in affective science. MLLMs exhibit human-level capabilities for FER and enable the quantification of various contextual variables to provide context-aware emotion inferences. These advancements have the potential to revolutionize current methodological approaches for studying the contextual influences on emotions, leading to the development of contextualized emotion models.
引用
收藏
页数:15
相关论文
共 50 条
[41]   Robust and Affordable Deep Learning Models for Multimodal Sensor Fusion [J].
Xaviar, Sanju .
PROCEEDINGS OF THE 2021 THE 19TH ACM CONFERENCE ON EMBEDDED NETWORKED SENSOR SYSTEMS, SENSYS 2021, 2021, :403-404
[42]   Exploring Theory of Mind in Large Language Models through Multimodal Negotiation [J].
Yongsatianchot, Nutchanon ;
Thejll-Madsen, Tobias ;
Marsella, Stacy .
PROCEEDINGS OF THE 24TH ACM INTERNATIONAL CONFERENCE ON INTELLIGENT VIRTUAL AGENTS, IVA 2024, 2024,
[43]   Large Language Model Enhanced Particle Swarm Optimization for Hyperparameter Tuning for Deep Learning Models [J].
Hameed, Saad ;
Qolomany, Basheer ;
Belhaouari, Samir Brahim ;
Abdallah, Mohamed ;
Qadir, Junaid ;
Al-Fuqaha, Ala .
IEEE OPEN JOURNAL OF THE COMPUTER SOCIETY, 2025, 6 :574-585
[44]   FRACTAL-INSPIRED SENTIMENT ANALYSIS: EVALUATION OF LARGE LANGUAGE MODELS AND DEEP LEARNING METHODS [J].
Alsagri, Hatoon S. ;
Sohail, Shahab Saquib .
FRACTALS-COMPLEX GEOMETRY PATTERNS AND SCALING IN NATURE AND SOCIETY, 2024,
[45]   Workshop on Deep Learning and Large Language Models for Knowledge Graphs (DL4KG) [J].
Alam, Mehwish ;
Buscaldi, Davide ;
Cochez, Michael ;
Gesese, Genet Asefa ;
Osborne, Francesco ;
Recupero, Diego Reforgiato .
PROCEEDINGS OF THE 30TH ACM SIGKDD CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, KDD 2024, 2024, :6704-6705
[46]   A multifactor model using large language models and multimodal investor sentiment [J].
Zhang, Junhuan ;
Zhang, Ziyan ;
Wen, Jiaqi .
INTERNATIONAL REVIEW OF ECONOMICS & FINANCE, 2025, 102
[47]   Comparing Recognition Performance and Robustness of Multimodal Deep Learning Models for Multimodal Emotion Recognition [J].
Liu, Wei ;
Qiu, Jie-Lin ;
Zheng, Wei-Long ;
Lu, Bao-Liang .
IEEE TRANSACTIONS ON COGNITIVE AND DEVELOPMENTAL SYSTEMS, 2022, 14 (02) :715-729
[48]   Auto Diagnosis of Parkinson's Disease Via a Deep Learning Model Based on Mixed Emotional Facial Expressions [J].
Huang, Wei ;
Xu, Wenqiang ;
Wan, Renjie ;
Zhang, Peng ;
Zha, Yufei ;
Pang, Meng .
IEEE JOURNAL OF BIOMEDICAL AND HEALTH INFORMATICS, 2024, 28 (05) :2547-2557
[49]   Using deep learning to predict ideology from facial photographs: expressions, beauty, and extra-facial information [J].
Rasmussen, Stig Hebbelstrup Rye ;
Ludeke, Steven G. ;
Klemmensen, Robert .
SCIENTIFIC REPORTS, 2023, 13 (01)
[50]   Emotion Recognition System via Facial Expressions and Speech Using Machine Learning and Deep Learning Techniques [J].
Chaudhari A. ;
Bhatt C. ;
Nguyen T.T. ;
Patel N. ;
Chavda K. ;
Sarda K. .
SN Computer Science, 4 (4)