Understanding Naturalistic Facial Expressions with Deep Learning and Multimodal Large Language Models

被引:5
作者
Bian, Yifan [1 ]
Kuester, Dennis [2 ]
Liu, Hui [2 ]
Krumhuber, Eva G. [1 ]
机构
[1] UCL, Dept Expt Psychol, London WC1H 0AP, England
[2] Univ Bremen, Dept Math & Comp Sci, D-28359 Bremen, Germany
关键词
automatic facial expression recognition; naturalistic context; deep learning; multimodal large language model; RECOGNITION; EMOTION; CONTEXT; FACE; DATABASE;
D O I
10.3390/s24010126
中图分类号
O65 [分析化学];
学科分类号
070302 ; 081704 ;
摘要
This paper provides a comprehensive overview of affective computing systems for facial expression recognition (FER) research in naturalistic contexts. The first section presents an updated account of user-friendly FER toolboxes incorporating state-of-the-art deep learning models and elaborates on their neural architectures, datasets, and performances across domains. These sophisticated FER toolboxes can robustly address a variety of challenges encountered in the wild such as variations in illumination and head pose, which may otherwise impact recognition accuracy. The second section of this paper discusses multimodal large language models (MLLMs) and their potential applications in affective science. MLLMs exhibit human-level capabilities for FER and enable the quantification of various contextual variables to provide context-aware emotion inferences. These advancements have the potential to revolutionize current methodological approaches for studying the contextual influences on emotions, leading to the development of contextualized emotion models.
引用
收藏
页数:15
相关论文
共 50 条
  • [21] Understanding the Efficiency of Deep Learning in Language Learning using Personalized Language Learning Apps
    Divya, R.
    Hema, N.
    LITERARY VOICE, 2021, 1 (15): : 240 - 249
  • [22] QueryMintAI: Multipurpose Multimodal Large Language Models for Personal Data
    Ghosh, Ananya
    Deepa, K.
    IEEE ACCESS, 2024, 12 : 144631 - 144651
  • [23] Comparison of Multi-Modal Large Language Models with Deep Learning Models for Medical Image Classification
    Than, Joel Chia Ming
    Vong, Wan Tze
    Yong, Kelvin Sheng Chek
    2024 IEEE 8TH INTERNATIONAL CONFERENCE ON SIGNAL AND IMAGE PROCESSING APPLICATIONS, ICSIPA, 2024,
  • [24] Deep learning: from speech recognition to language and multimodal processing
    Deng, Li
    APSIPA TRANSACTIONS ON SIGNAL AND INFORMATION PROCESSING, 2016, 5
  • [25] Sign Language Recognition with Multimodal Sensors and Deep Learning Methods
    Lu, Chenghong
    Kozakai, Misaki
    Jing, Lei
    ELECTRONICS, 2023, 12 (23)
  • [26] Deep Learning and Web Applications Vulnerabilities Detection: An Approach Based on Large Language Models
    1600, Science and Information Organization (15): : 1391 - 1399
  • [27] Deep Learning for Genomics: From Early Neural Nets to Modern Large Language Models
    Yue, Tianwei
    Wang, Yuanxin
    Zhang, Longxiang
    Gu, Chunming
    Xue, Haoru
    Wang, Wenping
    Lyu, Qi
    Dun, Yujie
    INTERNATIONAL JOURNAL OF MOLECULAR SCIENCES, 2023, 24 (21)
  • [28] Deep learning detection method for large language models-generated scientific content
    Alhijawi, Bushra
    Jarrar, Rawan
    AbuAlRub, Aseel
    Bader, Arwa
    Neural Computing and Applications, 2025, 37 (01) : 91 - 104
  • [29] Deep Learning and Web Applications Vulnerabilities Detection: An Approach Based on Large Language Models
    Nana, Sidwendluian Romaric
    Bassole, Didier
    Guel, Desire
    Sie, Oumarou
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2024, 15 (07) : 1391 - 1399
  • [30] Pain detection through facial expressions in children with autism using deep learning
    Sandeep, P. V. K.
    Kumar, N. Suresh
    SOFT COMPUTING, 2024, 28 (5) : 4621 - 4630