Understanding Naturalistic Facial Expressions with Deep Learning and Multimodal Large Language Models

Cited by: 7

Authors:

Bian, Yifan [1]

Kuester, Dennis [2]

Liu, Hui [2]

Krumhuber, Eva G. [1]

Affiliations:

[1] UCL, Dept Expt Psychol, London WC1H 0AP, England

[2] Univ Bremen, Dept Math & Comp Sci, D-28359 Bremen, Germany

Source:

SENSORS | 2024 / Volume 24 / Issue 01

Keywords:

automatic facial expression recognition; naturalistic context; deep learning; multimodal large language model; RECOGNITION; EMOTION; CONTEXT; FACE; DATABASE;

DOI:

10.3390/s24010126

Chinese Library Classification (CLC) number:

O65 [Analytical Chemistry];

Discipline classification codes:

070302 ; 081704 ;

Abstract:

This paper provides a comprehensive overview of affective computing systems for facial expression recognition (FER) research in naturalistic contexts. The first section presents an updated account of user-friendly FER toolboxes incorporating state-of-the-art deep learning models and elaborates on their neural architectures, datasets, and performances across domains. These sophisticated FER toolboxes can robustly address a variety of challenges encountered in the wild such as variations in illumination and head pose, which may otherwise impact recognition accuracy. The second section of this paper discusses multimodal large language models (MLLMs) and their potential applications in affective science. MLLMs exhibit human-level capabilities for FER and enable the quantification of various contextual variables to provide context-aware emotion inferences. These advancements have the potential to revolutionize current methodological approaches for studying the contextual influences on emotions, leading to the development of contextualized emotion models.

Pages: 15
