Sparse landmarks for facial action unit detection using vision transformer and perceiver

被引：0

作者：

Cakir, Duygu ^{[1
]}

Yilmaz, Gorkem ^{[2
]}

Arica, Nafiz ^{[3
]}

机构：

[1] Bahcesehir Univ, Fac Engn & Nat Sci, Dept Software Engn, Istanbul, Turkiye

[2] Bahcesehir Univ, Fac Engn & Nat Sci, Dept Comp Engn, Istanbul, Turkiye

[3] Piri Reis Univ, Fac Engn, Dept Informat Syst Engn, Istanbul, Turkiye

来源：

INTERNATIONAL JOURNAL OF COMPUTATIONAL SCIENCE AND ENGINEERING | 2024年 / 27卷 / 05期

关键词：

action unit detection; sparse learning; vision transformer; perceiver; RECOGNITION; PATCHES;

D O I：

10.1504/IJCSE.2023.10060451

中图分类号：

TP39 [计算机的应用];

学科分类号：

081203 ; 0835 ;

摘要：

The ability to accurately detect facial expressions, represented by facial action units (AUs), holds significant implications across diverse fields such as mental health diagnosis, security, and human-computer interaction. Although earlier approaches have made progress, the burgeoning complexity of facial actions demands more nuanced, computationally efficient techniques. This study pioneers the integration of sparse learning with vision transformer (ViT) and perceiver networks, focusing on the most active and descriptive landmarks for AU detection across both controlled (DISFA, BP4D) and in-the-wild (EmotioNet) datasets. Our novel approach, employing active landmark patches instead of the whole face, not only attains state-of-the-art performance but also uncovers insights into the differing attention mechanisms of ViT and perceiver. This fusion of techniques marks a significant advancement in facial analysis, potentially reshaping strategies in noise reduction and patch optimisation, setting a robust foundation for future research in the domain.

引用

页码：607 / 620

页数：15

共 50 条

[41] Driver distraction detection using semi-supervised lightweight vision transformer
Mohammed, Adam A. Q.
Geng, Xin
Wang, Jing
Ali, Zafar
[J]. ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2024, 129
[42] Smoke detection in foggy surveillance environment using parallel vision transformer network
Shubhangi Chaturvedi
Poornima Singh Thakur
Pritee Khanna
Aparajita Ojha
[J]. Neural Computing and Applications, 2024, 36 (31) : 19499 - 19514
[43] DFDT: An End-to-End DeepFake Detection Framework Using Vision Transformer
Khormali, Aminollah
Yuan, Jiann-Shiun
[J]. APPLIED SCIENCES-BASEL, 2022, 12 (06):
[44] Real-Time Earthquake Detection and Magnitude Estimation Using Vision Transformer
Saad, Omar M.
Chen, Yunfeng
Savvaidis, Alexandros
Fomel, Sergey
Chen, Yangkang
[J]. JOURNAL OF GEOPHYSICAL RESEARCH-SOLID EARTH, 2022, 127 (05)
[45] Investigation of facial aggression detection techniques based on facial action units using machine learning
Hasan, Tabreer T.
Issa, Abbas H.
[J]. APPLIED NANOSCIENCE, 2022, 13 (4) : 3123 - 3123
[46] Joint Facial Action Unit Detection and Feature Fusion: A Multi-Conditional Learning Approach
Eleftheriadis, Stefanos
Rudovic, Ognjen
Pantic, Maja
[J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2016, 25 (12) : 5727 - 5742
[47] Ultra-lightweight face activation for dynamic vision sensor with convolutional filter-level fusion using facial landmarks
Kim, Sungsoo
Park, Jeongeun
Yang, Donguk
Shin, Dongyup
Kim, Jungyeon
Ryu, Hyunsurk Eric
Kim, Ha Young
[J]. EXPERT SYSTEMS WITH APPLICATIONS, 2022, 205
[48] HCiT: Deepfake Video Detection Using a Hybrid Model of CNN features and Vision Transformer
Kaddar, Bachir
Fezza, Sid Ahmed
Hamidouche, Wassim
Akhtar, Zahid
Hadid, Abdenour
[J]. 2021 INTERNATIONAL CONFERENCE ON VISUAL COMMUNICATIONS AND IMAGE PROCESSING (VCIP), 2021,
[49] An optimized bidirectional vision transformer based colorectal cancer detection using histopathological images
Choudhary, Raman
Deepak, Akshay
Krishnasamy, Gopalakrishnan
Kumar, Vikash
[J]. BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2025, 102
[50] A novel hierarchical framework for plant leaf disease detection using residual vision transformer
Vallabhajosyula, Sasikala
Sistla, Venkatramaphanikumar
Kolli, Venkata Krishna Kishore
[J]. HELIYON, 2024, 10 (09)

← 1 2 3 4 5 →