Sparse landmarks for facial action unit detection using vision transformer and perceiver

被引:0
作者
Cakir, Duygu [1 ]
Yilmaz, Gorkem [2 ]
Arica, Nafiz [3 ]
机构
[1] Bahcesehir Univ, Fac Engn & Nat Sci, Dept Software Engn, Istanbul, Turkiye
[2] Bahcesehir Univ, Fac Engn & Nat Sci, Dept Comp Engn, Istanbul, Turkiye
[3] Piri Reis Univ, Fac Engn, Dept Informat Syst Engn, Istanbul, Turkiye
关键词
action unit detection; sparse learning; vision transformer; perceiver; RECOGNITION; PATCHES;
D O I
10.1504/IJCSE.2023.10060451
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
The ability to accurately detect facial expressions, represented by facial action units (AUs), holds significant implications across diverse fields such as mental health diagnosis, security, and human-computer interaction. Although earlier approaches have made progress, the burgeoning complexity of facial actions demands more nuanced, computationally efficient techniques. This study pioneers the integration of sparse learning with vision transformer (ViT) and perceiver networks, focusing on the most active and descriptive landmarks for AU detection across both controlled (DISFA, BP4D) and in-the-wild (EmotioNet) datasets. Our novel approach, employing active landmark patches instead of the whole face, not only attains state-of-the-art performance but also uncovers insights into the differing attention mechanisms of ViT and perceiver. This fusion of techniques marks a significant advancement in facial analysis, potentially reshaping strategies in noise reduction and patch optimisation, setting a robust foundation for future research in the domain.
引用
收藏
页码:607 / 620
页数:15
相关论文
共 50 条
  • [41] Driver distraction detection using semi-supervised lightweight vision transformer
    Mohammed, Adam A. Q.
    Geng, Xin
    Wang, Jing
    Ali, Zafar
    [J]. ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2024, 129
  • [42] Smoke detection in foggy surveillance environment using parallel vision transformer network
    Shubhangi Chaturvedi
    Poornima Singh Thakur
    Pritee Khanna
    Aparajita Ojha
    [J]. Neural Computing and Applications, 2024, 36 (31) : 19499 - 19514
  • [43] DFDT: An End-to-End DeepFake Detection Framework Using Vision Transformer
    Khormali, Aminollah
    Yuan, Jiann-Shiun
    [J]. APPLIED SCIENCES-BASEL, 2022, 12 (06):
  • [44] Real-Time Earthquake Detection and Magnitude Estimation Using Vision Transformer
    Saad, Omar M.
    Chen, Yunfeng
    Savvaidis, Alexandros
    Fomel, Sergey
    Chen, Yangkang
    [J]. JOURNAL OF GEOPHYSICAL RESEARCH-SOLID EARTH, 2022, 127 (05)
  • [45] Investigation of facial aggression detection techniques based on facial action units using machine learning
    Hasan, Tabreer T.
    Issa, Abbas H.
    [J]. APPLIED NANOSCIENCE, 2022, 13 (4) : 3123 - 3123
  • [46] Joint Facial Action Unit Detection and Feature Fusion: A Multi-Conditional Learning Approach
    Eleftheriadis, Stefanos
    Rudovic, Ognjen
    Pantic, Maja
    [J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2016, 25 (12) : 5727 - 5742
  • [47] Ultra-lightweight face activation for dynamic vision sensor with convolutional filter-level fusion using facial landmarks
    Kim, Sungsoo
    Park, Jeongeun
    Yang, Donguk
    Shin, Dongyup
    Kim, Jungyeon
    Ryu, Hyunsurk Eric
    Kim, Ha Young
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2022, 205
  • [48] HCiT: Deepfake Video Detection Using a Hybrid Model of CNN features and Vision Transformer
    Kaddar, Bachir
    Fezza, Sid Ahmed
    Hamidouche, Wassim
    Akhtar, Zahid
    Hadid, Abdenour
    [J]. 2021 INTERNATIONAL CONFERENCE ON VISUAL COMMUNICATIONS AND IMAGE PROCESSING (VCIP), 2021,
  • [49] An optimized bidirectional vision transformer based colorectal cancer detection using histopathological images
    Choudhary, Raman
    Deepak, Akshay
    Krishnasamy, Gopalakrishnan
    Kumar, Vikash
    [J]. BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2025, 102
  • [50] A novel hierarchical framework for plant leaf disease detection using residual vision transformer
    Vallabhajosyula, Sasikala
    Sistla, Venkatramaphanikumar
    Kolli, Venkata Krishna Kishore
    [J]. HELIYON, 2024, 10 (09)