Accurate Facial Landmark Detector via Multi-scale Transformer

被引:2
|
作者
Sha, Yuyang [1 ]
Meng, Weiyu [1 ]
Zhai, Xiaobing [1 ]
Xie, Can [1 ]
Li, Kefeng [1 ]
机构
[1] Macao Polytech Univ, Fac Appl Sci, Taipa, Macao, Peoples R China
来源
PATTERN RECOGNITION AND COMPUTER VISION, PRCV 2023, PT V | 2024年 / 14429卷
关键词
Facial landmark detection; Vision transformer; Multi-scale feature; Global information;
D O I
10.1007/978-981-99-8469-5_22
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Facial landmark detection is an essential prerequisite for many face applications, which has attracted much attention and made remarkable progress in recent years. However, some problems still need to be solved urgently, including improving the accuracy of facial landmark detectors in complex scenes, encoding long-range relationships between keypoints and facial components, and optimizing the robustness of methods in unconstrained environments. To address these problems, we propose a novel facial landmark detector via multi-scale transformer (MTLD), which contains three modules: Multi-scale Transformer, Joint Regression, and Structure Loss. The proposed Multi-scale Transformer focuses on capturing long-range information and cross-scale representations from multi-scale feature maps. The Joint Regression takes advantage of both coordinate and heatmap regression, which could boost the inference speed without sacrificing model accuracy. Furthermore, in order to explore the structural dependency between facial landmarks, we design the Structure Loss to fully utilize the geometric information in face images. We evaluate the proposed method through extensive experiments on four benchmark datasets. The results demonstrate that our method outperforms state-of-the-art approaches both in accuracy and efficiency.
引用
收藏
页码:278 / 290
页数:13
相关论文
共 50 条
  • [41] Multi-scale Knowledge Transfer Vision Transformer for 3D vessel shape segmentation
    Hua, Michael J.
    Wu, Junjie
    Zhong, Zichun
    COMPUTERS & GRAPHICS-UK, 2024, 122
  • [42] ConTrans-Detect: A Multi-Scale Convolution-Transformer Network for DeepFake Video Detection
    Sun, Weirong
    Ma, Yujun
    Zhang, Hong
    Wang, Ruili
    2023 29TH INTERNATIONAL CONFERENCE ON MECHATRONICS AND MACHINE VISION IN PRACTICE, M2VIP 2023, 2023,
  • [43] Transformer guided self-adaptive network for multi-scale skin lesion image segmentation
    Xin, Chao
    Liu, Zhifang
    Ma, Yizhao
    Wang, Dianchen
    Zhang, Jing
    Li, Lingzhi
    Zhou, Qiongyan
    Xu, Suling
    Zhang, Yingying
    COMPUTERS IN BIOLOGY AND MEDICINE, 2024, 169
  • [44] A Robust Facial Landmark Detection Method in Multi-views
    Liu, Xinran
    Su, Fei
    2015 VISUAL COMMUNICATIONS AND IMAGE PROCESSING (VCIP), 2015,
  • [45] An efficient transformer network for detecting multi-scale chicken in complex free-range farming environments via improved RT-DETR
    Li, Xiaoxin
    Cai, Mingrui
    Tan, Xinjie
    Yin, Chengcheng
    Chen, Weihao
    Liu, Zhen
    Wen, Jiangtao
    Han, Yuxing
    COMPUTERS AND ELECTRONICS IN AGRICULTURE, 2024, 224
  • [46] Cascaded Iterative Transformer for Jointly Predicting Facial Landmark, Occlusion Probability and Head Pose
    Yaokun Li
    Guang Tan
    Chao Gou
    International Journal of Computer Vision, 2024, 132 : 1242 - 1257
  • [47] Cascaded Iterative Transformer for Jointly Predicting Facial Landmark, Occlusion Probability and Head Pose
    Li, Yaokun
    Tan, Guang
    Gou, Chao
    INTERNATIONAL JOURNAL OF COMPUTER VISION, 2024, 132 (04) : 1242 - 1257
  • [48] MFMAM: Image inpainting via multi-scale feature module with attention module
    Chen, Yuantao
    Xia, Runlong
    Yang, Kai
    Zou, Ke
    COMPUTER VISION AND IMAGE UNDERSTANDING, 2024, 238
  • [49] HiViT: Hierarchical attention-based Transformer for multi-scale whole slide histopathological image classification
    Yu, Jinze
    Li, Shuo
    Tan, Luxin
    Zhou, Haoyi
    Li, Zhongwu
    Li, Jianxin
    EXPERT SYSTEMS WITH APPLICATIONS, 2025, 277
  • [50] MG-Trans: Multi-Scale Graph Transformer With Information Bottleneck for Whole Slide Image Classification
    Shi, Jiangbo
    Tang, Lufei
    Gao, Zeyu
    Li, Yang
    Wang, Chunbao
    Gong, Tieliang
    Li, Chen
    Fu, Huazhu
    IEEE TRANSACTIONS ON MEDICAL IMAGING, 2023, 42 (12) : 3871 - 3883