Automating Manga Character Analysis: A Robust Deep Vision-Transformer Approach to Facial Landmark Detection

被引:0
|
作者
Vachmanus, Sirawich [1 ]
Phinklao, Noppanan [1 ]
Phongsarnariyakul, Naruparn [1 ]
Plongcharoen, Thanat [1 ]
Hotta, Seiji [2 ]
Tuarob, Suppawong [1 ]
机构
[1] Mahidol Univ, Fac Informat & Commun Technol, Nakhon Pathom 73170, Thailand
[2] Tokyo Univ Agr & Technol, Inst Engn, Tokyo 1838538, Japan
来源
IEEE ACCESS | 2024年 / 12卷
关键词
Face recognition; Cultural aspects; Feature extraction; Accuracy; Heating systems; Neural networks; Transformers; Deep learning; Face landmarks; deep learning; manga; cultural heritage;
D O I
10.1109/ACCESS.2024.3459419
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Comics, particularly Japanese manga, are a powerful medium that blends images and text to convey ideas and encapsulate a unique cultural heritage. Going beyond mere entertainment, manga merges diverse styles and content deeply rooted in Japanese cultural heritage. This study utilizes computer vision analysis, with a specific focus on facial landmark detection, acknowledging the growing significance of technology in analyzing manga images. Through a comprehensive exploration of various methods, the research identifies the extended version of Bidirectional Encoder Representations from Transformers (BERT), BERT Pre-Training of Image Transformers (BEiT), model as a standout performer due to its efficiency and effectiveness. The BEiT model's success lies in its ability to extract facial features, consequently establishing itself as a go-to solution for landmark detection on manga faces. The outcomes achieved the lowest Failure Rate compared to other landmark detection networks, with a Failure Rate of approximately 9.4% and a Mean Average Error of about 4.6 pixels. Beyond its technical accomplishments, this study carries a cultural significance, contributing to the ongoing narrative of manga in Japan.
引用
收藏
页码:131284 / 131295
页数:12
相关论文
共 3 条
  • [1] Robust Facial Landmark Detection by Multiorder Multiconstraint Deep Networks
    Wan, Jun
    Lai, Zhihui
    Li, Jing
    Zhou, Jie
    Gao, Can
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2022, 33 (05) : 2181 - 2194
  • [2] Deep coupling neural network for robust facial landmark detection
    Wu, Wenyan
    Wu, Xingzhe
    Cai, Yici
    Zhou, Qiang
    COMPUTERS & GRAPHICS-UK, 2019, 82 : 286 - 294
  • [3] Deep Neural Networks and Computer Vision for Real-Time Facial Landmark Detection
    Ciobotaru, Alexandru
    Gota, Dan Ioan
    Puscasiu, Adela
    Stan, Ovidiu
    Fanca, Alexandra
    Domnuta, Claudiu
    Valean, Honoriu
    Miclea, Liviu
    INTERNATIONAL CONFERENCE ON ELECTRICAL, COMPUTER AND ENERGY TECHNOLOGIES (ICECET 2021), 2021, : 424 - 429