3-D Facial Landmarks Detection for Intelligent Video Systems

被引：14

作者：

Hoang, Van-Thanh ^{[1
]}

Huang, De-Shuang ^{[2
]}

Jo, Kang-Hyun ^{[3
,4
]}

机构：

[1] Univ Ulsan, Grad Sch Elect Engn, Elect & Comp Engn, Ulsan 44610, South Korea

[2] Tongji Univ, Sch Elect & Informat Engn, Inst Machine Learning & Syst Biol, Shanghai 201804, Peoples R China

[3] Tongji Univ, Shanghai, Peoples R China

[4] Univ Ulsan, Sch Elect Engn, Ulsan, South Korea

来源：

IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS | 2021年 / 17卷 / 01期

关键词：

Face; Three-dimensional displays; Detectors; Computer architecture; Convolution; Task analysis; Computational modeling; Convolution block; convolutional neural network (CNN); facial landmarks; stacked hourglass;

D O I：

10.1109/TII.2020.2966513

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Facial landmark detection is a fundamental research topic in computer vision that is widely adopted in many applications. Recently, thanks to the development of convolutional neural networks, this topic has been largely improved. This article proposes facial-landmark detector, which is based on a state-of-the-art architecture for landmark localization called stacked hourglass network, to obtain accurate facial landmark-points. More specifically, this article uses residual networks as the backbone instead of a 7 x 7 convolution layer. Additionally, it modifies the hourglass modules by using the residual-dense blocks in the mainstream for capturing more efficient features and the 1 x 1 convolution layers in the branch streams for reducing the model size and computational time, instead of the original residual blocks. The proposed architecture also enhances the features from modified hourglass modules with finer-resolution features via a lateral connection to generate more accurate results. The proposed network can outperform other state-of-the-art methods on the AFLW2000-3D dataset and the LS3D-W dataset, the largest three-dimensional (3-D face) alignment dataset to date.

引用

页码：578 / 586

页数：9

共 50 条

[21] 3-D Seismic Fault Detection Using Recurrent Convolutional Neural Networks With Compound Loss
Ma, Xiao
Yao, Gang
Zhang, Feng
Wu, Di
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2023, 61
[22] ARFA: Adaptive Reception Field Aggregation for 3-D Detection From LiDAR Point Cloud
Zhang, Diankun
Wang, Xueqing
Zheng, Zhijie
Liu, Xiaojun
Fang, Guangyou
IEEE SENSORS JOURNAL, 2023, 23 (11) : 11156 - 11167
[23] DSAV: A Deep Sparse Acceleration Framework for Voxel-Based 3-D Object Detection
Fang, Haining
Tan, Yujuan
Ren, Ao
Zhuang, Wei
Hua, Yang
Qin, Zhiyong
Liu, Duo
IEEE TRANSACTIONS ON COMPUTER-AIDED DESIGN OF INTEGRATED CIRCUITS AND SYSTEMS, 2025, 44 (02) : 613 - 626
[24] 3-D Scene Graph: A Sparse and Semantic Representation of Physical Environments for Intelligent Agents
Kim, Ue-Hwan
Park, Jin-Man
Song, Taek-Jin
Kim, Jong-Hwan
IEEE TRANSACTIONS ON CYBERNETICS, 2020, 50 (12) : 4921 - 4933
[25] Spatial-Temporal-Geometric Graph Convolutional Network for 3-D Human Pose Estimation From Multiview Video
Dong, Kaiwen
Zhou, Yu
Riou, Kevin
Yun, Xiao
Sun, Yanjing
Subrin, Kevin
Le Callet, Patrick
IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2025, 74
[26] Sagitta: An Energy-Efficient Sparse 3D-CNN Accelerator for Real-Time 3-D Understanding
Zhou, Changchun
Liu, Min
Qiu, Siyuan
Cao, Xugang
Fu, Yuzhe
He, Yifan
Jiao, Hailong
IEEE INTERNET OF THINGS JOURNAL, 2023, 10 (23): : 20703 - 20717
[27] Improved 3-D LSTM: A Video Prediction Approach to Long Sequence Load Forecasting
Xiao, Jiang-Wen
Cui, Xue-Ying
Liu, Xiao-Kang
Fang, Hongliang
Li, Peng-Cheng
IEEE TRANSACTIONS ON SMART GRID, 2025, 16 (02) : 1885 - 1896
[28] Face 2D to 3D Reconstruction Network Based on Head Pose and 3D Facial Landmarks
Xu, Yuanquan
Jung, Cheolkon
2021 INTERNATIONAL CONFERENCE ON VISUAL COMMUNICATIONS AND IMAGE PROCESSING (VCIP), 2021,
[29] Leveraging facial landmarks improves generalization ability for deepfake detection
Gao, Qi
Zhang, Baopeng
Wu, Jianghao
Luo, Wenxin
Teng, Zhu
Fan, Jianping
PATTERN RECOGNITION, 2025, 164
[30] Adaptive Feature Aggregation Centric Enhance Network for Accurate and Fast Monocular 3-D Object Detection
Lin, Peng-Wei
Hsu, Chih-Ming
IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2024, 73

← 1 2 3 4 5 →