3-D Facial Landmarks Detection for Intelligent Video Systems

Cited by: 14
Authors
Hoang, Van-Thanh [1]
Huang, De-Shuang [2]
Jo, Kang-Hyun [3, 4]
Affiliations
[1] Univ Ulsan, Grad Sch Elect Engn, Elect & Comp Engn, Ulsan 44610, South Korea
[2] Tongji Univ, Sch Elect & Informat Engn, Inst Machine Learning & Syst Biol, Shanghai 201804, Peoples R China
[3] Tongji Univ, Shanghai, Peoples R China
[4] Univ Ulsan, Sch Elect Engn, Ulsan, South Korea
Keywords
Face; Three-dimensional displays; Detectors; Computer architecture; Convolution; Task analysis; Computational modeling; Convolution block; convolutional neural network (CNN); facial landmarks; stacked hourglass
DOI
10.1109/TII.2020.2966513
Chinese Library Classification number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Facial landmark detection is a fundamental research topic in computer vision and is used in many applications. Recently, the development of convolutional neural networks has greatly improved results on this task. This article proposes a facial-landmark detector, based on a state-of-the-art architecture for landmark localization called the stacked hourglass network, to obtain accurate facial landmark points. More specifically, it uses residual networks as the backbone instead of a 7 x 7 convolution layer. It also modifies the hourglass modules: in place of the original residual blocks, it uses residual-dense blocks in the main stream to capture more efficient features and 1 x 1 convolution layers in the branch streams to reduce model size and computation time. The proposed architecture further enhances the features from the modified hourglass modules with finer-resolution features via a lateral connection to generate more accurate results. The proposed network can outperform other state-of-the-art methods on the AFLW2000-3D dataset and on the LS3D-W dataset, the largest three-dimensional (3-D) face alignment dataset to date.
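
The abstract's claim that 1 x 1 convolutions in the branch streams reduce model size can be illustrated with a rough parameter count. The sketch below is not taken from the article: it assumes the replaced residual block is a bottleneck (1 x 1, 3 x 3, 1 x 1, with half the channels in the middle), assumes a feature width of 256 channels, and ignores normalization layers. The helper names `conv_params` and `bottleneck_params` are hypothetical.

```python
def conv_params(c_in: int, c_out: int, k: int, bias: bool = True) -> int:
    """Learnable parameters of a k x k 2-D convolution (weights plus optional bias)."""
    return c_in * c_out * k * k + (c_out if bias else 0)


def bottleneck_params(channels: int) -> int:
    """Parameters of an ASSUMED bottleneck residual block:
    1x1 (C -> C/2), 3x3 (C/2 -> C/2), 1x1 (C/2 -> C).
    Normalization layers and any skip-path projection are ignored.
    """
    half = channels // 2
    return (
        conv_params(channels, half, 1)
        + conv_params(half, half, 3)
        + conv_params(half, channels, 1)
    )


CHANNELS = 256  # assumed feature width, not a value stated in the abstract

branch_1x1 = conv_params(CHANNELS, CHANNELS, 1)   # single 1x1 convolution
branch_residual = bottleneck_params(CHANNELS)     # assumed original block

print(f"1x1 convolution:      {branch_1x1} parameters")
print(f"bottleneck residual:  {branch_residual} parameters")
print(f"ratio:                {branch_residual / branch_1x1:.2f}x")
```

Under these assumptions a single 1 x 1 layer carries roughly a third of the parameters of the block it replaces. The actual savings in the proposed network depend on its real layer configuration, which the abstract does not give.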
Pages: 578-586
Page count: 9