Adaptive Wing Loss for Robust Face Alignment via Heatmap Regression

被引:240
作者
Wang, Xinyao [1 ,2 ]
Bo, Liefeng [2 ]
Li Fuxin [1 ]
机构
[1] Oregon State Univ, Corvallis, OR 97331 USA
[2] JD Digits, Beijing, Peoples R China
来源
2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019) | 2019年
基金
美国国家科学基金会;
关键词
D O I
10.1109/ICCV.2019.00707
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Heatmap regression with a deep network has become one of the mainstream approaches to localize facial landmarks. However, the loss function for heatmap regression is rarely studied. In this paper, we analyze the ideal loss function properties for heatmap regression in face alignment problems. Then we propose a novel loss function, named Adaptive Wing loss, that is able to adapt its shape to different types of ground truth heatmap pixels. This adaptability penalizes loss more on foreground pixels while less on background pixels. To address the imbalance between foreground and background pixels, we also propose Weighted Loss Map, which assigns high weights on foreground and difficult background pixels to help training process focus more on pixels that are crucial to landmark localization. To further improve face alignment accuracy, we introduce boundary prediction and CoordConv with boundary co-ordinates. Extensive experiments on different benchmarks, including COFW, 300W and WFLW, show our approach outperforms the state-of-the-art by a significant margin on various evaluation metrics. Besides, the Adaptive Wing loss also helps other heatmap regression tasks.
引用
收藏
页码:6970 / 6980
页数:11
相关论文
共 66 条
[11]   Robust face landmark estimation under occlusion [J].
Burgos-Artizzu, Xavier P. ;
Perona, Pietro ;
Dollar, Piotr .
2013 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2013, :1513-1520
[12]   Face Alignment by Explicit Shape Regression [J].
Cao, Xudong ;
Wei, Yichen ;
Wen, Fang ;
Sun, Jian .
INTERNATIONAL JOURNAL OF COMPUTER VISION, 2014, 107 (02) :177-190
[13]  
CHU X, 2017, PROC CVPR IEEE, P5669, DOI [DOI 10.1109/CVPR.2017.601, 10.1109/CVPR.2017.601]
[14]  
Deng J., 2018, ARXIV180107698
[15]   M3 CSR: Multi-view, multi-scale and multi-component cascade shape regression [J].
Deng, Jiankang ;
Liu, Qingshan ;
Yang, Jing ;
Tao, Dacheng .
IMAGE AND VISION COMPUTING, 2016, 47 :19-26
[16]   Style Aggregated Network for Facial Landmark Detection [J].
Dong, Xuanyi ;
Yan, Yan ;
Ouyang, Wanli ;
Yang, Yi .
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, :379-388
[17]   End-to-end 3D face reconstruction with deep neural networks [J].
Dou, Pengfei ;
Shah, Shishir K. ;
Kakadiaris, Ioannis A. .
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :1503-1512
[18]   Approaching human level facial landmark localization by deep learning [J].
Fan, Haoqiang ;
Zhou, Erjin .
IMAGE AND VISION COMPUTING, 2016, 47 :27-35
[19]   Wing Loss for Robust Facial Landmark Localisation with Convolutional Neural Networks [J].
Feng, Zhen-Hua ;
Kittler, Josef ;
Awais, Muhammad ;
Huber, Patrik ;
Wu, Xiao-Jun .
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, :2235-2245
[20]   Dynamic Attention-controlled Cascaded Shape Regression Exploiting Training Data Augmentation and Fuzzy-set Sample Weighting [J].
Feng, Zhen-Hua ;
Kittler, Josef ;
Christmas, William ;
Huber, Patrik ;
Wu, Xiao-Jun .
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :3681-3686