Laplace Landmark Localization

被引:32
作者
Robinson, Joseph P. [1 ,2 ]
Li, Yuncheng [2 ]
Zhang, Ning [2 ]
Fu, Yun [1 ]
Tulyakov, Sergey [2 ]
机构
[1] Northeastern Univ, Boston, MA 02115 USA
[2] Snap Inc, Santa Monica, CA USA
来源
2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019) | 2019年
关键词
D O I
10.1109/ICCV.2019.01020
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Landmark localization in images and videos is a classic problem solved in various ways. Nowadays, with deep networks prevailing throughout machine learning, there are revamped interests in pushing facial landmark detectors to handle more challenging data. Most efforts use network objectives based on L-1 or L-2 norms, which have several disadvantages. First of all, the generated heatmaps translate to the locations of landmarks (i.e. confidence maps) from which predicted landmark locations (i.e. the means) get penalized without accounting for the spread: a high-scatter corresponds to low confidence and vice-versa. For this, we introduce a LaplaceKL objective that penalizes for low confidence. Another issue is a dependency on labeled data, which are expensive to obtain and susceptible to error. To address both issues, we propose an adversarial training framework that leverages unlabeled data to improve model performance. Our method claims state-of-the-art on all of the 300W benchmarks and ranks second-to-best on the Annotated Facial Landmarks in the Wild (AFLW) dataset. Furthermore, our model is robust with a reduced size: 1/8 the number of channels (i.e. 0.0398 MB) is comparable to the state-of-the-art in real-time on CPU. Thus, this work is of high practical value to real-life application.
引用
收藏
页码:10102 / 10111
页数:10
相关论文
共 48 条
  • [11] Goodfellow IJ, 2014, ADV NEUR IN, V27, P2672
  • [12] Multi-PIE
    Gross, Ralph
    Matthews, Iain
    Cohn, Jeffrey
    Kanade, Takeo
    Baker, Simon
    [J]. IMAGE AND VISION COMPUTING, 2010, 28 (05) : 807 - 813
  • [13] DenseReg: Fully Convolutional Dense Shape Regression In-the-Wild
    Guler, Riza Alp
    Trigeorgis, George
    Antonakos, Epameinondas
    Snape, Patrick
    Zafeiriou, Stefanos
    Kokkinos, Iasonas
    [J]. 30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 2614 - 2623
  • [14] Hoffman J., 2017, ARXIV171103213
  • [15] Hoffman MD, 2013, J MACH LEARN RES, V14, P1303
  • [16] Improving Landmark Localization with Semi-Supervised Learning
    Honari, Sina
    Molchanov, Pavlo
    Tyree, Stephen
    Vincent, Pascal
    Pal, Christopher
    Kautz, Jan
    [J]. 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 1546 - 1555
  • [17] Recombinator Networks: Learning Coarse-to-Fine Feature Aggregation
    Honari, Sina
    Yosinski, Jason
    Vincent, Pascal
    Pal, Christopher
    [J]. 2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, : 5743 - 5752
  • [18] Isola P, 2017, PROC CVPR IEEE, P1125, DOI DOI 10.1109/CVPR.2017.632
  • [19] Jeni Laszlo A., 2015, 2015 11th IEEE International Conference and Workshops on Automatic Face and Gesture Recognition (FG), P1, DOI 10.1109/FG.2015.7163142
  • [20] Kazemi Vahid, 2014, PROC CVPR IEEE, DOI DOI 10.1109/CVPR.2014.241