Improving Landmark Localization with Semi-Supervised Learning

被引：113

作者：

Honari, Sina ^{[1
,6
]}

Molchanov, Pavlo ^{[2
]}

Tyree, Stephen ^{[2
]}

Vincent, Pascal ^{[1
,4
,5
]}

Pal, Christopher ^{[1
,3
]}

Kautz, Jan ^{[2
]}

机构：

[1] Univ Montreal, MILA, Montreal, PQ, Canada

[2] NVIDIA, Santa Clara, CA USA

[3] Ecole Polytech Montreal, Montreal, PQ, Canada

[4] CIFAR, Toronto, ON, Canada

[5] Facebook AI Res, Menlo Pk, CA USA

[6] NVIDIA Res, Santa Clara, CA USA

来源：

2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR) | 2018年

关键词：

FACE ALIGNMENT; REPRESENTATION;

D O I：

10.1109/CVPR.2018.00167

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

We present two techniques to improve landmark localization in images from partially annotated datasets. Our primary goal is to leverage the common situation where precise landmark locations are only provided for a small data subset, but where class labels for classification or regression tasks related to the landmarks are more abundantly available. First, we propose the framework of sequential multitasking and explore it here through an architecture for landmark localization where training with class labels acts as an auxiliary signal to guide the landmark localization on unlabeled data. A key aspect of our approach is that errors can be backpropagated through a complete landmark localization model. Second, we propose and explore an unsupervised learning technique for landmark localization based on having a model predict equivariant landmarks with respect to transformations applied to the image. We show that these techniques, improve landmark prediction considerably and can learn effective detectors even when only a small fraction of the dataset has landmark labels. We present results on two toy datasets and four real datasets, with hands and faces, and report new state-of-the-art on two datasets in the wild, e.g. with only 5% of labeled images we outperform previous state-of-the-art trained on the AFLW dataset.

引用

页码：1546 / 1555

页数：10

共 56 条

[1]

[Anonymous], PAMI

[2]

[Anonymous], IEEE C COMP VIS PATT

[3]

[Anonymous], 2015, PROC 28 INT C NEURAL

[4]

[Anonymous], 2016, IEEE Transactions on Pattern Analysis and Machine Intelligence

[5]

[Anonymous], P 9 IEEE INT WORKSH

[6]

[Anonymous], INT C LEARN REPR

[7]

[Anonymous], IEEE C COMP VIS PATT

[8]

[Anonymous], 2014, LECT NOTES COMPUT SC, DOI DOI 10.1007/978-3-319-10605-2_1

[9]

[Anonymous], 2016, ABS160502688 ARXIV

[10] Robust Discriminative Response Map Fitting with Constrained Local Models [J].

Asthana, Akshay ;

Zafeiriou, Stefanos ;

Cheng, Shiyang ;

Pantic, Maja .

2013 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2013, :3444-3451

← 1 2 3 4 5 6 →