Unsupervised learning of object landmarks by factorized spatial embeddings

被引：92

作者：

Thewlis, James ^{[1
]}

Bilen, Hakan ^{[1
,2
]}

Vedaldi, Andrea ^{[1
]}

机构：

[1] Univ Oxford, Oxford, England

[2] Univ Edinburgh, Edinburgh, Midlothian, Scotland

来源：

2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV) | 2017年

基金：

英国工程与自然科学研究理事会;

关键词：

D O I：

10.1109/ICCV.2017.348

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Learning automatically the structure of object categories remains an important open problem in computer vision. In this paper, we propose a novel unsupervised approach that can discover and learn landmarks in object categories, thus characterizing their structure. Our approach is based on factorizing image deformations, as induced by a viewpoint change or an object deformation, by learning a deep neural network that detects landmarks consistently with such visual effects. Furthermore, we show that the learned landmarks establish meaningful correspondences between different object instances in a category without having to impose this requirement explicitly. We assess the method qualitatively on a variety of object types, natural and man-made. We also show that our unsupervised landmarks are highly predictive of manually-annotated landmarks in face benchmark datasets, and can be used to regress these with a high degree of accuracy.

引用

页码：3229 / 3238

页数：10