Unsupervised learning of object landmarks by factorized spatial embeddings

被引:92
作者
Thewlis, James [1 ]
Bilen, Hakan [1 ,2 ]
Vedaldi, Andrea [1 ]
机构
[1] Univ Oxford, Oxford, England
[2] Univ Edinburgh, Edinburgh, Midlothian, Scotland
来源
2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV) | 2017年
基金
英国工程与自然科学研究理事会;
关键词
D O I
10.1109/ICCV.2017.348
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Learning automatically the structure of object categories remains an important open problem in computer vision. In this paper, we propose a novel unsupervised approach that can discover and learn landmarks in object categories, thus characterizing their structure. Our approach is based on factorizing image deformations, as induced by a viewpoint change or an object deformation, by learning a deep neural network that detects landmarks consistently with such visual effects. Furthermore, we show that the learned landmarks establish meaningful correspondences between different object instances in a category without having to impose this requirement explicitly. We assess the method qualitatively on a variety of object types, natural and man-made. We also show that our unsupervised landmarks are highly predictive of manually-annotated landmarks in face benchmark datasets, and can be used to regress these with a high degree of accuracy.
引用
收藏
页码:3229 / 3238
页数:10
相关论文
共 80 条
[1]  
[Anonymous], CVPR WORKSH
[2]  
[Anonymous], P CVPR
[3]  
[Anonymous], 2004, IJCV
[4]  
[Anonymous], P ECCV
[5]  
[Anonymous], 2015, P ICCV
[6]  
[Anonymous], 2015, P ICCV
[7]  
[Anonymous], P ICCV
[8]  
[Anonymous], P ECCV
[9]  
[Anonymous], 1 IEEE INT WORKSH BE
[10]  
[Anonymous], 2006, SCIENCE