SFNet: Learning Object-aware Semantic Correspondence

被引:103
作者
Lee, Junghyup [1 ]
Kim, Dohyung [1 ]
Ponce, Jean [2 ,3 ]
Ham, Bumsub [1 ]
机构
[1] Yonsei Univ, Sch Elect & Elect Engn, Seoul, South Korea
[2] PSL Res Univ, CNRS, ENS, Dept Informat ENS, Paris, France
[3] INRIA, Paris, France
来源
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019) | 2019年
基金
新加坡国家研究基金会;
关键词
D O I
10.1109/CVPR.2019.00238
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We address the problem of semantic correspondence, that is, establishing a dense flow field between images depicting different instances of the same object or scene category. We propose to use images annotated with binary foreground masks and subjected to synthetic geometric deformations to train a convolutional neural network (CNN) for this task. Using these masks as part of the supervisory signal offers a good compromise between semantic flow methods, where the amount of training data is limited by the cost of manually selecting point correspondences, and semantic alignment ones, where the regression of a single global geometric transformation between images may be sensitive to image-specific details such as background clutter. We propose a new CNN architecture, dubbed SFNet, which implements this idea. It leverages a new and differentiable version of the argmax function for end-to-end training, with a loss that combines mask and flow consistency with smoothness terms. Experimental results demonstrate the effectiveness of our approach, which significantly outperforms the state of the art on standard benchmarks.
引用
收藏
页码:2273 / 2282
页数:10
相关论文
共 52 条
  • [1] [Anonymous], 2009, ICCV
  • [2] [Anonymous], 2017, CVPR
  • [3] [Anonymous], 2014, NIPS
  • [4] [Anonymous], 2004, ECCV
  • [5] [Anonymous], IJCV
  • [6] [Anonymous], IEEE TPAMI
  • [7] [Anonymous], 2015, CVPR
  • [8] [Anonymous], 2009, ICCV
  • [9] [Anonymous], IEEE TPAMI
  • [10] [Anonymous], 2015, CVPR