ScribbleSup: Scribble-Supervised Convolutional Networks for Semantic Segmentation

Cited by: 651
Authors
Lin, Di [1, 2]
Dai, Jifeng [2 ]
Jia, Jiaya [1 ]
He, Kaiming [2 ]
Sun, Jian [2 ]
Affiliations
[1] Chinese Univ Hong Kong, Hong Kong, Hong Kong, Peoples R China
[2] Microsoft Res, Cambridge, MA USA
Source
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR) | 2016
DOI
10.1109/CVPR.2016.344
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Large-scale data is of crucial importance for learning semantic segmentation models, but annotating per-pixel masks is a tedious and inefficient procedure. We note that for the topic of interactive image segmentation, scribbles are very widely used in academic research and commercial software, and are recognized as one of the most user-friendly ways of interacting. In this paper, we propose to use scribbles to annotate images, and develop an algorithm to train convolutional networks for semantic segmentation supervised by scribbles. Our algorithm is based on a graphical model that jointly propagates information from scribbles to unmarked pixels and learns network parameters. We present competitive object semantic segmentation results on the PASCAL VOC dataset by using scribbles as annotations. Scribbles are also favored for annotating stuff (e.g., water, sky, grass) that has no well-defined shape, and our method shows excellent results on the PASCAL-CONTEXT dataset thanks to extra inexpensive scribble annotations. Our scribble annotations on PASCAL VOC are available at http://research.microsoft.com/en-us/um/people/jifdai/downloads/scribble_sup.
Pages: 3159-3167
Page count: 9