Weaklier Supervised Semantic Segmentation With Only One Image Level Annotation per Category

被引:22
作者
Li, Xi [1 ]
Ma, Huimin [1 ]
Luo, Xiong [2 ]
机构
[1] Tsinghua Univ, Dept Elect Engn, Beijing 100084, Peoples R China
[2] Univ Sci & Technol Beijing, Sch Comp & Commun Engn, Beijing 100083, Peoples R China
基金
中国国家自然科学基金;
关键词
Weaklier supervision; semantic segmentation; dual-branch iterative learning; CLASSIFICATION; SALIENCY;
D O I
10.1109/TIP.2019.2930874
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Image semantic segmentation tasks and methods based on weakly supervised conditions have been proposed and achieve better and better performance in recent years. However, the purpose of these tasks is mainly to simplify the labeling work. In this paper, we establish a new and more challenging task condition: weaklier supervision with one image level annotation per category, which only provides prior knowledge that humans need to recognize new objects, and aims to achieve pixel-level object semantic understanding. In order to solve this problem, a three-stage semantic segmentation framework is put forward, which realizes image level, pixel level, and object common features learning from coarse to fine grade, and finally obtains semantic segmentation results with accurate and complete object regions. Researches on PASCAL VOC 2012 dataset demonstrates the effectiveness of the proposed method, which makes an obvious improvement compared to baselines. Based on fewer supervised information, the method also provides satisfactory performance compared to weakly supervised learning-based methods with complete image-level annotations.
引用
收藏
页码:128 / 141
页数:14
相关论文
共 50 条
[11]   3D Object Proposals Using Stereo Imagery for Accurate Object Class Detection [J].
Chen, Xiaozhi ;
Kundu, Kaustav ;
Zhu, Yukun ;
Ma, Huimin ;
Fidler, Sanja ;
Urtasun, Raquel .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2018, 40 (05) :1259-1272
[12]   The Cityscapes Dataset for Semantic Urban Scene Understanding [J].
Cordts, Marius ;
Omran, Mohamed ;
Ramos, Sebastian ;
Rehfeld, Timo ;
Enzweiler, Markus ;
Benenson, Rodrigo ;
Franke, Uwe ;
Roth, Stefan ;
Schiele, Bernt .
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :3213-3223
[13]   BoxSup: Exploiting Bounding Boxes to Supervise Convolutional Networks for Semantic Segmentation [J].
Dai, Jifeng ;
He, Kaiming ;
Sun, Jian .
2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, :1635-1643
[14]   The Pascal Visual Object Classes (VOC) Challenge [J].
Everingham, Mark ;
Van Gool, Luc ;
Williams, Christopher K. I. ;
Winn, John ;
Zisserman, Andrew .
INTERNATIONAL JOURNAL OF COMPUTER VISION, 2010, 88 (02) :303-338
[15]   Multi-Evidence Filtering and Fusion for Multi-Label Classification, Object Detection and Semantic Segmentation Based onWeakly Supervised Learning [J].
Ge, Weifeng ;
Yang, Sibei ;
Yu, Yizhou .
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, :1277-1286
[16]   Fast R-CNN [J].
Girshick, Ross .
2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, :1440-1448
[17]   Small Object Sensitive Segmentation of Urban Street Scene With Spatial Adjacency Between Object Classes [J].
Guo, Dazhou ;
Zhu, Ligeng ;
Lu, Yuhang ;
Yu, Hongkai ;
Wang, Song .
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2019, 28 (06) :2643-2653
[18]  
Hariharan B, 2011, IEEE I CONF COMP VIS, P991, DOI 10.1109/ICCV.2011.6126343
[19]   Reformulating Level Sets as Deep Recurrent Neural Network Approach to Semantic Segmentation [J].
Hoang Ngan Le, T. ;
Kha Gia Quach ;
Khoa Luu ;
Chi Nhan Duong ;
Savvides, Marios .
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2018, 27 (05) :2393-2407
[20]   Weakly-Supervised Semantic Segmentation Network with Deep Seeded Region Growing [J].
Huang, Zilong ;
Wang, Xinggang ;
Wang, Jiasi ;
Liu, Wenyu ;
Wang, Jingdong .
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, :7014-7023