Weakly Supervised Salient Object Detection by Learning A Classifier-Driven Map Generator

被引：19

作者：

Hsu, Kuang-Jui ^{[1
,2
]}

Lin, Yen-Yu ^{[1
]}

Chuang, Yung-Yu ^{[1
,2
]}

机构：

[1] Acad Sinica, Res Ctr Informat Technol Innovat, Taipei 115, Taiwan

[2] Natl Taiwan Univ, Dept Comp Sci & Informat Engn, Taipei 106, Taiwan

来源：

IEEE TRANSACTIONS ON IMAGE PROCESSING | 2019年 / 28卷 / 11期

关键词：

Top-down object saliency detection; convolutional neural networks; weakly supervised learning; REGION DETECTION;

D O I：

10.1109/TIP.2019.2917224

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Top-down saliency detection aims to highlight the regions of a specific object category, and typically relies on pixel-wise annotated training data. In this paper, we address the high cost of collecting such training data by a weakly supervised approach to object saliency detection, where only image-level labels, indicating the presence or absence of a target object in an image, are available. The proposed framework is composed of two collaborative CNN modules, an image-level classifier and a pixel-level map generator. While the former distinguishes images with objects of interest from the rest, the latter is learned to generate saliency maps by which the images masked by the maps can be better predicted by the former. In addition to the top-down guidance from class labels, the map generator is derived by also exploring other cues, including the background prior, superpixel-and object proposal-based evidence. The background prior is introduced to reduce false positives. Evidence from superpixels helps preserve sharp object boundaries. The clue from object proposals improves the integrity of highlighted objects. These different types of cues greatly regularize the training process and reduces the risk of overfitting, which happens frequently when learning CNN models with few training data. Experiments show that our method achieves superior results, even outperforming fully supervised methods.

引用

页码：5435 / 5449

页数：15

共 65 条

[1] SLIC Superpixels Compared to State-of-the-Art Superpixel Methods [J].

Achanta, Radhakrishna ;

Shaji, Appu ;

Smith, Kevin ;

Lucchi, Aurelien ;

Fua, Pascal ;

Suesstrunk, Sabine .

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2012, 34 (11) :2274-2281

[2] Fast and Robust Object Segmentation with the Integral Linear Classifier [J].

Aldavert, David ;

Ramisa, Arnau ;

Lopez de Mantaras, Ramon ;

Toledo, Ricardo .

2010 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2010, :1046-1053

[3]

[Anonymous], P EUR C COMPUT VIS

[4]

[Anonymous], 2009, IEEE I CONF COMP VIS

[5]

[Anonymous], 2017, COMMUN ACM, DOI DOI 10.1145/3065386

[6]

[Anonymous], MINING PIXELS WEAKLY

[7]

[Anonymous], 2003, Proceedings of the eleventh ACM international conference on Multimedia, DOI [10.1145/957013.957094, DOI 10.1145/957013.957094]

[8]

[Anonymous], 2015, P 3 INT C LEARN REPR

[9]

[Anonymous], 2014, P INT C LEARN REPR W

[10]

[Anonymous], 2015, PROC CVPR IEEE

← 1 2 3 4 5 6 7 →