Unsupervised Object Class Discovery via Saliency-Guided Multiple Class Learning

被引：99

作者：

Zhu, Jun-Yan ^{[1
]}

Wu, Jiajun ^{[2
]}

Xu, Yan ^{[3
]}

Chang, Eric ^{[4
]}

Tu, Zhuowen ^{[5
]}

机构：

[1] Univ Calif Berkeley, Div Comp Sci, Berkeley, CA 94720 USA

[2] Tsinghua Univ, Inst Interdisciplinary Informat Sci, Beijing 100084, Peoples R China

[3] Beihang Univ, Dept Biomed Engn, Beijing 100191, Peoples R China

[4] Microsoft Res Asia, Beijing 100080, Peoples R China

[5] Univ Calif San Diego, Dept Cognit Sci, La Jolla, CA 92093 USA

来源：

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE | 2015年 / 37卷 / 04期

基金：

美国国家科学基金会;

关键词：

Unsupervised object discovery; object detection; multiple instance learning; weakly supervised learning; saliency; IMAGE; SCENE; LOCALIZATION; SHAPE;

D O I：

10.1109/TPAMI.2014.2353617

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

In this paper, we tackle the problem of common object (multiple classes) discovery from a set of input images, where we assume the presence of one object class in each image. This problem is, loosely speaking, unsupervised since we do not know a priori about the object type, location, and scale in each image. We observe that the general task of object class discovery in a fully unsupervised manner is intrinsically ambiguous; here we adopt saliency detection to propose candidate image windows/patches to turn an unsupervised learning problem into a weakly-supervised learning problem. In the paper, we propose an algorithm for simultaneously localizing objects and discovering object classes via bottom-up (saliency-guided) multiple class learning (bMCL). Our contributions are three-fold: (1) we adopt saliency detection to convert unsupervised learning into multiple instance learning, formulated as bottom-up multiple class learning (bMCL); (2) we propose an integrated framework that simultaneously performs object localization, object class discovery, and object detector training; (3) we demonstrate that our framework yields significant improvements over existing methods for multi-class object discovery and possess evident advantages over competing methods in computer vision. In addition, although saliency detection has recently attracted much attention, its practical usage for high-level vision tasks has yet to be justified. Our method validates the usefulness of saliency detection to output "noisy input" for a top-down method to extract common patterns.

引用

页码：862 / 875

页数：14

共 61 条

[1] Measuring the Objectness of Image Windows
Alexe, Bogdan
Deselaers, Thomas
Ferrari, Vittorio
[J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2012, 34 (11) : 2189 - 2202
[2] [Anonymous], P ECCV WORKSH REPR U
[3] [Anonymous], 2007, 2007 IEEE C COMPUTER
[4] [Anonymous], 2002, PROC 15 INT C NEURAL
[5] [Anonymous], 2007, 2007 IEEE C COMPUTER, DOI [DOI 10.1109/CVPR.2007.383267, 10.1109/CVPR.2007.383267]
[6] [Anonymous], 2008, CVPR
[7] [Anonymous], 2000, Pattern Classification
[8] [Anonymous], 2013, NIPS
[9] [Anonymous], 1995, STORAGE RETRIEVAL IM, DOI [DOI 10.1117/12.205308, 10.1117/12.205308]
[10] Babenko B., 2008, P WORKSH FAC REAL LI

← 1 2 3 4 5 6 7 →