Unsupervised Object Class Discovery via Saliency-Guided Multiple Class Learning

被引:99
作者
Zhu, Jun-Yan [1 ]
Wu, Jiajun [2 ]
Xu, Yan [3 ]
Chang, Eric [4 ]
Tu, Zhuowen [5 ]
机构
[1] Univ Calif Berkeley, Div Comp Sci, Berkeley, CA 94720 USA
[2] Tsinghua Univ, Inst Interdisciplinary Informat Sci, Beijing 100084, Peoples R China
[3] Beihang Univ, Dept Biomed Engn, Beijing 100191, Peoples R China
[4] Microsoft Res Asia, Beijing 100080, Peoples R China
[5] Univ Calif San Diego, Dept Cognit Sci, La Jolla, CA 92093 USA
基金
美国国家科学基金会;
关键词
Unsupervised object discovery; object detection; multiple instance learning; weakly supervised learning; saliency; IMAGE; SCENE; LOCALIZATION; SHAPE;
D O I
10.1109/TPAMI.2014.2353617
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, we tackle the problem of common object (multiple classes) discovery from a set of input images, where we assume the presence of one object class in each image. This problem is, loosely speaking, unsupervised since we do not know a priori about the object type, location, and scale in each image. We observe that the general task of object class discovery in a fully unsupervised manner is intrinsically ambiguous; here we adopt saliency detection to propose candidate image windows/patches to turn an unsupervised learning problem into a weakly-supervised learning problem. In the paper, we propose an algorithm for simultaneously localizing objects and discovering object classes via bottom-up (saliency-guided) multiple class learning (bMCL). Our contributions are three-fold: (1) we adopt saliency detection to convert unsupervised learning into multiple instance learning, formulated as bottom-up multiple class learning (bMCL); (2) we propose an integrated framework that simultaneously performs object localization, object class discovery, and object detector training; (3) we demonstrate that our framework yields significant improvements over existing methods for multi-class object discovery and possess evident advantages over competing methods in computer vision. In addition, although saliency detection has recently attracted much attention, its practical usage for high-level vision tasks has yet to be justified. Our method validates the usefulness of saliency detection to output "noisy input" for a top-down method to extract common patterns.
引用
收藏
页码:862 / 875
页数:14
相关论文
共 61 条
  • [1] Measuring the Objectness of Image Windows
    Alexe, Bogdan
    Deselaers, Thomas
    Ferrari, Vittorio
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2012, 34 (11) : 2189 - 2202
  • [2] [Anonymous], P ECCV WORKSH REPR U
  • [3] [Anonymous], 2007, 2007 IEEE C COMPUTER
  • [4] [Anonymous], 2002, PROC 15 INT C NEURAL
  • [5] [Anonymous], 2007, 2007 IEEE C COMPUTER, DOI [DOI 10.1109/CVPR.2007.383267, 10.1109/CVPR.2007.383267]
  • [6] [Anonymous], 2008, CVPR
  • [7] [Anonymous], 2000, Pattern Classification
  • [8] [Anonymous], 2013, NIPS
  • [9] [Anonymous], 1995, STORAGE RETRIEVAL IM, DOI [DOI 10.1117/12.205308, 10.1117/12.205308]
  • [10] Babenko B., 2008, P WORKSH FAC REAL LI