Unsupervised Object Class Discovery via Saliency-Guided Multiple Class Learning

被引:99
作者
Zhu, Jun-Yan [1 ]
Wu, Jiajun [2 ]
Xu, Yan [3 ]
Chang, Eric [4 ]
Tu, Zhuowen [5 ]
机构
[1] Univ Calif Berkeley, Div Comp Sci, Berkeley, CA 94720 USA
[2] Tsinghua Univ, Inst Interdisciplinary Informat Sci, Beijing 100084, Peoples R China
[3] Beihang Univ, Dept Biomed Engn, Beijing 100191, Peoples R China
[4] Microsoft Res Asia, Beijing 100080, Peoples R China
[5] Univ Calif San Diego, Dept Cognit Sci, La Jolla, CA 92093 USA
基金
美国国家科学基金会;
关键词
Unsupervised object discovery; object detection; multiple instance learning; weakly supervised learning; saliency; IMAGE; SCENE; LOCALIZATION; SHAPE;
D O I
10.1109/TPAMI.2014.2353617
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, we tackle the problem of common object (multiple classes) discovery from a set of input images, where we assume the presence of one object class in each image. This problem is, loosely speaking, unsupervised since we do not know a priori about the object type, location, and scale in each image. We observe that the general task of object class discovery in a fully unsupervised manner is intrinsically ambiguous; here we adopt saliency detection to propose candidate image windows/patches to turn an unsupervised learning problem into a weakly-supervised learning problem. In the paper, we propose an algorithm for simultaneously localizing objects and discovering object classes via bottom-up (saliency-guided) multiple class learning (bMCL). Our contributions are three-fold: (1) we adopt saliency detection to convert unsupervised learning into multiple instance learning, formulated as bottom-up multiple class learning (bMCL); (2) we propose an integrated framework that simultaneously performs object localization, object class discovery, and object detector training; (3) we demonstrate that our framework yields significant improvements over existing methods for multi-class object discovery and possess evident advantages over competing methods in computer vision. In addition, although saliency detection has recently attracted much attention, its practical usage for high-level vision tasks has yet to be justified. Our method validates the usefulness of saliency detection to output "noisy input" for a top-down method to extract common patterns.
引用
收藏
页码:862 / 875
页数:14
相关论文
共 61 条
  • [11] Robust Object Tracking with Online Multiple Instance Learning
    Babenko, Boris
    Yang, Ming-Hsuan
    Belongie, Serge
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2011, 33 (08) : 1619 - 1632
  • [12] iCoseg: Interactive Co-segmentation with Intelligent Scribble Guidance
    Batra, Dhruv
    Kowdle, Adarsh
    Parikh, Devi
    Luo, Jiebo
    Chen, Tsuhan
    [J]. 2010 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2010, : 3169 - 3176
  • [13] Constrained Parametric Min-Cuts for Automatic Object Segmentation
    Carreira, Joao
    Sminchisescu, Cristian
    [J]. 2010 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2010, : 3241 - 3248
  • [14] A Framework for Robust Subspace Learning
    Fernando De la Torre
    Michael J. Black
    [J]. International Journal of Computer Vision, 2003, 54 (1-3) : 117 - 142
  • [15] MAXIMUM LIKELIHOOD FROM INCOMPLETE DATA VIA EM ALGORITHM
    DEMPSTER, AP
    LAIRD, NM
    RUBIN, DB
    [J]. JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-METHODOLOGICAL, 1977, 39 (01): : 1 - 38
  • [16] Weakly Supervised Localization and Learning with Generic Knowledge
    Deselaers, Thomas
    Alexe, Bogdan
    Ferrari, Vittorio
    [J]. INTERNATIONAL JOURNAL OF COMPUTER VISION, 2012, 100 (03) : 275 - 293
  • [17] Solving the multiple instance problem with axis-parallel rectangles
    Dietterich, TG
    Lathrop, RH
    LozanoPerez, T
    [J]. ARTIFICIAL INTELLIGENCE, 1997, 89 (1-2) : 31 - 71
  • [18] Dollár P, 2008, LECT NOTES COMPUT SC, V5303, P211, DOI 10.1007/978-3-540-88688-4_16
  • [19] Endres I, 2010, LECT NOTES COMPUT SC, V6315, P575, DOI 10.1007/978-3-642-15555-0_42
  • [20] Everingham M., 2006, The pascal visual object classes challenge 2006 (voc2006) results