Min-Entropy Latent Model for Weakly Supervised Object Detection

被引:141
作者
Wan, Fang [1 ]
Wei, Pengxu [1 ]
Jiao, Jianbin [1 ]
Han, Zhenjun [1 ]
Ye, Qixiang [1 ]
机构
[1] Univ Chinese Acad Sci, Beijing, Peoples R China
来源
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR) | 2018年
关键词
LOCALIZATION;
D O I
10.1109/CVPR.2018.00141
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Weakly supervised object detection is a challenging task when provided with image category supervision but required to learn, at the same time, object locations and object detectors. The inconsistency between the weak supervision and learning objectives introduces randomness to object locations and ambiguity to detectors. In this paper, a min-entropy latent model (MELM) is proposed for weakly supervised object detection. Min-entropy is used as a metric to measure the randomness of object localization during learning, as well as serving as a model to learn object locations. It aims to principally reduce the variance of positive instances and alleviate the ambiguity of detectors. MELM is deployed as two sub-models, which respectively discovers and localizes objects by minimizing the global and local entropy. MELM is unified with feature learning and optimized with a recurrent learning algorithm, which progressively transfers the weak supervision to object locations. Experiments demonstrate that MELM significantly improves the performance of weakly supervised detection, weakly supervised localization, and image classification, against the state-of-the-art approaches.
引用
收藏
页码:1297 / 1306
页数:10
相关论文
共 46 条
  • [41] Wang C, 2014, LECT NOTES COMPUT SC, V8694, P431, DOI 10.1007/978-3-319-10599-4_28
  • [42] Relaxed Multiple-Instance SVM with Application to Object Discovery
    Wang, Xinggang
    Zhu, Zhuotun
    Yao, Cong
    Bai, Xiang
    [J]. 2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, : 1224 - 1232
  • [43] Wu JJ, 2015, PROC CVPR IEEE, P3460, DOI 10.1109/CVPR.2015.7298968
  • [44] Self-learning Scene-specific Pedestrian Detectors using a Progressive Latent Model
    Ye, Qixiang
    Zhang, Tianliang
    Ke, Wei
    Qiu, Qiang
    Chen, Jie
    Sapiro, Guillermo
    Zhang, Baochang
    [J]. 30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 2057 - 2066
  • [45] Yu C.-N. J., 2009, P 26 ANN INT C MACH, P1169, DOI DOI 10.1145/1553374.1553523
  • [46] Soft Proposal Networks for Weakly Supervised Object Localization
    Zhu, Yi
    Zhou, Yanzhao
    Ye, Qixiang
    Qiu, Qiang
    Jiao, Jianbin
    [J]. 2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, : 1859 - 1868