Min-Entropy Latent Model for Weakly Supervised Object Detection

被引：141

作者：

Wan, Fang ^{[1
]}

Wei, Pengxu ^{[1
]}

Jiao, Jianbin ^{[1
]}

Han, Zhenjun ^{[1
]}

Ye, Qixiang ^{[1
]}

机构：

[1] Univ Chinese Acad Sci, Beijing, Peoples R China

来源：

2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR) | 2018年

关键词：

LOCALIZATION;

D O I：

10.1109/CVPR.2018.00141

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Weakly supervised object detection is a challenging task when provided with image category supervision but required to learn, at the same time, object locations and object detectors. The inconsistency between the weak supervision and learning objectives introduces randomness to object locations and ambiguity to detectors. In this paper, a min-entropy latent model (MELM) is proposed for weakly supervised object detection. Min-entropy is used as a metric to measure the randomness of object localization during learning, as well as serving as a model to learn object locations. It aims to principally reduce the variance of positive instances and alleviate the ambiguity of detectors. MELM is deployed as two sub-models, which respectively discovers and localizes objects by minimizing the global and local entropy. MELM is unified with feature learning and optimized with a recurrent learning algorithm, which progressively transfers the weak supervision to object locations. Experiments demonstrate that MELM significantly improves the performance of weakly supervised detection, weakly supervised localization, and image classification, against the state-of-the-art approaches.

引用

页码：1297 / 1306

页数：10

共 46 条

[41] Wang C, 2014, LECT NOTES COMPUT SC, V8694, P431, DOI 10.1007/978-3-319-10599-4_28
[42] Relaxed Multiple-Instance SVM with Application to Object Discovery
Wang, Xinggang
Zhu, Zhuotun
Yao, Cong
Bai, Xiang
[J]. 2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, : 1224 - 1232
[43] Wu JJ, 2015, PROC CVPR IEEE, P3460, DOI 10.1109/CVPR.2015.7298968
[44] Self-learning Scene-specific Pedestrian Detectors using a Progressive Latent Model
Ye, Qixiang
Zhang, Tianliang
Ke, Wei
Qiu, Qiang
Chen, Jie
Sapiro, Guillermo
Zhang, Baochang
[J]. 30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 2057 - 2066
[45] Yu C.-N. J., 2009, P 26 ANN INT C MACH, P1169, DOI DOI 10.1145/1553374.1553523
[46] Soft Proposal Networks for Weakly Supervised Object Localization
Zhu, Yi
Zhou, Yanzhao
Ye, Qixiang
Qiu, Qiang
Jiao, Jianbin
[J]. 2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, : 1859 - 1868

← 1 2 3 4 5 →