Discrepant multiple instance learning for weakly supervised object detection

被引:32
作者
Gao, Wei [1 ]
Wan, Fang [1 ]
Yue, Jun [2 ]
Xu, Songcen [2 ]
Ye, Qixiang [1 ]
机构
[1] Univ Chinese Acad Sci, Beijing 101408, Peoples R China
[2] Huawei Noahs Ark Lab, Shenzhen 518000, Peoples R China
基金
中国国家自然科学基金;
关键词
Weakly supervised detection; Multiple instance learning; Learner discrepancy; Collaborative learning; LOCALIZATION;
D O I
10.1016/j.patcog.2021.108233
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Multiple Instance Learning (MIL) is a fundamental method for weakly supervised object detection (WSOD), but experiences difficulty in excluding local optimal solutions and may miss objects or falsely localize object parts. In this paper, we introduce discrepantly collaborative modules into MIL and thereby create discrepant multiple instance learning (D-MIL), pursuing optimal solutions in a simple-yet-effective way. D-MIL adopts multiple MIL learners to pursue discrepant yet complementary solutions indicating object parts, which are fused with a collaboration module for precise object localization. D-MIL implements a new "teachers-students" model, where MIL learners act as "teachers" and object detectors as "students". Multiple teachers provide rich yet complementary information, which are absorbed by students and transferred back to reinforce the performance of teachers. Experiments show that D-MIL significantly improves the baseline while achieves state-of-the-art performance on the challenging MS-COCO object detection benchmark. (c) 2021 Elsevier Ltd. All rights reserved.
引用
收藏
页数:11
相关论文
共 69 条
[1]   Dissimilarity Coefficient based Weakly Supervised Object Detection [J].
Arun, Aditya ;
Jawahar, C., V ;
Kumar, M. Pawan .
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :9424-9433
[2]   Ensemble modelling or selecting the best model: Many could be better than one [J].
Barai, SV ;
Reich, Y .
AI EDAM-ARTIFICIAL INTELLIGENCE FOR ENGINEERING DESIGN ANALYSIS AND MANUFACTURING, 1999, 13 (05) :377-386
[3]   MINIMUM HELLINGER DISTANCE ESTIMATES FOR PARAMETRIC MODELS [J].
BERAN, R .
ANNALS OF STATISTICS, 1977, 5 (03) :445-463
[4]   Weakly Supervised Deep Detection Networks [J].
Bilen, Hakan ;
Vedaldi, Andrea .
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :2846-2854
[5]  
Bilen H, 2015, PROC CVPR IEEE, P1081, DOI 10.1109/CVPR.2015.7298711
[6]  
Blum A., 1998, Proceedings of the Eleventh Annual Conference on Computational Learning Theory, P92, DOI 10.1145/279943.279962
[7]   Multiple instance learning: A survey of problem characteristics and applications [J].
Carbonneau, Marc-Andre ;
Cheplygina, Veronika ;
Granger, Eric ;
Gagnon, Ghyslain .
PATTERN RECOGNITION, 2018, 77 :329-353
[8]   Weakly-Supervised Semantic Segmentation via Sub-category Exploration [J].
Chang, Yu-Ting ;
Wang, Qiaosong ;
Hung, Wei-Chih ;
Piramuthu, Robinson ;
Tsai, Yi-Hsuan ;
Yang, Ming-Hsuan .
2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2020), 2020, :8988-8997
[9]   SLV: Spatial Likelihood Voting forWeakly Supervised Object Detection [J].
Chen, Ze ;
Fu, Zhihang ;
Jiang, Rongxin ;
Chen, Yaowu ;
Hua, Xian-Sheng .
2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2020), 2020, :12992-13001
[10]   Weakly Supervised Object Localization with Multi-Fold Multiple Instance Learning [J].
Cinbis, Ramazan Gokberk ;
Verbeek, Jakob ;
Schmid, Cordelia .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2017, 39 (01) :189-203