Robust multiple-instance learning ensembles using random subspace instance selection

被引:42
作者
Carbonneau, Marc-Andre [1 ,2 ]
Granger, Eric [2 ]
Raymond, Alexandre J. [1 ]
Gagnon, Ghyslain [1 ]
机构
[1] Univ Quebec, Ecole Technol Super, Lab Commun & Integrat Microelect LACIME, 1100 Rue Notre Dame Ouest, Montreal, PQ H3C 1K3, Canada
[2] Univ Quebec, Ecole Technol Super, Lab Imagerie Vis & Intelligence Artificielle, 1100 Rue Notre Dame Ouest, Montreal, PQ H3C 1K3, Canada
基金
加拿大自然科学与工程研究理事会;
关键词
Multiple-instance learning; Random subspace methods; Classifier ensembles; Instance selection; Weakly supervised learning; Classification; MIL; CLASSIFICATION;
D O I
10.1016/j.patcog.2016.03.035
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Many real-world pattern recognition problems can be modeled using multiple-instance learning (MIL), where instances are grouped into bags, and each bag is assigned a label. State-of-the-art MIL methods provide a high level of performance when strong assumptions are made regarding the underlying data distributions, and the proportion of positive to negative instances in positive bags. In this paper, a new method called Random Subspace Instance Selection (RSIS) is proposed for the robust design of MIL ensembles without any prior assumptions on the data structure and the proportion of instances in bags. First, instance selection probabilities are computed based on training data clustered in random sub-spaces. A pool of classifiers is then generated using the training subsets created with these selection probabilities. By using RSIS, MIL ensembles are more robust to many data distributions and noise, and are not adversely affected by the proportion of positive instances in positive bags because training instances are repeatedly selected in a probabilistic manner. Moreover, RSIS also allows the identification of positive instances on an individual basis, as required in many practical applications. Results obtained with several real-world and synthetic databases show the robustness of MIL ensembles designed with the proposed RSIS method over a range of witness rates, noisy features and data distributions compared to reference methods in the literature. (C) 2016 Elsevier Ltd. All rights reserved.
引用
收藏
页码:83 / 99
页数:17
相关论文
共 54 条
  • [1] Confidence-Rated Multiple Instance Boosting for Object Detection
    Ali, Karim
    Saenko, Kate
    [J]. 2014 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2014, : 2433 - 2440
  • [2] Amores J., 2010, Proceedings of the 2010 20th International Conference on Pattern Recognition (ICPR 2010), P4246, DOI 10.1109/ICPR.2010.1032
  • [3] Multiple instance classification: Review, taxonomy and comparative study
    Amores, Jaume
    [J]. ARTIFICIAL INTELLIGENCE, 2013, 201 : 81 - 105
  • [4] [Anonymous], 2000, THESIS
  • [5] [Anonymous], 2008, P 25 INT C MACHINE L, DOI [10.1145/1390156.1390273, DOI 10.1145/1390156.1390273]
  • [6] [Anonymous], 2010, ADV NEURAL INFORM PR
  • [7] Auer P., 1997, Proceedings of the Fourteenth International Conference on Machine Learning, P21
  • [8] Babenko B., 2011, International Conference on Machine Learning (ICML), P81
  • [9] Robust Object Tracking with Online Multiple Instance Learning
    Babenko, Boris
    Yang, Ming-Hsuan
    Belongie, Serge
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2011, 33 (08) : 1619 - 1632
  • [10] Random forests
    Breiman, L
    [J]. MACHINE LEARNING, 2001, 45 (01) : 5 - 32