Sparse multiple instance learning as document classification

被引:0
|
作者
Shengye Yan
Xiaodong Zhu
Guoqing Liu
Jianxin Wu
机构
[1] NUIST,B
[2] Minieye,DAT, CICAEET, School of Information and Control
[3] Youjia Innovation LLC,National Key Laboratory for Novel Software Technology
[4] Nanjing University,undefined
来源
Multimedia Tools and Applications | 2017年 / 76卷
关键词
Sparse multiple instance learning; Low witness rate; Structural representation; Document classification;
D O I
暂无
中图分类号
学科分类号
摘要
This work focuses on multiple instance learning (MIL) with sparse positive bags (which we name as sparse MIL). A structural representation is presented to encode both instances and bags. This representation leads to a non-i.i.d. MIL algorithm, miStruct, which uses a structural similarity to compare bags. Furthermore, MIL with this representation is shown to be equivalent to a document classification problem. Document classification also suffers from the fact that only few paragraphs/words are useful in revealing the category of a document. By using the TF-IDF representation which has excellent empirical performance in document classification, the miDoc method is proposed. The proposed methods achieve significantly higher accuracies and AUC (area under the ROC curve) than the state-of-the-art in a large number of sparse MIL problems, and the document classification analogy explains their efficacy in sparse MIL problems.
引用
收藏
页码:4553 / 4570
页数:17
相关论文
共 50 条
  • [1] Sparse multiple instance learning as document classification
    Yan, Shengye
    Zhu, Xiaodong
    Liu, Guoqing
    Wu, Jianxin
    MULTIMEDIA TOOLS AND APPLICATIONS, 2017, 76 (03) : 4553 - 4570
  • [2] Document Image Classification and Labeling using Multiple Instance Learning
    Kumar, Jayant
    Pillai, Jaishanker
    Doermann, David
    11TH INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR 2011), 2011, : 1059 - 1063
  • [3] Classification of COPD with Multiple Instance Learning
    Cheplygina, Veronika
    Sorensen, Lauge
    Tax, David M. J.
    Pedersen, Jesper Holst
    Loog, Marco
    de Bruijne, Marleen
    2014 22ND INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2014, : 1508 - 1513
  • [4] Multiple instance learning for malware classification
    Stiborek, Jan
    Pevny, Tomas
    Rehak, Martin
    EXPERT SYSTEMS WITH APPLICATIONS, 2018, 93 : 346 - 357
  • [5] Sparse Network Inversion for Key Instance Detection in Multiple Instance Learning
    Shin, Beomjo
    Cho, Junsu
    Yu, Hwanjo
    Choi, Seungjin
    2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2021, : 4083 - 4090
  • [6] Score Thresholding for Accurate Instance Classification in Multiple Instance Learning
    Carbonneau, Marc-Andre
    Granger, Eric
    Gagnon, Ghyslain
    2016 SIXTH INTERNATIONAL CONFERENCE ON IMAGE PROCESSING THEORY, TOOLS AND APPLICATIONS (IPTA), 2016,
  • [7] Sparse multiple instance learning with non -convex penalty
    Zhang, Yuqi
    Zhang, Haibin
    Tian, Yingjie
    NEUROCOMPUTING, 2020, 391 : 142 - 156
  • [8] Diversified Multiple Instance Learning for Document-Level Multi-Aspect Sentiment Classification
    Ji, Yunjie
    Liu, Hao
    He, Bolei
    Xiao, Xinyan
    Wu, Hua
    Yu, Yanhua
    PROCEEDINGS OF THE 2020 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP), 2020, : 7012 - 7023
  • [9] Learning Sparse Kernel Classifiers for Multi-Instance Classification
    Fu, Zhouyu
    Lu, Guojun
    Ting, Kai Ming
    Zhang, Dengsheng
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2013, 24 (09) : 1377 - 1389
  • [10] MULTIPLE INSTANCE LEARNING WITH CRITICAL INSTANCE FOR WHOLE SLIDE IMAGE CLASSIFICATION
    Zhou, Yuanpin
    Lu, Yao
    2023 IEEE 20TH INTERNATIONAL SYMPOSIUM ON BIOMEDICAL IMAGING, ISBI, 2023,