Multi-instance clustering with applications to multi-instance prediction

被引:109
|
作者
Zhang, Min-Ling [1 ,2 ]
Zhou, Zhi-Hua [1 ]
机构
[1] Nanjing Univ, Natl Key Lab Novel Software Technol, Nanjing 210093, Peoples R China
[2] Hohai Univ, Coll Comp & Informat Engn, Nanjing 210098, Peoples R China
基金
国家高技术研究发展计划(863计划); 美国国家科学基金会;
关键词
Machine learning; Multi-instance learning; Clustering; Representation transformation; NEURAL-NETWORKS;
D O I
10.1007/s10489-007-0111-x
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In the setting of multi-instance learning, each object is represented by a bag composed of multiple instances instead of by a single instance in a traditional learning setting. Previous works in this area only concern multi-instance prediction problems where each bag is associated with a binary (classification) or real-valued (regression) label. However, unsupervised multi-instance learning where bags are without labels has not been studied. In this paper, the problem of unsupervised multi-instance learning is addressed where a multi-instance clustering algorithm named Bamic is proposed. Briefly, by regarding bags as atomic data items and using some form of distance metric to measure distances between bags, Bamic adapts the popular k -Medoids algorithm to partition the unlabeled training bags into k disjoint groups of bags. Furthermore, based on the clustering results, a novel multi-instance prediction algorithm named Bartmip is developed. Firstly, each bag is re-represented by a k-dimensional feature vector, where the value of the i-th feature is set to be the distance between the bag and the medoid of the i-th group. After that, bags are transformed into feature vectors so that common supervised learners are used to learn from the transformed feature vectors each associated with the original bag's label. Extensive experiments show that Bamic could effectively discover the underlying structure of the data set and Bartmip works quite well on various kinds of multi-instance prediction problems.
引用
收藏
页码:47 / 68
页数:22
相关论文
共 50 条
  • [41] Instance-Level Label Propagation with Multi-Instance Learning
    Wang, Qifan
    Chechik, Gal
    Sun, Chen
    Shen, Bin
    PROCEEDINGS OF THE TWENTY-SIXTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2017, : 2943 - 2949
  • [42] A multi-instance multi-label learning algorithm based on instance correlations
    Liu, Chanjuan
    Chen, Tongtong
    Ding, Xinmiao
    Zou, Hailin
    Tong, Yan
    MULTIMEDIA TOOLS AND APPLICATIONS, 2016, 75 (19) : 12263 - 12284
  • [43] Dynamic Programming for Instance Annotation in Multi-Instance Multi-Label Learning
    Pham, Anh T.
    Raich, Raviv
    Fern, Xiaoli Z.
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2017, 39 (12) : 2381 - 2394
  • [44] A multi-instance multi-label learning algorithm based on instance correlations
    Chanjuan Liu
    Tongtong Chen
    Xinmiao Ding
    Hailin Zou
    Yan Tong
    Multimedia Tools and Applications, 2016, 75 : 12263 - 12284
  • [45] Learnability of multi-instance multi-label learning
    Wang Wei
    Zhou ZhiHua
    CHINESE SCIENCE BULLETIN, 2012, 57 (19): : 2488 - 2491
  • [46] A Boosting Approach to Exploit Instance Correlations for Multi-Instance Classification
    Li, Yali
    Wang, Shengjin
    Tian, Qi
    Ding, Xiaoqing
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2016, 27 (12) : 2740 - 2747
  • [47] Learnability of multi-instance multi-label learning
    WANG Wei & ZHOU ZhiHua National Key Laboratory for Novel Software Technology
    Chinese Science Bulletin, 2012, 57 (19) : 2492 - 2495
  • [48] Multi-Instance Multi-Label Active Learning
    Huang, Sheng-Jun
    Gao, Nengneng
    Chen, Songcan
    PROCEEDINGS OF THE TWENTY-SIXTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2017, : 1886 - 1892
  • [49] Fast Multi-Instance Multi-Label Learning
    Huang, Sheng-Jun
    Gao, Wei
    Zhou, Zhi-Hua
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2019, 41 (11) : 2614 - 2627
  • [50] Active Multi-Instance Multi-Label Learning
    Retz, Robert
    Schwenker, Friedhelm
    ANALYSIS OF LARGE AND COMPLEX DATA, 2016, : 91 - 101