PLVI-CE: a multi-label active learning algorithm with simultaneously considering uncertainty and diversity

被引:0
作者
Gu, Yan [1 ]
Duan, Jicong [1 ]
Yu, Hualong [1 ]
Yang, Xibei [1 ]
Gao, Shang [1 ]
机构
[1] Jiangsu Univ Sci & Technol, Sch Comp, Zhenjiang 212100, Jiangsu, Peoples R China
基金
中国国家自然科学基金;
关键词
Multi-label active learning; Label-weighted extreme learning machine; Uncertainty; Diversity; Label propagation; Cross entropy; MACHINE; QUERY; ELM;
D O I
10.1007/s10489-023-05008-2
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In multi-label learning, each instance simultaneously associates with multiple labels, which means that labeling such instances is quite costly. Active learning, as an important machine learning paradigm, learns the classification model by querying merely a small portion of data with important information, by means of which, the labeling cost can be greatly reduced during the training process and an accurate and robust classification model could be obtained. Therefore, multi-label active learning (MLAL) has garnered increasing attentions. The primary challenge in MLAL lies in designing an effective query strategy to measure uniform information about unlabeled instances throughout all labels. In this study, we propose a query strategy named predicted label vectors inconsistency and cross entropy measure (PLVI-CE) that considers both uncertainty and diversity measures. In PLVI-CE, the uncertainty is measured by the inconsistency between two predicted label vectors from the same unlabeled instance, and the diversity is assessed by the average discrepancy in posterior probabilities between each unlabeled instance and all instances in the labeled set. Furthermore, in this study, we try to adopt label-weighted extreme learning machine (LW-ELM) as the base classifier in the MLAL framework with considering its following advantages: (1) LW-ELM has a low computational cost, (2) LW-ELM has strong generalization performance, and (3) LW-ELM can be directly used to classify multi-label data with class imbalance distributions, hence providing approximately unbiased instance querying during MLAL. Experimental results on 12 benchmark multi-label datasets indicate the effectiveness and superiority of the proposed PLVI-CE algorithm in comparison with several current state-of-the-art MLAL algorithms.
引用
收藏
页码:27844 / 27864
页数:21
相关论文
共 50 条
  • [1] Carrillo D., 2013, Trends in practical applications of agents and multiagent systems, P181
  • [2] Adaptive Batch Mode Active Learning
    Chakraborty, Shayok
    Balasubramanian, Vineeth
    Panchanathan, Sethuraman
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2015, 26 (08) : 1747 - 1760
  • [3] Stable matching-based two-way selection in multi-label active learning with imbalanced data
    Chen, Shuyue
    Wang, Ran
    Lu, Jian
    Wang, Xizhao
    [J]. INFORMATION SCIENCES, 2022, 610 : 281 - 299
  • [4] Demsar J, 2006, J MACH LEARN RES, V7, P1
  • [5] Robust and Discriminative Labeling for Multi-Label Active Learning Based on Maximum Correntropy Criterion
    Du, Bo
    Wang, Zengmao
    Zhang, Lefei
    Zhang, Liangpei
    Tao, Dacheng
    [J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2017, 26 (04) : 1694 - 1707
  • [6] Multi-Label Active Learning-Based Machine Learning Model for Heart Disease Prediction
    El-Hasnony, Ibrahim M.
    Elzeki, Omar M.
    Alshehri, Ali
    Salem, Hanaa
    [J]. SENSORS, 2022, 22 (03)
  • [7] Selective sampling using the query by committee algorithm
    Freund, Y
    Seung, HS
    Shamir, E
    Tishby, N
    [J]. MACHINE LEARNING, 1997, 28 (2-3) : 133 - 168
  • [8] Multi-label active learning by model guided distribution matching
    Gao, Nengneng
    Huang, Sheng-Jun
    Chen, Songcan
    [J]. FRONTIERS OF COMPUTER SCIENCE, 2016, 10 (05) : 845 - 855
  • [9] Advanced nonparametric tests for multiple comparisons in the design of experiments in computational intelligence and data mining: Experimental analysis of power
    Garcia, Salvador
    Fernandez, Alberto
    Luengo, Julian
    Herrera, Francisco
    [J]. INFORMATION SCIENCES, 2010, 180 (10) : 2044 - 2064
  • [10] Multi-label learning: a review of the state of the art and ongoing research
    Gibaja, Eva
    Ventura, Sebastian
    [J]. WILEY INTERDISCIPLINARY REVIEWS-DATA MINING AND KNOWLEDGE DISCOVERY, 2014, 4 (06) : 411 - 444