Learning to Sample: an Active Learning Framework

Cited by: 10
Authors
Shao, Jingyu [1]
Wang, Qing [1]
Liu, Fangbing [1]
Affiliations
[1] Australian Natl Univ, Res Sch Comp Sci, Acton, ACT, Australia
Source
2019 19TH IEEE INTERNATIONAL CONFERENCE ON DATA MINING (ICDM 2019) | 2019
Funding
Australian Research Council;
Keywords
active learning; meta-learning; uncertainty sampling; diversity sampling; boosting;
DOI
10.1109/ICDM.2019.00064
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Meta-learning algorithms for active learning are emerging as a promising paradigm for learning the "best" active learning strategy. However, current learning-based active learning approaches still require sufficient training data so as to generalize meta-learning models for active learning. This is contrary to the nature of active learning which typically starts with a small number of labeled samples. The unavailability of large amounts of labeled samples for training meta-learning models would inevitably lead to poor performance (e.g., instabilities and overfitting). In our paper, we tackle these issues by proposing a novel learning-based active learning framework, called Learning To Sample (LTS). This framework has two key components: a sampling model and a boosting model, which can mutually learn from each other in iterations to improve the performance of each other. Within this framework, the sampling model incorporates uncertainty sampling and diversity sampling into a unified process for optimization, enabling us to actively select the most representative and informative samples based on an optimized integration of uncertainty and diversity. To evaluate the effectiveness of the LTS framework, we have conducted extensive experiments on three different classification tasks: image classification, salary level prediction, and entity resolution. The experimental results show that our LTS framework significantly outperforms all the baselines when the label budget is limited, especially for datasets with highly imbalanced classes. In addition to this, our LTS framework can effectively tackle the cold start problem occurring in many existing active learning approaches.
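The idea of scoring unlabeled samples by both uncertainty and diversity can be illustrated with a minimal sketch. This is not the LTS method: the abstract says LTS optimizes how the two criteria are integrated, whereas the sketch below assumes a fixed, hand-chosen weight. The function names, the entropy-based uncertainty measure, the nearest-labeled-neighbor diversity measure, and the `alpha` parameter are all hypothetical choices made for illustration.

```python
import numpy as np

def uncertainty_scores(probs):
    """Entropy of the predicted class probabilities, one score per pool sample."""
    p = np.clip(probs, 1e-12, 1.0)  # avoid log(0)
    return -(p * np.log(p)).sum(axis=1)

def diversity_scores(pool, labeled):
    """Euclidean distance from each pool sample to its nearest labeled sample."""
    dists = np.linalg.norm(pool[:, None, :] - labeled[None, :, :], axis=2)
    return dists.min(axis=1)

def _minmax(x):
    """Rescale scores to [0, 1] so the two criteria are comparable."""
    span = x.max() - x.min()
    return np.zeros_like(x) if span == 0 else (x - x.min()) / span

def select_batch(probs, pool, labeled, k, alpha=0.5):
    """Return indices of the k pool samples with the highest combined score.

    alpha is an assumed fixed trade-off weight (1.0 = uncertainty only,
    0.0 = diversity only); LTS learns the integration instead of fixing it.
    """
    score = (alpha * _minmax(uncertainty_scores(probs))
             + (1 - alpha) * _minmax(diversity_scores(pool, labeled)))
    return np.argsort(-score)[:k]

# Toy pool: sample 0 is maximally uncertain, sample 1 is far from the labeled set.
probs = np.array([[0.5, 0.5], [0.99, 0.01], [0.9, 0.1]])
pool = np.array([[0.0, 0.0], [5.0, 5.0], [0.1, 0.0]])
labeled = np.array([[0.0, 0.0]])
picked = select_batch(probs, pool, labeled, k=2)
```

In an active learning loop, the selected indices would be sent for labeling, the classifier retrained, and the scores recomputed on the remaining pool.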
Pages: 538 - 547
Number of pages: 10
Related papers
50 in total
  • [31] Graph Deep Active Learning Framework for Data Deduplication
    Cao, Huan
    Du, Shengdong
    Hu, Jie
    Yang, Yan
    Horng, Shi-Jinn
    Li, Tianrui
    BIG DATA MINING AND ANALYTICS, 2024, 7 (03) : 753 - 764
  • [32] GADAL: An Active Learning Framework for Graph Anomaly Detection
    Chang, Wenjing
    Yu, Jianjun
    Zhou, Xiaojun
    WEB AND BIG DATA, PT I, APWEB-WAIM 2022, 2023, 13421 : 435 - 442
  • [33] An Active Learning Framework for Efficient Robust Policy Search
    Narayanaswami, Sai Kiran
    Sudarsanam, Nandan
    Ravindran, Balaraman
    PROCEEDINGS OF THE 5TH JOINT INTERNATIONAL CONFERENCE ON DATA SCIENCE & MANAGEMENT OF DATA, CODS COMAD 2022, 2022 : 1 - 9
  • [34] The CLEAR Framework to Implement Active Learning in STEM Education
    Lin, Jingjing
    IEEE TALE2021: IEEE INTERNATIONAL CONFERENCE ON ENGINEERING, TECHNOLOGY AND EDUCATION, 2021 : 950 - 954
  • [35] A Unified Active Learning Framework for Biomedical Relation Extraction
    Hong-Tao Zhang
    Min-Lie Huang
    Xiao-Yan Zhu
    Journal of Computer Science and Technology, 2012, 27 : 1302 - 1313
  • [36] LEAF: A Less Expert Annotation Framework with Active Learning
    Maoliniyazi, Aishan
    Ma, Chaohong
    Meng, Xiaofeng
    Peng, Yingtao
    ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PT III, PAKDD 2024, 2024, 14647 : 369 - 384
  • [37] Cardinal, a metric-based Active learning framework
    Abraham, Alexandre
    Dreyfus-Schmidt, Leo
    SOFTWARE IMPACTS, 2022, 12
  • [38] An Active Learning Framework for Efficient Condition Severity Classification
    Nissim, Nir
    Boland, Mary Regina
    Moskovitch, Robert
    Tatonetti, Nicholas P.
    Elovici, Yuval
    Shahar, Yuval
    Hripcsak, George
    ARTIFICIAL INTELLIGENCE IN MEDICINE (AIME 2015), 2015, 9105 : 13 - 24
  • [39] Anais: A Conceptual Framework for Blended Active Learning in Healthcare
    Santos, Adriano Araujo
    Beltrao Moura, Jose Antao
    Fechine Regis de Araujo, Joseana Macedo
    de Barros, Marcelo Alves
    PROCEEDINGS OF THE 8TH INTERNATIONAL CONFERENCE ON COMPUTER SUPPORTED EDUCATION, VOL 2 (CSEDU), 2016 : 199 - 206
  • [40] Substep active deep learning framework for image classification
    Guoqiang Li
    Ning Gong
    Pattern Analysis and Applications, 2021, 24 : 23 - 34