Learning to Sample: an Active Learning Framework

被引:10
|
作者
Shao, Jingyu [1 ]
Wang, Qing [1 ]
Liu, Fangbing [1 ]
机构
[1] Australian Natl Univ, Res Sch Comp Sci, Acton, ACT, Australia
来源
2019 19TH IEEE INTERNATIONAL CONFERENCE ON DATA MINING (ICDM 2019) | 2019年
基金
澳大利亚研究理事会;
关键词
active learning; meta-learning; uncertainty sampling; diversity sampling; boosting;
D O I
10.1109/ICDM.2019.00064
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Meta-learning algorithms for active learning are emerging as a promising paradigm for learning the "best" active learning strategy. However, current learning-based active learning approaches still require sufficient training data so as to generalize meta-learning models for active learning. This is contrary to the nature of active learning which typically starts with a small number of labeled samples. The unavailability of large amounts of labeled samples for training meta-learning models would inevitably lead to poor performance (e.g., instabilities and overfitting). In our paper, we tackle these issues by proposing a novel learning-based active learning framework, called Learning To Sample (LTS). This framework has two key components: a sampling model and a boosting model, which can mutually learn from each other in iterations to improve the performance of each other. Within this framework, the sampling model incorporates uncertainty sampling and diversity sampling into a unified process for optimization, enabling us to actively select the most representative and informative samples based on an optimized integration of uncertainty and diversity. To evaluate the effectiveness of the LTS framework, we have conducted extensive experiments on three different classification tasks: image classification, salary level prediction, and entity resolution. The experimental results show that our LTS framework significantly outperforms all the baselines when the label budget is limited, especially for datasets with highly imbalanced classes. In addition to this, our LTS framework can effectively tackle the cold start problem occurring in many existing active learning approaches.
引用
收藏
页码:538 / 547
页数:10
相关论文
共 50 条
  • [41] Active Learning for Streaming Data in A Contextual Bandit Framework
    Song, Linqi
    Xu, Jie
    Li, Congduan
    ICCDE 2019: PROCEEDINGS OF THE 2019 5TH INTERNATIONAL CONFERENCE ON COMPUTING AND DATA ENGINEERING, 2019, : 29 - 35
  • [42] Amaretto: An Active Learning Framework for Money Laundering Detection
    Labanca, Danilo
    Primerano, Luca
    Markland-Montgomery, Marcus
    Polino, Mario
    Carminati, Michele
    Zanero, Stefano
    IEEE ACCESS, 2022, 10 : 41720 - 41739
  • [43] A Unified Active Learning Framework for Biomedical Relation Extraction
    Zhang, Hong-Tao
    Huang, Min-Lie
    Zhu, Xiao-Yan
    JOURNAL OF COMPUTER SCIENCE AND TECHNOLOGY, 2012, 27 (06) : 1302 - 1313
  • [44] A Simple yet Effective Framework for Active Learning to Rank
    Qingzhong Wang
    Haifang Li
    Haoyi Xiong
    Wen Wang
    Jiang Bian
    Yu Lu
    Shuaiqiang Wang
    Zhicong Cheng
    Dejing Dou
    Dawei Yin
    Machine Intelligence Research, 2024, 21 : 169 - 183
  • [45] Substep active deep learning framework for image classification
    Li, Guoqiang
    Gong, Ning
    PATTERN ANALYSIS AND APPLICATIONS, 2021, 24 (01) : 23 - 34
  • [46] PathAL: An Active Learning Framework for Histopathology Image Analysis
    Li, Wenyuan
    Li, Jiayun
    Wang, Zichen
    Polson, Jennifer
    Sisk, Anthony E.
    Sajed, Dipti P.
    Speier, William
    Arnold, Corey W.
    IEEE TRANSACTIONS ON MEDICAL IMAGING, 2022, 41 (05) : 1176 - 1187
  • [47] A Framework Integrating DeeplabV3+, Transfer Learning, Active Learning, and Incremental Learning for Mapping Building Footprints
    Li, Zhichao
    Dong, Jinwei
    REMOTE SENSING, 2022, 14 (19)
  • [48] Active partial label learning based on adaptive sample selection
    Yan Li
    Chang Liu
    Suyun Zhao
    Qiang Hua
    International Journal of Machine Learning and Cybernetics, 2022, 13 : 1603 - 1617
  • [49] Distributed Active Learning
    Shen, Pengcheng
    Li, Chunguang
    Zhang, Zhaoyang
    IEEE ACCESS, 2016, 4 : 2572 - 2579
  • [50] An Automated Machine Learning-Genetic Algorithm Framework With Active Learning for Design Optimization
    Owoyele, Opeoluwa
    Pal, Pinaki
    Torreira, Alvaro Vidal
    JOURNAL OF ENERGY RESOURCES TECHNOLOGY-TRANSACTIONS OF THE ASME, 2021, 143 (08):