Learning to Sample: an Active Learning Framework

Cited by: 10
Authors
Shao, Jingyu [1]
Wang, Qing [1]
Liu, Fangbing [1]
Affiliations
[1] Australian Natl Univ, Res Sch Comp Sci, Acton, ACT, Australia
Source
2019 19TH IEEE INTERNATIONAL CONFERENCE ON DATA MINING (ICDM 2019) | 2019
Funding
Australian Research Council
Keywords
active learning; meta-learning; uncertainty sampling; diversity sampling; boosting
DOI
10.1109/ICDM.2019.00064
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Meta-learning algorithms for active learning are emerging as a promising paradigm for learning the "best" active learning strategy. However, current learning-based active learning approaches still require sufficient training data so as to generalize meta-learning models for active learning. This is contrary to the nature of active learning which typically starts with a small number of labeled samples. The unavailability of large amounts of labeled samples for training meta-learning models would inevitably lead to poor performance (e.g., instabilities and overfitting). In our paper, we tackle these issues by proposing a novel learning-based active learning framework, called Learning To Sample (LTS). This framework has two key components: a sampling model and a boosting model, which can mutually learn from each other in iterations to improve the performance of each other. Within this framework, the sampling model incorporates uncertainty sampling and diversity sampling into a unified process for optimization, enabling us to actively select the most representative and informative samples based on an optimized integration of uncertainty and diversity. To evaluate the effectiveness of the LTS framework, we have conducted extensive experiments on three different classification tasks: image classification, salary level prediction, and entity resolution. The experimental results show that our LTS framework significantly outperforms all the baselines when the label budget is limited, especially for datasets with highly imbalanced classes. In addition to this, our LTS framework can effectively tackle the cold start problem occurring in many existing active learning approaches.
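The abstract describes selecting samples by integrating uncertainty and diversity. The sketch below is only a generic illustration of that idea, not the LTS sampling model. It assumes a fixed weight `alpha` where the paper optimizes the integration, and the function names, the entropy-based uncertainty score, and the nearest-neighbor diversity score are all hypothetical choices made for this example.

```python
import math


def entropy(probs):
    """Shannon entropy of one predicted class distribution, scaled to [0, 1].

    Assumes at least two classes.
    """
    h = -sum(p * math.log(p) for p in probs if p > 0)
    return h / math.log(len(probs))


def min_distance(x, chosen):
    """Euclidean distance from x to its nearest point in chosen (1.0 if empty)."""
    return min(math.dist(x, c) for c in chosen) if chosen else 1.0


def select_batch(probs, features, labeled, k, alpha=0.5):
    """Greedily pick k unlabeled indices by a weighted uncertainty/diversity score.

    probs    -- predicted class probabilities for each unlabeled sample
    features -- feature vector of each unlabeled sample
    labeled  -- feature vectors of samples that already have labels
    alpha    -- hypothetical fixed trade-off: 1.0 is pure uncertainty,
                0.0 is pure diversity
    """
    chosen = list(labeled)
    picked = []
    candidates = set(range(len(features)))
    for _ in range(min(k, len(features))):
        # Diversity: distance to the closest labeled or already picked point.
        dists = {i: min_distance(features[i], chosen) for i in candidates}
        scale = max(dists.values()) or 1.0  # avoid dividing by zero
        best = max(
            sorted(candidates),
            key=lambda i: alpha * entropy(probs[i])
            + (1 - alpha) * dists[i] / scale,
        )
        picked.append(best)
        chosen.append(features[best])
        candidates.remove(best)
    return picked
```

Recomputing the distances after every pick keeps a batch from collapsing onto near-duplicate samples, which is the usual motivation for pairing diversity with uncertainty.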
Pages: 538-547
Number of pages: 10