A Serial Sample Selection Framework for Active Learning

Cited by: 0
Authors
Li, Chengchao [1]
Zhao, Pengpeng [1]
Wu, Jian [1]
Xu, Haihui [1]
Cui, Zhiming [1]
Affiliation
[1] Soochow Univ, Sch Comp Sci & Technol, Suzhou 215006, Peoples R China
Source
ADVANCED DATA MINING AND APPLICATIONS, ADMA 2014 | 2014 / Vol. 8933
Keywords
Data Mining; Active Learning; Sampling Strategy; Uncertainty; Representativeness;
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Active learning is a machine learning and data mining technique that selects the most informative samples for labeling and uses them as training data. It aims to obtain a high-performance classifier by labeling as few samples as possible from a large pool of unlabeled data, which makes the sampling strategy the core issue. Existing approaches either tend to ignore the information in unlabeled data and are prone to querying outliers or noisy samples, or evaluate large numbers of non-informative samples, leading to significant computational cost. To address these problems, this paper proposes a serial active learning framework. It first measures the uncertainty of the unlabeled samples and selects the most uncertain sample set. From this set, it then generates the most representative sample set based on the mutual information criterion. Finally, the framework selects the most informative sample from the most representative sample set based on the expected error reduction strategy. Experimental results on multiple datasets show that our approach outperforms Random Sampling and the state-of-the-art adaptive active learning method.
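The three serial stages described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract does not specify the classifier or the exact scoring functions, so the nearest-centroid softmax model, the entropy-based uncertainty score, and the function names (`fit_centroids`, `predict_proba`, `select_query`) are all assumptions made for illustration. In particular, stage 2 uses a simple Gaussian-kernel density as a stand-in for the paper's mutual information criterion.

```python
import math

def fit_centroids(X, y):
    """Toy classifier (an assumption): one mean vector per class."""
    sums, counts = {}, {}
    for x, label in zip(X, y):
        acc = sums.setdefault(label, [0.0] * len(x))
        for i, v in enumerate(x):
            acc[i] += v
        counts[label] = counts.get(label, 0) + 1
    return {c: [v / counts[c] for v in s] for c, s in sums.items()}

def sq_dist(a, b):
    return sum((p - q) ** 2 for p, q in zip(a, b))

def predict_proba(centroids, x):
    """Softmax over negative squared distances to each class centroid."""
    scores = {c: -sq_dist(x, m) for c, m in centroids.items()}
    top = max(scores.values())
    exps = {c: math.exp(s - top) for c, s in scores.items()}
    z = sum(exps.values())
    return {c: e / z for c, e in exps.items()}

def entropy(probs):
    return -sum(p * math.log(p) for p in probs.values() if p > 0)

def select_query(X_lab, y_lab, X_pool, n_uncertain=4, n_repr=2):
    """Return the pool index to label next, filtering in three serial stages."""
    model = fit_centroids(X_lab, y_lab)

    # Stage 1: keep the most uncertain samples (highest predictive entropy).
    stage1 = sorted(range(len(X_pool)),
                    key=lambda i: entropy(predict_proba(model, X_pool[i])),
                    reverse=True)[:n_uncertain]

    # Stage 2: keep the most representative of those. Kernel density over the
    # pool is a stand-in here, NOT the paper's mutual information criterion.
    def density(i):
        return sum(math.exp(-sq_dist(X_pool[i], x)) for x in X_pool) / len(X_pool)
    stage2 = sorted(stage1, key=density, reverse=True)[:n_repr]

    # Stage 3: expected error reduction. For each candidate, retrain under
    # every possible label and weight the resulting pool risk by the current
    # model's probability of that label.
    def expected_error(i):
        total = 0.0
        for label, p in predict_proba(model, X_pool[i]).items():
            retrained = fit_centroids(X_lab + [X_pool[i]], y_lab + [label])
            risk = sum(1.0 - max(predict_proba(retrained, x).values())
                       for j, x in enumerate(X_pool) if j != i)
            total += p * risk
        return total
    return min(stage2, key=expected_error)

# Hypothetical toy data: index 3 is an outlier that is maximally uncertain
# (equidistant from both classes) but not representative of the pool.
X_lab, y_lab = [[0.0, 0.0], [4.0, 0.0]], [0, 1]
X_pool = [[2.0, 0.1], [2.0, -0.1], [1.9, 0.0],
          [2.0, 50.0], [0.1, 0.0], [3.9, 0.0]]
query = select_query(X_lab, y_lab, X_pool)
```

In this toy pool the outlier passes the uncertainty filter but is dropped by the representativeness filter, so the costly expected error reduction step runs on only two candidates. This mirrors the motivation stated in the abstract: avoid querying outliers while limiting computation on non-informative samples.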
Pages: 435 - 446
Number of pages: 12
Related papers
50 in total
  • [21] Active learning with model selection - Simultaneous optimization of sample points and models for trigonometric polynomial models
    Sugiyama, M
    Ogawa, H
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2003, E86D (12) : 2753 - 2763
  • [22] The true sample complexity of active learning
    Maria-Florina Balcan
    Steve Hanneke
    Jennifer Wortman Vaughan
    Machine Learning, 2010, 80 : 111 - 139
  • [23] Active learning for ranking with sample density
    Cai, Wenbin
    Zhang, Muhan
    Zhang, Ya
    INFORMATION RETRIEVAL JOURNAL, 2015, 18 (02) : 123 - 144
  • [24] The true sample complexity of active learning
    Balcan, Maria-Florina
    Hanneke, Steve
    Vaughan, Jennifer Wortman
    MACHINE LEARNING, 2010, 80 (2-3) : 111 - 139
  • [25] PathAL: An Active Learning Framework for Histopathology Image Analysis
    Li, Wenyuan
    Li, Jiayun
    Wang, Zichen
    Polson, Jennifer
    Sisk, Anthony E.
    Sajed, Dipti P.
    Speier, William
    Arnold, Corey W.
    IEEE TRANSACTIONS ON MEDICAL IMAGING, 2022, 41 (05) : 1176 - 1187
  • [26] Active learning for ranking with sample density
    Wenbin Cai
    Muhan Zhang
    Ya Zhang
    Information Retrieval Journal, 2015, 18 : 123 - 144
  • [27] AN ACTIVE LEARNING METHOD USING CLUSTERING AND COMMITTEE-BASED SAMPLE SELECTION FOR SOUND EVENT CLASSIFICATION
    Zhao Shuyang
    Heittola, Toni
    Virtanen, Tuomas
    2018 16TH INTERNATIONAL WORKSHOP ON ACOUSTIC SIGNAL ENHANCEMENT (IWAENC), 2018 : 116 - 120
  • [28] Active Query Selection for Learning Rankers
    Bilgic, Mustafa
    Bennett, Paul N.
    SIGIR 2012: PROCEEDINGS OF THE 35TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, 2012 : 1033 - 1034
  • [29] Active Selection Transfer Learning Algorithm
    Weifei Wu
    Yanhui Zhang
    Fuyijin Xing
    Neural Processing Letters, 2023, 55 : 10093 - 10116
  • [30] An active learning framework for set inversion
    Nguyen, Binh T.
    Nguyen, Duy M.
    Ho, Lam Si Tung
    Vu Dinh
    KNOWLEDGE-BASED SYSTEMS, 2019, 185