A Serial Sample Selection Framework for Active Learning

被引:0
作者
Li, Chengchao [1 ]
Zhao, Pengpeng [1 ]
Wu, Jian [1 ]
Xu, Haihui [1 ]
Cui, Zhiming [1 ]
机构
[1] Soochow Univ, Sch Comp Sci & Technol, Suzhou 215006, Peoples R China
来源
ADVANCED DATA MINING AND APPLICATIONS, ADMA 2014 | 2014年 / 8933卷
关键词
Data Mining; Active Learning; Sampling Strategy; Uncertainty; Representativeness;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Active Learning is a machine learning and data mining technique that selects the most informative samples for labeling and uses them as training data. It aims to obtain a high performance classifier by labeling as little data as possible from large amount of unlabeled samples, which means sampling strategy is the core issue. Existing approaches either tend to ignore information in unlabeled data and are prone to querying outliers or noise samples, or calculate large amounts of non-informative samples leading to significant computation cost. In order to solve above problems, this paper proposed a serial active learning framework. It first measures uncertainty of unlabeled samples and selects the most uncertain sample set. From which, it further generates the most representative sample set based on the mutual information criterion. Finally, the framework selects the most informative sample from the most representative sample set based on expected error reduction strategy. Experimental results on multiple datasets show that our approach outperforms Random Sampling and the state of the art adaptive active learning method.
引用
收藏
页码:435 / 446
页数:12
相关论文
共 50 条
  • [31] An Active Learning Framework for Alpha Matting
    Shen, Yang
    Wang, Pengjie
    Pan, Zhifang
    Bao, Yanxia
    INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2019, 33 (09)
  • [32] Active Selection Transfer Learning Algorithm
    Wu, Weifei
    Zhang, Yanhui
    Xing, Fuyijin
    NEURAL PROCESSING LETTERS, 2023, 55 (07) : 10093 - 10116
  • [33] Unsupervised Projected Sample Selector for Active Learning
    Pi, Yueyang
    Shi, Yiqing
    Du, Shide
    Huang, Yang
    Wang, Shiping
    IEEE TRANSACTIONS ON BIG DATA, 2025, 11 (02) : 485 - 498
  • [34] Uncertainty SVM Active Learning Algorithm Based on Convex Hull and Sample Distance
    Xu, Hailong
    Li, Longyue
    Guo, Pengsong
    Shang, Changan
    PROCEEDINGS OF THE 33RD CHINESE CONTROL AND DECISION CONFERENCE (CCDC 2021), 2021, : 6815 - 6822
  • [35] Active Multi-label Learning with Optimal Label Subset Selection
    Jiao, Yang
    Zhao, Pengpeng
    Wu, Jian
    Xian, Xuefeng
    Xu, Haihui
    Cui, Zhiming
    ADVANCED DATA MINING AND APPLICATIONS, ADMA 2014, 2014, 8933 : 523 - 534
  • [36] Active multi-label learning with optimal label subset selection
    Jiao, Yang, 1600, Springer Verlag (8933): : 523 - 534
  • [37] Active Sample Selection Through Sparse Neighborhood for Imbalanced Datasets
    Gu, Ping
    Ling, Zhao
    Shao, Si Yu
    Zhou, Meng
    2019 IEEE SYMPOSIUM ON COMPUTERS AND COMMUNICATIONS (ISCC), 2019, : 112 - 117
  • [38] Active learning of constraints for weighted feature selection
    Hijazi, Samah
    Hamad, Denis
    Kalakech, Mariam
    Kalakech, Ali
    ADVANCES IN DATA ANALYSIS AND CLASSIFICATION, 2021, 15 (02) : 337 - 377
  • [39] Active Learning With Optimal Instance Subset Selection
    Fu, Yifan
    Zhu, Xingquan
    Elmagarmid, Ahmed K.
    IEEE TRANSACTIONS ON CYBERNETICS, 2013, 43 (02) : 464 - 475
  • [40] Active learning of constraints for weighted feature selection
    Samah Hijazi
    Denis Hamad
    Mariam Kalakech
    Ali Kalakech
    Advances in Data Analysis and Classification, 2021, 15 : 337 - 377