Active learning with misclassification sampling based on committee

被引:0
|
作者
Long, Jun [1 ]
Yin, Jianping [1 ]
Zhu, En [1 ]
Zhao, Wentao [1 ]
机构
[1] Natl Univ Def Technol, Sch Comp Sci, Changsha 410073, Peoples R China
基金
中国国家自然科学基金;
关键词
active learning; misclassification sampling; committee; version space reduction;
D O I
10.1142/S0218488508005248
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Active learning is an important approach to reduce data-collection costs for inductive learning problems by sampling only the most informative instances for labeling. We focus here on the sampling criterion for how to select these most informative instances. Three contributions are made in this paper. First, in contrast to the leading sampling strategy of halving the volume of version space, we present the sampling strategy of reducing the volume of version space by more than half with the assumption of target function being chosen from nonuniform distribution over version space. Second, we propose the idea of sampling the instances that would be most possibly misclassified. Third, we develop a sampling method named CBMPMS (Committee Based Most Possible Misclassification Sampling) which samples the instances that have the largest probability to be misclassified by the current classifier. Comparing the proposed CBMPMS method with the existing active learning methods, when the classifiers achieve the same accuracy, the former method will sample fewer times than the latter ones. The experiments show that the proposed method outperforms the traditional sampling methods on most selected datasets.
引用
收藏
页码:55 / 70
页数:16
相关论文
共 50 条
  • [1] An active learning method based on most possible misclassification sampling using committee
    Long, Jun
    Yin, Jianping
    Zhu, En
    MODELING DECISIONS FOR ARTIFICIAL INTELLIGENCE, PROCEEDINGS, 2007, 4617 : 104 - +
  • [2] Active learning with misclassification sampling using diverse ensembles enhanced by unlabeled instances
    Long, Jun
    Yin, Jianping
    Zhu, En
    Zhao, Wentao
    ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PROCEEDINGS, 2008, 5012 : 951 - 957
  • [3] Active learning for regression based on query by committee
    Bujrbidge, Robert
    Rowland, Jefn J.
    King, Ross D.
    INTELLIGENT DATA ENGINEERING AND AUTOMATED LEARNING - IDEAL 2007, 2007, 4881 : 209 - 218
  • [4] SPEECH MODELING BASED ON COMMITTEE-BASED ACTIVE LEARNING
    Hamanaka, Yuzo
    Shinoda, Koichi
    Furui, Sadaoki
    Emori, Tadashi
    Koshinaka, Takafumi
    2010 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2010, : 4350 - 4353
  • [5] Committee-Based Active Learning for Speech Recognition
    Hamanaka, Yuzo
    Shinoda, Koichi
    Tsutaoka, Takuya
    Furui, Sadaoki
    Emori, Tadashi
    Koshinaka, Takafumi
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2011, E94D (10): : 2015 - 2023
  • [6] Active learning method based on instability sampling
    He H.
    Xie M.
    Huang S.
    Guofang Keji Daxue Xuebao/Journal of National University of Defense Technology, 2022, 44 (03): : 50 - 56
  • [7] Active learning using uncertainty sampling and query-by-committee for software defect prediction
    Qu Y.
    Chen X.
    Chen R.
    Ju X.
    Guo J.
    International Journal of Performability Engineering, 2019, 15 (10): : 2701 - 2708
  • [8] Evidence-based uncertainty sampling for active learning
    Manali Sharma
    Mustafa Bilgic
    Data Mining and Knowledge Discovery, 2017, 31 : 164 - 202
  • [9] Important sampling based active learning for imbalance classification
    Xinyue WANG
    Bo LIU
    Siyu CAO
    Liping JING
    Jian YU
    Science China(Information Sciences), 2020, 63 (08) : 196 - 209
  • [10] Combining Committee-Based Semi-Supervised Learning and Active Learning
    Mohamed Farouk Abdel Hady
    Friedhelm Schwenker
    JournalofComputerScience&Technology, 2010, 25 (04) : 681 - 698