Empirical investigation of active learning strategies

被引:29
|
作者
Pereira-Santos, Davi [1 ]
Cavalcante Prudencio, Ricardo Bastos [2 ]
de Carvalho, Andre C. P. L. F. [1 ]
机构
[1] Univ Sao Paulo, Inst Ciencias Matemat & Comp, Trabalhador Sao Carlense Av 400, BR-13560970 Sao Carlos, SP, Brazil
[2] Univ Fed Pernambuco, Ctr Informat, Jornalista Anibal Fernandes Av, BR-50740560 Recife, PE, Brazil
基金
巴西圣保罗研究基金会;
关键词
Active learning; Agnostic active learning; Non-agnostic active learning; Data sampling; Partially labeled data; Data labeling; CLASSIFICATION; CLASSIFIERS; PREDICTION;
D O I
10.1016/j.neucom.2017.05.105
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Many predictive tasks require labeled data to induce classification models. The data labeling process may have a high cost. Several strategies have been proposed to optimize the selection of the most relevant examples, a process referred to as active learning. However, a lack of empirical studies comparing different active learning approaches across multiple datasets makes it difficult identifying the most promising strategies, or even assessing the relative gain of active learning over the trivial random selection of instances. In this study, a comprehensive comparison of active learning strategies is presented, with various instance selection criteria, different classification algorithms and a large number of datasets. The experimental results confirm the effectiveness of active learning and provide insights about the relationship between classification algorithms and active learning strategies. Additionally, ranking curves with bands are introduced as a means to summarize in a single chart the performance of each active learning strategy for different classification algorithms and datasets. (C) 2017 Elsevier B.V. All rights reserved.
引用
收藏
页码:15 / 27
页数:13
相关论文
共 50 条
  • [31] Data Labeling: An Empirical Investigation into Industrial Challenges and Mitigation Strategies
    Fredriksson, Teodor
    Mattos, David Issa
    Bosch, Jan
    Olsson, Helena Holmstrom
    PRODUCT-FOCUSED SOFTWARE PROCESS IMPROVEMENT (PROFES 2020), 2020, 12562 : 202 - 216
  • [32] KNOWLEDGE CONSTRUCTION IN e-LEARNING: AN EMPIRICAL VALIDATION OF AN ACTIVE LEARNING MODEL
    Koohang, Alex
    Paliszkiewicz, Joanna
    JOURNAL OF COMPUTER INFORMATION SYSTEMS, 2013, 53 (03) : 109 - 114
  • [33] Active learning in international relations: an empirical study on the role of the ludic in the learning process
    Nanci Izidro Goncalves, Fernanda Cristina
    Simoes de Moraes Lima, Leticia Cordeiro
    OASIS-OBSERVATORIO DE ANALISIS DE LOS SISTEMAS INTERNACIONALES, 2020, (32): : 29 - 47
  • [34] An Empirical Investigation of PU Learning for Predicting Length of Stay
    Arjannikov, Tom
    Tzanetakis, George
    2021 IEEE 9TH INTERNATIONAL CONFERENCE ON HEALTHCARE INFORMATICS (ICHI 2021), 2021, : 41 - 47
  • [35] Understanding Student Characteristics in the Development of Active Learning Strategies
    Mehta, Seema
    Schukow, Casey P.
    Takrani, Amar
    Ritchie, Raquel P.
    Wilkins, Carol A.
    Faner, Martha A.
    MEDICAL SCIENCE EDUCATOR, 2022, 32 (03) : 615 - 626
  • [36] Evaluation of multiple active learning strategies in a pharmacology course
    Sumanasekera, Wasana
    Turner, Chase
    Ly, Kaven
    Hoang, Philip
    Jent, Travis
    Sumanasekera, Thimira
    CURRENTS IN PHARMACY TEACHING AND LEARNING, 2020, 12 (01) : 88 - 94
  • [37] Active Learning Strategies for Semi-Supervised DBSCAN
    Li, Jundong
    Sander, Joerg
    Campello, Ricardo
    Zimek, Arthur
    ADVANCES IN ARTIFICIAL INTELLIGENCE, CANADIAN AI 2014, 2014, 8436 : 179 - 190
  • [38] Active learning strategies and assessment in world geography classes
    Klein, P
    JOURNAL OF GEOGRAPHY, 2003, 102 (04) : 146 - 157
  • [39] Optimization of Active Learning Strategies for Causal Network Structure
    Zhang, Mengxin
    Zhang, Xiaojun
    MATHEMATICS, 2024, 12 (06)
  • [40] Graph-Based Query Strategies for Active Learning
    Wu, Wei
    Ostendorf, Mari
    IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2013, 21 (02): : 260 - 269