Uncertainty-Based Selective Clustering for Active Learning

被引:4
作者
Hwang, Sekjin [1 ]
Choi, Jinwoo [1 ]
Choi, Joonsoo [1 ]
机构
[1] Kookmin Univ, Coll Comp Sci, Seoul 02707, South Korea
关键词
Labeling; Data models; Uncertainty; Training; Learning systems; Deep learning; Clustering methods; Active learning; data sampling; uncertainty-based method; diversity-based method; pool-based scenario;
D O I
10.1109/ACCESS.2022.3216065
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Labeling large amount of data is one of important issues in deep learning due to high labeling cost. One method to address this issue is the use of active learning. Active learning selects from a large unlabeled data pool, a set of data that is more informative to training a model for the task at hand. Many active learning approaches use uncertainty-based methods or diversity-based methods. Both have had good results. However, using uncertainty-based methods, there is a risk that sampled data may be redundant, and the use of redundant data can adversely affect the training of the model. Diversity-based methods risk losing some data important for training the model. In this paper, we propose the uncertainty-based Selective Clustering for Active Learning (SCAL), a method of selectively clustering for data with high uncertainty to sample data from each cluster to reduce redundancy. SCAL is expected to extend the area of the decision boundary represented by the sampled data. SCAL achieves a cutting-edge performance for classification tasks on balanced and unbalanced image datasets as well as semantic segmentation tasks.
引用
收藏
页码:110983 / 110991
页数:9
相关论文
共 50 条
  • [21] Batch Mode Active Learning Based on Multi-Set Clustering
    Yang, Yazhou
    Yin, Xiaoqing
    Zhao, Yang
    Lei, Jun
    Li, Weili
    Shu, Zhe
    IEEE ACCESS, 2021, 9 : 51452 - 51463
  • [22] Exploring Active Learning Based on Representativeness and Uncertainty for Biomedical Data Classification
    Bressan, Rafael S.
    Camargo, Guilherme
    Bugatti, Pedro Henrique
    Saito, Priscila Tiemi Maeda
    IEEE JOURNAL OF BIOMEDICAL AND HEALTH INFORMATICS, 2019, 23 (06) : 2238 - 2244
  • [23] Calibrated Aleatoric Uncertainty-Based Adaptive Label Distribution Learning for Pose Estimation of Sichuan Peppers
    Liu, Xueyin
    Dong, Dawei
    Luo, Jianqiao
    Li, Bailin
    IEEE SENSORS JOURNAL, 2024, 24 (07) : 10727 - 10741
  • [24] On Measuring Uncertainty and Uncertainty-Based Information: Recent Developments
    George J. Klir
    Richard M. Smith
    Annals of Mathematics and Artificial Intelligence, 2001, 32 : 5 - 33
  • [25] Active Learning for Ordinal Classification Based on Adaptive Diversity-Based Uncertainty Sampling
    He, Deniu
    IEEE ACCESS, 2023, 11 : 16396 - 16410
  • [26] On measuring uncertainty and uncertainty-based information: Recent developments
    Klir, GJ
    Smith, RM
    ANNALS OF MATHEMATICS AND ARTIFICIAL INTELLIGENCE, 2001, 32 (1-4) : 5 - 33
  • [27] How Much a Model be Trained by Passive Learning Before Active Learning?
    Jo, Dae Ung
    Yun, Sangdoo
    Choi, Jin Young
    IEEE ACCESS, 2022, 10 : 34677 - 34689
  • [28] Unsupervised Fusion Feature Matching for Data Bias in Uncertainty Active Learning
    Huang, Wei
    Sun, Shuzhou
    Lin, Xiao
    Li, Ping
    Zhu, Lei
    Wang, Jihong
    Chen, C. L. Philip
    Sheng, Bin
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, 35 (04) : 5749 - 5763
  • [29] USD: Uncertainty-Based One-Phase Learning to Enhance Pseudo-Label Reliability for Semi-Supervised Object Detection
    Chun, Dayoung
    Lee, Seungil
    Kim, Hyun
    IEEE TRANSACTIONS ON MULTIMEDIA, 2024, 26 : 6336 - 6347
  • [30] An uncertainty-based interpretable deep learning framework for predicting breast cancer outcome
    Chai, Hua
    Lin, Siyin
    Lin, Junqi
    He, Minfan
    Yang, Yuedong
    OuYang, Yongzhong
    Zhao, Huiying
    BMC BIOINFORMATICS, 2024, 25 (01)