TiDAL: Learning Training Dynamics for Active Learning

被引：7

作者：

Kye, Seong Min ^{[1
]}

Choi, Kwanghee ^{[2
]}

Byun, Hyeongmin ^{[1
]}

Chang, Buru ^{[3
]}

机构：

[1] Hyperconnect, Seoul, South Korea

[2] Carnegie Mellon Univ, Pittsburgh, PA USA

[3] Sogang Univ, Seoul, South Korea

来源：

2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023) | 2023年

关键词：

D O I：

10.1109/ICCV51070.2023.02041

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Active learning (AL) aims to select the most useful data samples from an unlabeled data pool and annotate them to expand the labeled dataset under a limited budget. Especially, uncertainty-based methods choose the most uncertain samples, which are known to be effective in improving model performance. However, previous methods often overlook training dynamics (TD), defined as the ever-changing model behavior during optimization via stochastic gradient descent, even though other research areas have empirically shown that TD provides important clues for measuring the data uncertainty. In this paper, we first provide theoretical and empirical evidence to argue the usefulness of utilizing the ever-changing model behavior rather than the fully trained model snapshot. We then propose a novel AL method, Training Dynamics for Active Learning (TiDAL), which efficiently predicts the training dynamics of unlabeled data to estimate their uncertainty. Experimental results show that our TiDAL achieves better or comparable performance on both balanced and imbalanced benchmark datasets compared to state-of-the-art AL methods, which estimate data uncertainty using only static information after model training.

引用

页码：22278 / 22288

页数：11

共 12 条

[1] Class-Balanced Active Learning for Image Classification [J].

Bengar, Javad Zolfaghari ;

van de Weijer, Joost ;

Fuentes, Laura Lopez ;

Raducanu, Bogdan .

2022 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV 2022), 2022, :3707-3716

[2]

COHN D, 1994, MACH LEARN, V15, P201, DOI 10.1007/BF00993277

[3] Disentangling Label Distribution for Long-tailed Visual Recognition [J].

Hong, Youngkyu ;

Han, Seungju ;

Choi, Kwanghee ;

Seo, Seokjun ;

Kim, Beomsu ;

Chang, Buru .

2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, :6622-6632

[4]

Jiang Heinrich, 2018, Advances in Neural Information Processing Systems, V31, DOI DOI 10.48550/ARXIV.1805.11783

[5]

Laine S., 2017, INT C LEARN REPR, DOI DOI 10.48550/ARXIV.1610.02242

[6]

Lee J., 2018, ICLR

[7] A Survey of Deep Active Learning [J].

Ren, Pengzhen ;

Xiao, Yun ;

Chang, Xiaojun ;

Huang, Po-Yao ;

Li, Zhihui ;

Gupta, Brij B. ;

Chen, Xiaojiang ;

Wang, Xin .

ACM COMPUTING SURVEYS, 2022, 54 (09)

[8]

Song H., International Conference on Machine Learning, P5907

[9]

Tran T, 2019, PR MACH LEARN RES, V97

[10]

Xiao H, 2017, Arxiv, DOI [arXiv:1708.07747, DOI 10.48550/ARXIV.1708.07747]

← 1 2 →