A deep reinforcement active learning method for multi-label image classification

被引：0

作者：

Cai, Qing ^{[1
,2
]}

Tao, Ran ^{[1
,2
]}

Fang, Xiufen ^{[3
]}

Xie, Xiurui ^{[3
]}

Liu, Guisong ^{[1
,2
]}

机构：

[1] Southwestern Univ Finance & Econ, Sch Comp & Artificial Intelligence, Chengdu 611130, Sichuan, Peoples R China

[2] Southwestern Univ Finance & Econ, Complex Lab New Finance & Econ, Chengdu 611130, Sichuan, Peoples R China

[3] Univ Elect Sci & Technol China, Chengdu 611731, Sichuan, Peoples R China

来源：

COMPUTER VISION AND IMAGE UNDERSTANDING | 2025年 / 257卷

基金：

中国国家自然科学基金;

关键词：

Active learning; Multi-label image classification; Deep learning; Reinforcement learning;

D O I：

10.1016/j.cviu.2025.104351

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Active learning is a widely used method for addressing the high cost of sample labeling in deep learning models and has achieved significant success in recent years. However, most existing active learning methods only focus on single-label image classification and have limited application in the context of multi-label images. To address this issue, we propose a novel, multi-label active learning approach based on a reinforcement learning strategy. The proposed approach introduces a reinforcement active learning framework that accounts for the expected error reduction in multi-label images, making it adaptable to multi-label classification models. Additionally, we develop a multi-label reinforcement active learning module (MLRAL), which employs an actor-critic strategy and proximal policy optimization algorithm (PPO). Our state and reward functions consider multi-label correlations to accurately evaluate the potential impact of unlabeled samples on the current model state. We conduct experiments on various multi-label image classification tasks, including the VOC 2007, MS-COCO, NUS-WIDE and ODIR. We also compare our method with multiple classification models, and experimental results show that our method outperforms existing approaches on various tasks, demonstrating the superiority and effectiveness of the proposed method.

引用

页数：10

共 45 条

[1]

[Anonymous], 2009, P ACM INT C IM VID R

[2]

Ben-Baruch E, 2021, Arxiv, DOI arXiv:2009.14119

[3] End-to-End Object Detection with Transformers [J].

Carion, Nicolas ;

Massa, Francisco ;

Synnaeve, Gabriel ;

Usunier, Nicolas ;

Kirillov, Alexander ;

Zagoruyko, Sergey .

COMPUTER VISION - ECCV 2020, PT I, 2020, 12346 :213-229

[4] Lesion Location Attention Guided Network for Multi-Label Thoracic Disease Classification in Chest X-Rays [J].

Chen, Bingzhi ;

Li, Jinxing ;

Lu, Guangming ;

Zhang, David .

IEEE JOURNAL OF BIOMEDICAL AND HEALTH INFORMATICS, 2020, 24 (07) :2016-2027

[5] Stable matching-based two-way selection in multi-label active learning with imbalanced data [J].

Chen, Shuyue ;

Wang, Ran ;

Lu, Jian ;

Wang, Xizhao .

INFORMATION SCIENCES, 2022, 610 :281-299

[6]

Chen TS, 2018, AAAI CONF ARTIF INTE, P6730

[7] Multi-Label Image Recognition with Graph Convolutional Networks [J].

Chen, Zhao-Min ;

Wei, Xiu-Shen ;

Wang, Peng ;

Guo, Yanwen .

2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :5172-5181

[8]

Dugas E., 2015, Kaggle

[9]

Epshteyn A., 2008, MACHINE LEARNING

[10] The Pascal Visual Object Classes (VOC) Challenge [J].

Everingham, Mark ;

Van Gool, Luc ;

Williams, Christopher K. I. ;

Winn, John ;

Zisserman, Andrew .

INTERNATIONAL JOURNAL OF COMPUTER VISION, 2010, 88 (02) :303-338

← 1 2 3 4 5 →