Dual-Decoupling Learning and Metric-Adaptive Thresholding for Semi-supervised Multi-label Learning

被引：0

作者：

Xiao, Jia-Hao ^{[1
]}

Xie, Ming-Kun ^{[1
]}

Fan, Heng-Bo ^{[1
]}

Niu, Gang ^{[2
]}

Sugiyama, Masashi ^{[2
,3
]}

Huang, Sheng-Jun ^{[1
]}

机构：

[1] Nanjing Univ Aeronaut & Astronaut, Nanjing, Peoples R China

[2] RIKEN, Ctr Adv Intelligence Project, Tokyo, Japan

[3] Univ Tokyo, Tokyo, Japan

来源：

COMPUTER VISION - ECCV 2024, PT LII | 2025年 / 15110卷

基金：

国家重点研发计划;

关键词：

Multi-label learning; Semi-supervised learning; CLASSIFICATION;

D O I：

10.1007/978-3-031-72943-0_25

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Semi-supervised multi-label learning (SSMLL) is a powerful framework for leveraging unlabeled data to reduce the expensive cost of collecting precise multi-label annotations. Unlike semi-supervised learning, one cannot select the most probable label as the pseudo-label in SSMLL due to multiple semantics contained in an instance. To solve this problem, the mainstream method developed an effective thresholding strategy to generate accurate pseudo-labels. Unfortunately, the method neglected the quality of model predictions and its potential impact on pseudo-labeling performance. In this paper, we propose a dual-perspective method to generate high-quality pseudo-labels. To improve the quality of model predictions, we perform dual-decoupling to boost the learning of correlative and discriminative features, while refining the generation and utilization of pseudo-labels. To obtain proper class-wise thresholds, we propose the metric-adaptive thresholding strategy to estimate the thresholds, which maximize the pseudo-label performance for a given metric on labeled data. Experiments on multiple benchmark datasets show the proposed method can achieve the state-of-the-art performance and outperform the comparative methods with a significant margin. The implementation is available at JiahaoXxX/SSMLL-D2L MAT.

引用

页码：437 / 454

页数：18

共 57 条

[11]

DeVries T, 2017, Arxiv, DOI arXiv:1708.04552

[12]

Dosovitskiy A, 2021, Arxiv, DOI arXiv:2010.11929

[13] Learning a Deep ConvNet for Multi-label Classification with Partial Labels [J].

Durand, Thibaut ;

Mehrasa, Nazanin ;

Mori, Greg .

2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :647-657

[14] The Pascal Visual Object Classes (VOC) Challenge [J].

Everingham, Mark ;

Van Gool, Luc ;

Williams, Christopher K. I. ;

Winn, John ;

Zisserman, Andrew .

INTERNATIONAL JOURNAL OF COMPUTER VISION, 2010, 88 (02) :303-338

[15] Long-Tailed Multi-Label Visual Recognition by Collaborative Training on Uniform and Re-balanced Samplings [J].

Guo, Hao ;

Wang, Song .

2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, :15084-15093

[16]

Guo LZ, 2022, PR MACH LEARN RES

[17] LVIS: A Dataset for Large Vocabulary Instance Segmentation [J].

Gupta, Agrim ;

Dollar, Piotr ;

Girshick, Ross .

2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :5351-5359

[18] Deep Residual Learning for Image Recognition [J].

He, Kaiming ;

Zhang, Xiangyu ;

Ren, Shaoqing ;

Sun, Jian .

2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :770-778

[19] Weakly Supervised Image Classification through Noise Regularization [J].

Hu, Mengying ;

Han, Hu ;

Shan, Shiguang ;

Chen, Xilin .

2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :11509-11517

[20]

Huang S. J., 2012, P 26 AAAI C ART INT, P949, DOI DOI 10.1609/AAAI.V26I1.8287

← 1 2 3 4 5 6 →