Dual-Decoupling Learning and Metric-Adaptive Thresholding for Semi-supervised Multi-label Learning

被引:0
作者
Xiao, Jia-Hao [1 ]
Xie, Ming-Kun [1 ]
Fan, Heng-Bo [1 ]
Niu, Gang [2 ]
Sugiyama, Masashi [2 ,3 ]
Huang, Sheng-Jun [1 ]
机构
[1] Nanjing Univ Aeronaut & Astronaut, Nanjing, Peoples R China
[2] RIKEN, Ctr Adv Intelligence Project, Tokyo, Japan
[3] Univ Tokyo, Tokyo, Japan
来源
COMPUTER VISION - ECCV 2024, PT LII | 2025年 / 15110卷
基金
国家重点研发计划;
关键词
Multi-label learning; Semi-supervised learning; CLASSIFICATION;
D O I
10.1007/978-3-031-72943-0_25
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Semi-supervised multi-label learning (SSMLL) is a powerful framework for leveraging unlabeled data to reduce the expensive cost of collecting precise multi-label annotations. Unlike semi-supervised learning, one cannot select the most probable label as the pseudo-label in SSMLL due to multiple semantics contained in an instance. To solve this problem, the mainstream method developed an effective thresholding strategy to generate accurate pseudo-labels. Unfortunately, the method neglected the quality of model predictions and its potential impact on pseudo-labeling performance. In this paper, we propose a dual-perspective method to generate high-quality pseudo-labels. To improve the quality of model predictions, we perform dual-decoupling to boost the learning of correlative and discriminative features, while refining the generation and utilization of pseudo-labels. To obtain proper class-wise thresholds, we propose the metric-adaptive thresholding strategy to estimate the thresholds, which maximize the pseudo-label performance for a given metric on labeled data. Experiments on multiple benchmark datasets show the proposed method can achieve the state-of-the-art performance and outperform the comparative methods with a significant margin. The implementation is available at JiahaoXxX/SSMLL-D2L MAT.
引用
收藏
页码:437 / 454
页数:18
相关论文
共 57 条
[11]  
DeVries T, 2017, Arxiv, DOI arXiv:1708.04552
[12]  
Dosovitskiy A, 2021, Arxiv, DOI arXiv:2010.11929
[13]   Learning a Deep ConvNet for Multi-label Classification with Partial Labels [J].
Durand, Thibaut ;
Mehrasa, Nazanin ;
Mori, Greg .
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :647-657
[14]   The Pascal Visual Object Classes (VOC) Challenge [J].
Everingham, Mark ;
Van Gool, Luc ;
Williams, Christopher K. I. ;
Winn, John ;
Zisserman, Andrew .
INTERNATIONAL JOURNAL OF COMPUTER VISION, 2010, 88 (02) :303-338
[15]   Long-Tailed Multi-Label Visual Recognition by Collaborative Training on Uniform and Re-balanced Samplings [J].
Guo, Hao ;
Wang, Song .
2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, :15084-15093
[16]  
Guo LZ, 2022, PR MACH LEARN RES
[17]   LVIS: A Dataset for Large Vocabulary Instance Segmentation [J].
Gupta, Agrim ;
Dollar, Piotr ;
Girshick, Ross .
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :5351-5359
[18]   Deep Residual Learning for Image Recognition [J].
He, Kaiming ;
Zhang, Xiangyu ;
Ren, Shaoqing ;
Sun, Jian .
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :770-778
[19]   Weakly Supervised Image Classification through Noise Regularization [J].
Hu, Mengying ;
Han, Hu ;
Shan, Shiguang ;
Chen, Xilin .
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :11509-11517
[20]  
Huang S. J., 2012, P 26 AAAI C ART INT, P949, DOI DOI 10.1609/AAAI.V26I1.8287