Dual-Decoupling Learning and Metric-Adaptive Thresholding for Semi-supervised Multi-label Learning

被引:0
作者
Xiao, Jia-Hao [1 ]
Xie, Ming-Kun [1 ]
Fan, Heng-Bo [1 ]
Niu, Gang [2 ]
Sugiyama, Masashi [2 ,3 ]
Huang, Sheng-Jun [1 ]
机构
[1] Nanjing Univ Aeronaut & Astronaut, Nanjing, Peoples R China
[2] RIKEN, Ctr Adv Intelligence Project, Tokyo, Japan
[3] Univ Tokyo, Tokyo, Japan
来源
COMPUTER VISION - ECCV 2024, PT LII | 2025年 / 15110卷
基金
国家重点研发计划;
关键词
Multi-label learning; Semi-supervised learning; CLASSIFICATION;
D O I
10.1007/978-3-031-72943-0_25
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Semi-supervised multi-label learning (SSMLL) is a powerful framework for leveraging unlabeled data to reduce the expensive cost of collecting precise multi-label annotations. Unlike semi-supervised learning, one cannot select the most probable label as the pseudo-label in SSMLL due to multiple semantics contained in an instance. To solve this problem, the mainstream method developed an effective thresholding strategy to generate accurate pseudo-labels. Unfortunately, the method neglected the quality of model predictions and its potential impact on pseudo-labeling performance. In this paper, we propose a dual-perspective method to generate high-quality pseudo-labels. To improve the quality of model predictions, we perform dual-decoupling to boost the learning of correlative and discriminative features, while refining the generation and utilization of pseudo-labels. To obtain proper class-wise thresholds, we propose the metric-adaptive thresholding strategy to estimate the thresholds, which maximize the pseudo-label performance for a given metric on labeled data. Experiments on multiple benchmark datasets show the proposed method can achieve the state-of-the-art performance and outperform the comparative methods with a significant margin. The implementation is available at JiahaoXxX/SSMLL-D2L MAT.
引用
收藏
页码:437 / 454
页数:18
相关论文
共 57 条
[21]   Interactive Multi-Label CNN Learning with Partial Labels [J].
Huynh, Dat ;
Elhamifar, Ehsan .
2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2020), 2020, :9420-9429
[22]   Large Loss Matters in Weakly Supervised Multi-Label Classification [J].
Kim, Youngwook ;
Kim, Jae Myung ;
Akata, Zeynep ;
Lee, Jungwoo .
2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2022, :14136-14145
[23]   The Open Images Dataset V4 Unified Image Classification, Object Detection, and Visual Relationship Detection at Scale [J].
Kuznetsova, Alina ;
Rom, Hassan ;
Alldrin, Neil ;
Uijlings, Jasper ;
Krasin, Ivan ;
Pont-Tuset, Jordi ;
Kamali, Shahab ;
Popov, Stefan ;
Malloci, Matteo ;
Kolesnikov, Alexander ;
Duerig, Tom ;
Ferrari, Vittorio .
INTERNATIONAL JOURNAL OF COMPUTER VISION, 2020, 128 (07) :1956-1981
[24]   General Multi-label Image Classification with Transformers [J].
Lanchantin, Jack ;
Wang, Tianlu ;
Ordonez, Vicente ;
Qi, Yanjun .
2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, :16473-16483
[25]  
Lee Hyuck, 2021, Advances in Neural Information Processing Systems, V34
[26]  
Li PY, 2020, PROCEEDINGS OF THE TWENTY-NINTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, P1359
[27]  
Li SK, 2022, ADV NEUR IN
[28]  
Lin JY, 2018, Arxiv, DOI arXiv:1808.08561
[29]   Microsoft COCO: Common Objects in Context [J].
Lin, Tsung-Yi ;
Maire, Michael ;
Belongie, Serge ;
Hays, James ;
Perona, Pietro ;
Ramanan, Deva ;
Dollar, Piotr ;
Zitnick, C. Lawrence .
COMPUTER VISION - ECCV 2014, PT V, 2014, 8693 :740-755
[30]   The Emerging Trends of Multi-Label Learning [J].
Liu, Weiwei ;
Wang, Haobo ;
Shen, Xiaobo ;
Tsang, Ivor W. .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2022, 44 (11) :7955-7974