Dual-Decoupling Learning and Metric-Adaptive Thresholding for Semi-supervised Multi-label Learning

被引：0

作者：

Xiao, Jia-Hao ^{[1
]}

Xie, Ming-Kun ^{[1
]}

Fan, Heng-Bo ^{[1
]}

Niu, Gang ^{[2
]}

Sugiyama, Masashi ^{[2
,3
]}

Huang, Sheng-Jun ^{[1
]}

机构：

[1] Nanjing Univ Aeronaut & Astronaut, Nanjing, Peoples R China

[2] RIKEN, Ctr Adv Intelligence Project, Tokyo, Japan

[3] Univ Tokyo, Tokyo, Japan

来源：

COMPUTER VISION - ECCV 2024, PT LII | 2025年 / 15110卷

基金：

国家重点研发计划;

关键词：

Multi-label learning; Semi-supervised learning; CLASSIFICATION;

D O I：

10.1007/978-3-031-72943-0_25

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Semi-supervised multi-label learning (SSMLL) is a powerful framework for leveraging unlabeled data to reduce the expensive cost of collecting precise multi-label annotations. Unlike semi-supervised learning, one cannot select the most probable label as the pseudo-label in SSMLL due to multiple semantics contained in an instance. To solve this problem, the mainstream method developed an effective thresholding strategy to generate accurate pseudo-labels. Unfortunately, the method neglected the quality of model predictions and its potential impact on pseudo-labeling performance. In this paper, we propose a dual-perspective method to generate high-quality pseudo-labels. To improve the quality of model predictions, we perform dual-decoupling to boost the learning of correlative and discriminative features, while refining the generation and utilization of pseudo-labels. To obtain proper class-wise thresholds, we propose the metric-adaptive thresholding strategy to estimate the thresholds, which maximize the pseudo-label performance for a given metric on labeled data. Experiments on multiple benchmark datasets show the proposed method can achieve the state-of-the-art performance and outperform the comparative methods with a significant margin. The implementation is available at JiahaoXxX/SSMLL-D2L MAT.

引用

页码：437 / 454

页数：18

共 57 条

[21] Interactive Multi-Label CNN Learning with Partial Labels [J].

Huynh, Dat ;

Elhamifar, Ehsan .

2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2020), 2020, :9420-9429

[22] Large Loss Matters in Weakly Supervised Multi-Label Classification [J].

Kim, Youngwook ;

Kim, Jae Myung ;

Akata, Zeynep ;

Lee, Jungwoo .

2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2022, :14136-14145

[23] The Open Images Dataset V4 Unified Image Classification, Object Detection, and Visual Relationship Detection at Scale [J].

Kuznetsova, Alina ;

Rom, Hassan ;

Alldrin, Neil ;

Uijlings, Jasper ;

Krasin, Ivan ;

Pont-Tuset, Jordi ;

Kamali, Shahab ;

Popov, Stefan ;

Malloci, Matteo ;

Kolesnikov, Alexander ;

Duerig, Tom ;

Ferrari, Vittorio .

INTERNATIONAL JOURNAL OF COMPUTER VISION, 2020, 128 (07) :1956-1981

[24] General Multi-label Image Classification with Transformers [J].

Lanchantin, Jack ;

Wang, Tianlu ;

Ordonez, Vicente ;

Qi, Yanjun .

2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, :16473-16483

[25]

Lee Hyuck, 2021, Advances in Neural Information Processing Systems, V34

[26]

Li PY, 2020, PROCEEDINGS OF THE TWENTY-NINTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, P1359

[27]

Li SK, 2022, ADV NEUR IN

[28]

Lin JY, 2018, Arxiv, DOI arXiv:1808.08561

[29] Microsoft COCO: Common Objects in Context [J].

Lin, Tsung-Yi ;

Maire, Michael ;

Belongie, Serge ;

Hays, James ;

Perona, Pietro ;

Ramanan, Deva ;

Dollar, Piotr ;

Zitnick, C. Lawrence .

COMPUTER VISION - ECCV 2014, PT V, 2014, 8693 :740-755

[30] The Emerging Trends of Multi-Label Learning [J].

Liu, Weiwei ;

Wang, Haobo ;

Shen, Xiaobo ;

Tsang, Ivor W. .

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2022, 44 (11) :7955-7974

← 1 2 3 4 5 6 →