UNM: A Universal Approach for Noisy Multi-Label Learning

被引：0

作者：

Chen, Jia-Yao ^{[1
]}

Li, Shao-Yuan ^{[1
]}

Huang, Sheng-Jun ^{[1
]}

Chen, Songcan ^{[1
]}

Wang, Lei ^{[1
]}

Xie, Ming-Kun ^{[1
]}

机构：

[1] Nanjing Univ Aeronaut & Astronaut NUAA, Coll Comp Sci & Technol, MIIT Key Lab Pattern Anal & Machine Intelligence, Nanjing 211106, Peoples R China

来源：

IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING | 2024年 / 36卷 / 09期

基金：

国家重点研发计划; 中国国家自然科学基金;

关键词：

Noise measurement; Correlation; Semantics; Image classification; Training; Task analysis; Computational modeling; Label refinement; multi-label classification; noisy labels;

D O I：

10.1109/TKDE.2024.3373500

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Multi-label image classification relies on a large-scale, well-maintained dataset, which may easily be mislabeled due to various subjective reasons. Existing methods for coping with noise usually focus on improving the model robustness in the case of single-label noise. However, compared with noisy single-label learning, noisy multi-label learning is more practical and challenging. To reduce the negative impact of noisy multi-annotations, we propose a universal approach for noisy multi-label learning (UNM). In UNM, we propose the label-wise embedding network which investigates the semantic alignment between label embeddings and their corresponding output features to learn robust feature representations. Meanwhile, mining the co-occurrence of multi-labels is also added to regularize the noisy network predictions. We cyclically change the fitting status of our label-wise embedding network to distinguish the noisy samples and generate pseudo labels for them. As a result, UNM provides an effective way to exploit the label-wise features and semantic label embeddings in noisy scenarios. To verify the generalizability of our method, we also test our method on Partial Multi-label Learning (PML) and Multi-label Learning with Missing Labels (MLML). Extensive experiments on benchmark datasets including Microsoft COCO, Pascal VOC, and Visual Genome explicitly validate the proposed method.

引用

页码：4968 / 4980

页数：13

共 55 条

[1]

Arpit D, 2017, PR MACH LEARN RES, V70

[2]

Bai JW, 2020, Arxiv, DOI arXiv:2007.06126

[3]

Bai JW, 2022, PR MACH LEARN RES

[4]

Bai YB, 2021, ADV NEUR IN

[5]

Ben-Baruch E, 2021, Arxiv, DOI arXiv:2009.14119

[6]

Chen TS, 2022, AAAI CONF ARTIF INTE, P339

[7] Learning Semantic-Specific Graph Representation for Multi-Label Image Recognition [J].

Chen, Tianshui ;

Xu, Muxin ;

Hui, Xiaolu ;

Wu, Hefeng ;

Lin, Liang .

2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, :522-531

[8]

Chen TS, 2018, AAAI CONF ARTIF INTE, P6730

[9] Multi-Label Image Recognition with Graph Convolutional Networks [J].

Chen, Zhao-Min ;

Wei, Xiu-Shen ;

Wang, Peng ;

Guo, Yanwen .

2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :5172-5181

[10] Randaugment: Practical automated data augmentation with a reduced search space [J].

Cubuk, Ekin D. ;

Zoph, Barret ;

Shlens, Jonathon ;

Le, Quoc, V .

2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW 2020), 2020, :3008-3017

← 1 2 3 4 5 6 →