UNM: A Universal Approach for Noisy Multi-Label Learning

被引:0
作者
Chen, Jia-Yao [1 ]
Li, Shao-Yuan [1 ]
Huang, Sheng-Jun [1 ]
Chen, Songcan [1 ]
Wang, Lei [1 ]
Xie, Ming-Kun [1 ]
机构
[1] Nanjing Univ Aeronaut & Astronaut NUAA, Coll Comp Sci & Technol, MIIT Key Lab Pattern Anal & Machine Intelligence, Nanjing 211106, Peoples R China
基金
国家重点研发计划; 中国国家自然科学基金;
关键词
Noise measurement; Correlation; Semantics; Image classification; Training; Task analysis; Computational modeling; Label refinement; multi-label classification; noisy labels;
D O I
10.1109/TKDE.2024.3373500
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Multi-label image classification relies on a large-scale, well-maintained dataset, which may easily be mislabeled due to various subjective reasons. Existing methods for coping with noise usually focus on improving the model robustness in the case of single-label noise. However, compared with noisy single-label learning, noisy multi-label learning is more practical and challenging. To reduce the negative impact of noisy multi-annotations, we propose a universal approach for noisy multi-label learning (UNM). In UNM, we propose the label-wise embedding network which investigates the semantic alignment between label embeddings and their corresponding output features to learn robust feature representations. Meanwhile, mining the co-occurrence of multi-labels is also added to regularize the noisy network predictions. We cyclically change the fitting status of our label-wise embedding network to distinguish the noisy samples and generate pseudo labels for them. As a result, UNM provides an effective way to exploit the label-wise features and semantic label embeddings in noisy scenarios. To verify the generalizability of our method, we also test our method on Partial Multi-label Learning (PML) and Multi-label Learning with Missing Labels (MLML). Extensive experiments on benchmark datasets including Microsoft COCO, Pascal VOC, and Visual Genome explicitly validate the proposed method.
引用
收藏
页码:4968 / 4980
页数:13
相关论文
共 55 条
[1]  
Arpit D, 2017, PR MACH LEARN RES, V70
[2]  
Bai JW, 2020, Arxiv, DOI arXiv:2007.06126
[3]  
Bai JW, 2022, PR MACH LEARN RES
[4]  
Bai YB, 2021, ADV NEUR IN
[5]  
Ben-Baruch E, 2021, Arxiv, DOI arXiv:2009.14119
[6]  
Chen TS, 2022, AAAI CONF ARTIF INTE, P339
[7]   Learning Semantic-Specific Graph Representation for Multi-Label Image Recognition [J].
Chen, Tianshui ;
Xu, Muxin ;
Hui, Xiaolu ;
Wu, Hefeng ;
Lin, Liang .
2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, :522-531
[8]  
Chen TS, 2018, AAAI CONF ARTIF INTE, P6730
[9]   Multi-Label Image Recognition with Graph Convolutional Networks [J].
Chen, Zhao-Min ;
Wei, Xiu-Shen ;
Wang, Peng ;
Guo, Yanwen .
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :5172-5181
[10]   Randaugment: Practical automated data augmentation with a reduced search space [J].
Cubuk, Ekin D. ;
Zoph, Barret ;
Shlens, Jonathon ;
Le, Quoc, V .
2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW 2020), 2020, :3008-3017