Task-Specific Loss for Robust Instance Segmentation With Noisy Class Labels

Cited by: 11
Authors
Yang, Longrong [1 ]
Li, Hongliang [1 ]
Meng, Fanman [1 ]
Wu, Qingbo [1 ]
Ngan, King Ngi [1 ]
Affiliations
[1] School of Information and Communication Engineering, University of Electronic Science and Technology of China, Chengdu 611731, China
Funding
National Natural Science Foundation of China
Keywords
Noisy class labels; instance segmentation; foreground-background sub-task; foreground-instance sub-task; self-supervised learning
DOI
10.1109/TCSVT.2021.3109084
Chinese Library Classification (CLC)
TM [Electrical Engineering]; TN [Electronic Technology, Communication Technology]
Discipline classification codes
0808; 0809
Abstract
Deep learning methods have achieved significant progress in instance segmentation when the training data are correctly annotated. However, object classes in large-scale datasets are sometimes ambiguous and easily confused, and the limited experience and knowledge of annotators can further lead to mislabeled object classes. To address this issue, this paper proposes a novel method that considers the different roles noisy class labels play in different sub-tasks. The method rests on two observations: first, the foreground-background annotation of a sample remains correct even when its class label is noisy; second, a symmetric loss improves robustness to noisy labels but hinders the learning of hard samples, whereas cross-entropy loss behaves in the opposite way. Accordingly, cross-entropy loss is used in the foreground-background sub-task to fully exploit the correct gradient guidance, while a symmetric loss is used in the foreground-instance sub-task to suppress the incorrect gradient guidance introduced by noisy class labels. Furthermore, a contrastive self-supervised loss is applied to update the features of all foreground samples, compensating for the insufficient guidance provided by partially correct labels, especially in highly noisy settings. Extensive experiments on three popular datasets (Pascal VOC, Cityscapes, and COCO) demonstrate the effectiveness of the method over a wide range of noisy class label settings.
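As a rough illustration of the loss design described in the abstract (not the authors' released code), the PyTorch-style sketch below uses plain binary cross entropy for the foreground-background (mask) sub-task, symmetric cross entropy as one concrete choice of noise-tolerant symmetric loss for the foreground-instance (classification) sub-task, and an InfoNCE-style contrastive term as a stand-in for the label-free feature learning. All function names, the weights alpha, beta, lam, mu, and the in-batch-negative contrastive formulation are illustrative assumptions, not values or code from the paper.

import torch
import torch.nn.functional as F

def symmetric_cross_entropy(cls_logits, labels, alpha=0.1, beta=1.0, log_clip=-4.0):
    # Standard CE term: full gradient, needed to fit hard but correctly labelled samples.
    ce = F.cross_entropy(cls_logits, labels)
    # Reverse CE term: its gradient is bounded, so samples with wrong class
    # labels cannot dominate training.
    pred = F.softmax(cls_logits, dim=1).clamp(min=1e-7)
    one_hot = F.one_hot(labels, num_classes=cls_logits.size(1)).float()
    log_one_hot = torch.clamp(torch.log(one_hot + 1e-30), min=log_clip)
    rce = -(pred * log_one_hot).sum(dim=1).mean()
    return alpha * ce + beta * rce

def contrastive_loss(feat_q, feat_k, temperature=0.2):
    # Label-free InfoNCE-style term on foreground features: two embeddings of
    # the same region are positives, other regions in the batch are negatives.
    q = F.normalize(feat_q, dim=1)
    k = F.normalize(feat_k, dim=1)
    logits = q @ k.t() / temperature
    targets = torch.arange(q.size(0), device=q.device)
    return F.cross_entropy(logits, targets)

def task_specific_loss(cls_logits, labels, mask_logits, mask_targets,
                       feat_q, feat_k, lam=1.0, mu=1.0):
    # Foreground-background sub-task: a wrong class label does not change which
    # pixels are foreground, so plain cross entropy keeps the full gradient.
    loss_fg_bg = F.binary_cross_entropy_with_logits(mask_logits, mask_targets)
    # Foreground-instance sub-task: the class label may be noisy, so a symmetric
    # loss limits the damage from incorrect gradient guidance.
    loss_fg_inst = symmetric_cross_entropy(cls_logits, labels)
    # Contrastive term: updates foreground features without using class labels.
    loss_con = contrastive_loss(feat_q, feat_k)
    return loss_fg_bg + lam * loss_fg_inst + mu * loss_con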
Pages: 213-227
Number of pages: 15