To Be Critical: Self-calibrated Weakly Supervised Learning for Salient Object Detection
被引:0
|
作者:
Wang, Jian
论文数: 0引用数: 0
h-index: 0
机构:
Dalian Univ Technol, Dalian, Peoples R ChinaDalian Univ Technol, Dalian, Peoples R China
Wang, Jian
[1
]
Liu, Tingwei
论文数: 0引用数: 0
h-index: 0
机构:
Dalian Univ Technol, Dalian, Peoples R ChinaDalian Univ Technol, Dalian, Peoples R China
Liu, Tingwei
[1
]
Zhang, Miao
论文数: 0引用数: 0
h-index: 0
机构:
Dalian Univ Technol, Dalian, Peoples R China
Key Lab Ubiquitous Network & Serv Software Liaoni, Dalian, Peoples R ChinaDalian Univ Technol, Dalian, Peoples R China
Zhang, Miao
[1
,2
]
Piao, Yongri
论文数: 0引用数: 0
h-index: 0
机构:
Dalian Univ Technol, Dalian, Peoples R ChinaDalian Univ Technol, Dalian, Peoples R China
Piao, Yongri
[1
]
机构:
[1] Dalian Univ Technol, Dalian, Peoples R China
[2] Key Lab Ubiquitous Network & Serv Software Liaoni, Dalian, Peoples R China
来源:
PATTERN RECOGNITION AND COMPUTER VISION, PRCV 2023, PT XI
|
2024年
/
14435卷
基金:
中国国家自然科学基金;
关键词:
Salient object detection;
Weakly supervised learning;
Deep learning;
D O I:
10.1007/978-981-99-8552-4_15
中图分类号:
TP18 [人工智能理论];
学科分类号:
081104 ;
0812 ;
0835 ;
1405 ;
摘要:
Weakly-supervised salient object detection (WSOD) aims to develop saliency models using image-level annotations. Despite of the success of previous works, explorations on an effective training strategy for the saliency network and accurate matches between image-level annotations and salient objects are still inadequate. In this work, 1) we propose a self-calibrated training strategy by explicitly establishing a mutual calibration loop between pseudo labels and network predictions, liberating the saliency network from error-prone propagation caused by pseudo labels. 2) we prove that even a much smaller dataset (merely 1.8% of ImageNet) with well-matched annotations can facilitate models to achieve better performance as well as generalizability. This sheds new light on the development of WSOD and encourages more contributions to the community. Comprehensive experiments demonstrate that our method outperforms all the existing WSOD methods by adopting the self-calibrated strategy only. Steady improvements are further achieved by training on the proposed dataset. Additionally, our method achieves 94.7% of the performance of fully-supervised methods on average. And what is more, the fully supervised models adopting our predicted results as "ground truths" achieve successful results (95.6% for BASNet and 97.3% for ITSD on F-measure), while costing only 0.32% of labeling time for pixel-level annotation. The code and dataset are available at https://github.com/DUTyimmy/SCW.
机构:
CVTE Res, Guangzhou, Guangdong, Peoples R China
Guangdong Univ Technol, Guangzhou, Guangdong, Peoples R ChinaCVTE Res, Guangzhou, Guangdong, Peoples R China
Wang, Jiawei
Zhu, Shuai
论文数: 0引用数: 0
h-index: 0
机构:
CVTE Res, Guangzhou, Guangdong, Peoples R ChinaCVTE Res, Guangzhou, Guangdong, Peoples R China
Zhu, Shuai
Xu, Jiao
论文数: 0引用数: 0
h-index: 0
机构:
CVTE Res, Guangzhou, Guangdong, Peoples R ChinaCVTE Res, Guangzhou, Guangdong, Peoples R China
Xu, Jiao
Cao, Da
论文数: 0引用数: 0
h-index: 0
机构:
Hunan Univ, Changsha, Hunan, Peoples R ChinaCVTE Res, Guangzhou, Guangdong, Peoples R China
Cao, Da
PROCEEDINGS OF THE 27TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA (MM'19),
2019,
: 2548
-
2552