A dual-stream learning framework for weakly supervised salient object detection with multi-strategy integration

Times cited: 0

Authors:

Liu, Yuyan [1]

Zhang, Qing [1]

Zhao, Yilin [1]

Shi, Yanjiao [1]

Affiliations:

[1] Shanghai Institute of Technology, School of Computer Science and Information Engineering, Shanghai 201418, People's Republic of China

Source:

VISUAL COMPUTER | 2025

Funding:

Natural Science Foundation of Shanghai;

Keywords:

Salient object detection; Scribble annotations; Weakly supervised learning; Dual-stream network; Feature integration;

DOI:

10.1007/s00371-025-03798-9

Chinese Library Classification (CLC) number:

TP31 [Computer software];

Discipline classification code:

081202 ; 0835 ;

Abstract:

Recently, several scribble-based weakly supervised salient object detection (SOD) methods have been proposed to alleviate the burden of expensive and time-consuming pixel-level data labeling in fully supervised SOD. However, because scribble annotations lack salient object structure information, it is difficult for a model to accurately discriminate and learn explicit boundaries during training. In this paper, we propose a dual-stream learning framework that employs an individual encoding stream to obtain boundary information, helping the network identify integral salient regions and accurate structural details. Additionally, we adopt different strategies (i.e., the boundary-aware semantics enhancement module for the high levels and the boundary-aware detail enhancement module for the low levels) to better integrate boundary information with object features at different levels, taking full advantage of the distinct properties of salient object features at each level. Extensive experiments show that our model achieves competitive performance against state-of-the-art weakly supervised SOD methods, demonstrating the superiority and effectiveness of our proposed network. The code and results are available at https://github.com/boom118/BSnet.


Pages: 16

