A dual-stream learning framework for weakly supervised salient object detection with multi-strategy integration

被引:0
作者
Liu, Yuyan [1 ]
Zhang, Qing [1 ]
Zhao, Yilin [1 ]
Shi, Yanjiao [1 ]
机构
[1] Shanghai Inst Technol, Sch Comp Sci & Informat Engn, Shanghai 201418, Peoples R China
基金
上海市自然科学基金;
关键词
Salient object detection; Scribble annotations; Weakly supervised learning; Dual-stream network; Feature integration;
D O I
10.1007/s00371-025-03798-9
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Recently, some scribble-based weakly supervised salient object detection (SOD) methods have been proposed to alleviate the heavy burden of expensive and time-consuming pixel-level data labeling in fully supervised SOD. However, due to the lack of salient object structure information in scribble annotations, it is difficult for a model to accurately discriminate and learn explicit boundaries during training. In this paper, we propose a dual-stream learning framework that employs an individual encoding stream to obtain boundary information to help the network identify integral salient regions and accurate structural details. Additionally, we adopt different strategies (i.e., the boundary-aware semantics enhancement module for the high levels, the boundary-aware detail enhancement module for the low levels) to better integrate boundary information with object features at different levels to take full advantage of different salient object feature properties. Extensive experiments show that our model achieves competitive performance against the state-of-the-art weakly supervised SOD methods, demonstrating the superiority and effectiveness of our proposed network. The code and results are released from the link: https://github.com/boom118/BSnet.
引用
收藏
页数:16
相关论文
共 46 条
  • [1] Chen ZY, 2020, AAAI CONF ARTIF INTE, V34, P10599
  • [2] A Weakly Supervised Learning Framework for Salient Object Detection via Hybrid Labels
    Cong, Runmin
    Qin, Qi
    Zhang, Chen
    Jiang, Qiuping
    Wang, Shiqi
    Zhao, Yao
    Kwong, Sam
    [J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2023, 33 (02) : 534 - 548
  • [3] A deep learning system for detecting diabetic retinopathy across the disease spectrum
    Dai, Ling
    Wu, Liang
    Li, Huating
    Cai, Chun
    Wu, Qiang
    Kong, Hongyu
    Liu, Ruhan
    Wang, Xiangning
    Hou, Xuhong
    Liu, Yuexing
    Long, Xiaoxue
    Wen, Yang
    Lu, Lina
    Shen, Yaxin
    Chen, Yan
    Shen, Dinggang
    Yang, Xiaokang
    Zou, Haidong
    Sheng, Bin
    Jia, Weiping
    [J]. NATURE COMMUNICATIONS, 2021, 12 (01)
  • [4] Dai LT, 2024, NAT PROD RES, DOI [10.1038/s41591-023-02702-z, 10.1080/14786419.2024.2394834]
  • [5] The Pascal Visual Object Classes (VOC) Challenge
    Everingham, Mark
    Van Gool, Luc
    Williams, Christopher K. I.
    Winn, John
    Zisserman, Andrew
    [J]. INTERNATIONAL JOURNAL OF COMPUTER VISION, 2010, 88 (02) : 303 - 338
  • [6] Fan DP, 2018, Arxiv, DOI arXiv:1805.10421
  • [7] Salient Objects in Clutter
    Fan, Deng-Ping
    Zhang, Jing
    Xu, Gang
    Cheng, Ming-Ming
    Shao, Ling
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (02) : 2344 - 2366
  • [8] Structure-measure: A New Way to Evaluate Foreground Maps
    Fan, Deng-Ping
    Cheng, Ming-Ming
    Liu, Yun
    Li, Tao
    Borji, Ali
    [J]. 2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, : 4558 - 4567
  • [9] Gao SY, 2022, AAAI CONF ARTIF INTE, P670
  • [10] He JF, 2012, PROC CVPR IEEE, P3005, DOI 10.1109/CVPR.2012.6248030