Towards salient object detection via parallel dual-decoder network

Cited by: 0

Authors:

Cen, Chaojun [1]

Li, Fei [2]

Li, Zhenbo [1,3,4]

Wang, Yun [5]

Affiliations:

[1] China Agr Univ, Sch Coll Informat & Elect Engn, Beijing 100083, Peoples R China

[2] Univ Wisconsin Madison, Coll Agr & Life Sci, Madison, WI 53706 USA

[3] Minist Agr & Rural Affairs, Natl Innovat Ctr Digital Fishery, Beijing 100083, Peoples R China

[4] Minist Agr & Rural Affairs, Key Lab Smart Farming Technol Aquat Anim & Livesto, Beijing 100083, Peoples R China

[5] Henan Univ Technol, Sch Coll Informat Sci & Engn, Zhengzhou 450001, Henan, Peoples R China

Source:

ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE | 2025 / Vol. 139

Funding:

National Key Research and Development Program of China;

Keywords:

Salient object detection; Parallel dual-decoder; Transformer; Cross-attention;

DOI:

10.1016/j.engappai.2024.109638

Chinese Library Classification (CLC) number:

TP [Automation Technology, Computer Technology];

Discipline classification code:

0812;

Abstract:

Salient object detection, an important preprocessing step in computer vision, segments the most prominent objects in an image. However, existing research in this field uses transformer-based methods to capture global context information and fails to effectively obtain local spatial features. To solve this issue, we propose a parallel dual-decoder network, which consists of a novel semantic decoder and a modified salient decoder. Specifically, the proposed semantic decoder is designed to learn local spatial details, and the salient decoder uses learnable queries to establish global saliency dependencies among objects. Moreover, the two decoders establish correlations between saliency and multi-scale semantic representations through cross-attention interaction, significantly enhancing the performance of salient object detection. In other words, we obtain global context information in the decoder to prevent discriminative features from being diluted during information propagation. Extensive experiments on 15 benchmark datasets demonstrate that our model significantly outperforms other comparison methods and shows promising potential for real-world applications such as challenging optical remote sensing, underwater, low-light, and other open scenarios. In addition, our method shows excellent performance in other downstream tasks such as camouflaged object detection, transparent object detection, shadow detection, and semantic segmentation.
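
As a rough illustration of the cross-attention interaction mentioned in the abstract, the sketch below lets a set of query vectors attend over a set of feature vectors using plain scaled dot-product attention. It is not the paper's implementation: the function names, the toy inputs, the single attention head, and the absence of learned projection matrices are all assumptions made to keep the example short.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def cross_attention(queries, features):
    """Single-head scaled dot-product cross-attention (no learned projections).

    queries:  list of query vectors (standing in for learnable saliency queries)
    features: list of feature vectors (standing in for semantic-decoder features)
    Returns one updated vector per query, plus the attention weights.
    """
    dim = len(queries[0])
    scale = 1.0 / math.sqrt(dim)
    updated, weights = [], []
    for q in queries:
        # Similarity of this query to every feature location.
        scores = [scale * sum(qi * fi for qi, fi in zip(q, f)) for f in features]
        attn = softmax(scores)
        # Weighted sum of feature vectors, one output dimension at a time.
        out = [sum(a * f[d] for a, f in zip(attn, features)) for d in range(dim)]
        updated.append(out)
        weights.append(attn)
    return updated, weights


# Hypothetical toy inputs: 2 queries, 3 feature locations, dimension 2.
queries = [[1.0, 0.0], [0.0, 1.0]]
features = [[4.0, 0.0], [0.0, 4.0], [0.0, 0.0]]
updated, weights = cross_attention(queries, features)
```

In this toy setup each query concentrates its attention on the feature it aligns with, which is the basic mechanism by which queries can pull in information from another decoder's feature map.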


Pages: 19

