Iterative Saliency Aggregation and Assignment Network for Efficient Salient Object Detection in Optical Remote Sensing Images

被引：6

作者：

Yao, Zhaojian ^{[1
]}

Gao, Wei ^{[1
,2
]}

机构：

[1] Peking Univ, Sch Elect & Comp Engn, Shenzhen Grad Sch, Shenzhen 518055, Peoples R China

[2] Peng Cheng Lab, Shenzhen 518066, Peoples R China

来源：

IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING | 2024年 / 62卷

关键词：

Decoding; Accuracy; Object detection; Feature extraction; Iterative methods; Object recognition; Hierarchical interaction and dynamic integration (HIDI); iterative saliency aggregation and assignment; remote sensing; salient object detection;

D O I：

10.1109/TGRS.2024.3425658

中图分类号：

P3 [地球物理学]; P59 [地球化学];

学科分类号：

0708 ; 070902 ;

摘要：

Motivated by the pursuit of efficient salient object detection in remote sensing, researchers have devoted considerable efforts to devising lightweight models due to low running efficiency of the cumbersome models. Although the existing lightweight models have made impressive progress in improving efficiency, there is often a notable sacrifice in inference accuracy, making it challenging to attain both high-quality output and high efficiency. In this article, we propose an iterative saliency aggregation and assignment network (ISAANet) to reconcile the dilemma of balancing accuracy and efficiency. ISAANet adopts a recurrent architecture with bidirectional communication between the encoder and the decoder to boost detection performance. Specifically, a saliency aggregation mechanism is used to generate complementary information by integrating multilevel saliency cues, corresponding to the feedforward information flow from the encoder to the decoder. As for the feedback flows from the decoder to the encoder, a saliency assignment approach is proposed to inject the region and edge prompts into the encoder for hierarchical feature updates, emphasizing the representation of salient information. The encoder is progressively reinforced by alternating iterations of saliency aggregation and assignment, which further helps the decoder to produce more reliable detection results. Moreover, the effective fusion of cross-level features is key to realizing information complementarity and enhancing aggregation quality. However, conventional fusion schemes often neglect feature importance disparities, which can lead to noise accumulation. To address this issue, we introduce a hierarchical interaction and dynamic integration (HIDI) module for feature fusion. The HIDI module groups fusion features and dynamically adjusts their weights based on their information quality, which suppresses noise interference and strengthens the model's learning of meaningful features. The experimental results demonstrate that the proposed ISAANet not only effectively compensates for the accuracy deficiency of the existing lightweight models but also achieves a faster running speed of 108 FPS, which shows a favorable tradeoff between efficiency and accuracy. The code will be available at https://github.com/YiuCK/ISAANet and https://openi.pcl.ac.cn/OpenVision/ISAANet.

引用

页数：13

共 57 条

[1]

Achanta R, 2009, PROC CVPR IEEE, P1597, DOI 10.1109/CVPRW.2009.5206596

[2] Recurrent Multi-scale Transformer for High-Resolution Salient Object Detection [J].

Deng, Xinhao ;

Zhang, Pingping ;

Liu, Wei ;

Lu, Huchuan .

PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2023, 2023, :7413-7423

[3] Structure-measure: A New Way to Evaluate Foreground Maps [J].

Fan, Deng-Ping ;

Cheng, Ming-Ming ;

Liu, Yun ;

Li, Tao ;

Borji, Ali .

2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, :4558-4567

[4]

Fan DP, 2018, PROCEEDINGS OF THE TWENTY-SEVENTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, P698

[5] Screen-based 3D Subjective Experiment Software [J].

Fan, Songlin ;

Gao, Wei .

PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2023, 2023, :9672-9675

[6] Salient Object Detection for Point Clouds [J].

Fan, Songlin ;

Gao, Wei ;

Li, Ge .

COMPUTER VISION - ECCV 2022, PT XXVIII, 2022, 13688 :1-19

[7] Densely nested top-down flows for salient object detection [J].

Fang, Chaowei ;

Tian, Haibin ;

Zhang, Dingwen ;

Zhang, Qiang ;

Han, Jungong ;

Han, Junwei .

SCIENCE CHINA-INFORMATION SCIENCES, 2022, 65 (08)

[8] Highly Efficient Salient Object Detection with 100K Parameters [J].

Gao, Shang-Hua ;

Tan, Yong-Qiang ;

Cheng, Ming-Ming ;

Lu, Chengze ;

Chen, Yunpeng ;

Yan, Shuicheng .

COMPUTER VISION - ECCV 2020, PT VI, 2020, 12351 :702-721

[9] A Thorough Benchmark and a New Model for Light Field Saliency Detection [J].

Gao, Wei ;

Fan, Songlin ;

Li, Ge ;

Lin, Weisi .

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (07) :8003-8019

[10] Unified Information Fusion Network for Multi-Modal RGB-D and RGB-T Salient Object Detection [J].

Gao, Wei ;

Liao, Guibiao ;

Ma, Siwei ;

Li, Ge ;

Liang, Yongsheng ;

Lin, Weisi .

IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2022, 32 (04) :2091-2106

← 1 2 3 4 5 6 →