Exploring a Lightweight and Efficient Network for Salient Object Detection in ORSI

Cited by: 0
Authors
Han, Jinyu [1]
Sun, Fuming [1]
Hou, Yaoyao [1]
Sun, Jing [1]
Li, Haojie [2]
Affiliations
[1] Dalian Minzu Univ, Sch Informat & Commun Engn, Dalian 116600, Peoples R China
[2] Shandong Univ Sci & Technol, Sch Comp Sci & Engn, Qingdao 266590, Peoples R China
Source
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING | 2025 / Vol. 63
Funding
National Natural Science Foundation of China;
Keywords
Feature extraction; Object detection; Computational modeling; Decoding; Accuracy; Computational efficiency; Transformers; Sun; Remote sensing; Optical sensors; Lightweight; optical remote sensing images (ORSIs); parameters; plug-and-play; salient object detection (SOD);
DOI
10.1109/TGRS.2025.3584963
Chinese Library Classification number
P3 [Geophysics]; P59 [Geochemistry];
Discipline classification code
0708; 070902;
Abstract
In recent years, salient object detection in optical remote sensing images (ORSI-SOD) has made substantial progress, yet it remains an open research area with complex challenges. Most existing ORSI-SOD methods pursue high detection performance at the price of large parameter counts and high computational cost, which restricts their use on resource-constrained devices with limited computing power and memory. To tackle this issue, we propose a lightweight and highly efficient ORSI-SOD network, termed RAMENet. With only 5.18 M parameters and 8.72 G FLOPs, RAMENet achieves detection accuracy competitive with state-of-the-art (SOTA) methods. Specifically, we devise a dynamic region-aware block (DRB) that can be nested within the encoder in a plug-and-play manner. This enables the network to learn ORSI domain-specific feature representations and thus locate salient object regions more effectively. Furthermore, we present a novel multipath-enhanced M-shaped decoder (MED), which integrates both bottom-up and top-down paradigms. Comprising two feature extraction sub-branches and a central feature refinement branch, this architecture achieves multigranularity feature aggregation via cross-level feature interaction. Consequently, it significantly improves the representation of detail while maintaining the integrity of the object structure. Extensive experimental results indicate that RAMENet outperforms five SOTA lightweight methods in terms of S-alpha, F-beta (mean), and MAE on the EORSSD and ORSSD datasets, with improvements reaching 0.68%, 0.92%, 0.13%, 0.60%, 1.13%, and 0.07%, respectively. The code and results are available at https://github.com/hjy0518/RAMENet/
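For readers unfamiliar with the metrics named in the abstract, the following is a minimal sketch of how MAE and the mean F-beta score are commonly computed for saliency maps. It is not taken from the RAMENet code. The function names, the weighting `beta2 = 0.3`, the 256 evenly spaced thresholds, and the 0.5 cut-off for binarizing the ground truth are assumptions based on common practice in salient object detection. The structure measure S-alpha is more involved and is not covered here.

```python
from typing import Sequence

# A saliency map as nested sequences of floats in [0, 1] (illustrative type).
Map = Sequence[Sequence[float]]


def mae(pred: Map, gt: Map) -> float:
    """Mean absolute error between a predicted map and its ground truth."""
    total = sum(abs(p - g) for pr, gr in zip(pred, gt) for p, g in zip(pr, gr))
    count = sum(len(row) for row in pred)
    return total / count


def f_beta(pred: Map, gt: Map, threshold: float, beta2: float = 0.3) -> float:
    """F-measure after binarizing `pred` at `threshold`.

    beta2 = 0.3 is an assumed, conventional weighting, not a value
    stated in the abstract.
    """
    tp = fp = fn = 0
    for pr, gr in zip(pred, gt):
        for p, g in zip(pr, gr):
            predicted = p >= threshold
            positive = g >= 0.5  # assumed cut-off for the ground-truth mask
            if predicted and positive:
                tp += 1
            elif predicted:
                fp += 1
            elif positive:
                fn += 1
    if tp == 0:
        return 0.0
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    return (1 + beta2) * precision * recall / (beta2 * precision + recall)


def mean_f_beta(pred: Map, gt: Map, steps: int = 255, beta2: float = 0.3) -> float:
    """Average of f_beta over steps + 1 evenly spaced thresholds in [0, 1]."""
    thresholds = [i / steps for i in range(steps + 1)]
    return sum(f_beta(pred, gt, t, beta2) for t in thresholds) / len(thresholds)


if __name__ == "__main__":
    pred = [[0.9, 0.1], [0.8, 0.2]]
    gt = [[1.0, 0.0], [1.0, 0.0]]
    print("MAE:", mae(pred, gt))
    print("F-beta at 0.5:", f_beta(pred, gt, 0.5))
    print("mean F-beta:", mean_f_beta(pred, gt))
```

Lower MAE is better and higher F-beta is better, which is why an improvement can be reported for both.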
Pages: 14