Bidirectional mutual guidance transformer for salient object detection in optical remote sensing images

Citations: 6
Authors
Huang, Kan [1]
Tian, Chunwei [2]
Li, Ge [3]
Affiliations
[1] Shanghai Maritime Univ, Coll Informat Engn, Shanghai, Peoples R China
[2] Northwestern Polytech Univ, Sch Software, Xian, Peoples R China
[3] Peking Univ, Sch Elect & Comp Engn, Shenzhen, Peoples R China
Funding
China Postdoctoral Science Foundation; U.S. National Science Foundation;
Keywords
Salient object detection; optical remote sensing images; Transformer; NETWORK;
DOI
10.1080/01431161.2023.2229494
Chinese Library Classification number
TP7 [Remote sensing technology];
Discipline classification codes
081102 ; 0816 ; 081602 ; 083002 ; 1404 ;
Abstract
Salient object detection in optical remote sensing images is challenging because such images often exhibit cluttered backgrounds, varying object scales, and unstable imaging conditions. In this paper, we present a Bidirectional Mutual Guidance Transformer (BMGT), which mitigates the locality issue of CNN-based models and exploits the mutual guidance between global context-aware object representations and fine-grained boundary structures. It contains a hierarchically structured Transformer encoder that extracts multi-level, multi-scale token representations, and a dual-stream cross-task MLP decoder that performs joint salient object detection and salient boundary detection in an end-to-end manner. In particular, the dual-stream decoder consists of two sub-branch networks with symmetric architectures, which are connected by a newly proposed Mutual Guidance MLP layer (MG-MLP). Through MG-MLP, salient object features and salient boundary features interact with each other, facilitating complementary learning at multiple network levels. Extensive evaluations demonstrate that the proposed method outperforms existing methods on two public remote sensing image benchmarks. These results indicate that BMGT is advantageous in exploiting long-range context dependencies as well as preserving fine-grained boundary structures.
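The abstract does not spell out the internals of MG-MLP, so the following is only an illustrative sketch of the general idea of two decoder streams guiding each other. It assumes, purely for illustration, that each stream adds a small MLP projection of the other stream's tokens as a residual. The names `mutual_guidance`, `init_mlp`, `obj_from_bnd`, and `bnd_from_obj`, the ReLU activation, and all dimensions are hypothetical and are not taken from the paper.

```python
import random

def linear(x, w, b):
    """Compute x @ w + b for one token vector x; w is a list of in_dim rows."""
    return [sum(xi * wij for xi, wij in zip(x, col)) + bj
            for col, bj in zip(zip(*w), b)]

def mlp(x, params):
    """Two-layer MLP with ReLU (the activation choice is an assumption)."""
    w1, b1, w2, b2 = params
    hidden = [max(0.0, h) for h in linear(x, w1, b1)]
    return linear(hidden, w2, b2)

def init_mlp(in_dim, hidden_dim, out_dim, rng):
    """Small random weights and zero biases for a two-layer MLP."""
    def mat(rows, cols):
        return [[rng.uniform(-0.1, 0.1) for _ in range(cols)] for _ in range(rows)]
    return (mat(in_dim, hidden_dim), [0.0] * hidden_dim,
            mat(hidden_dim, out_dim), [0.0] * out_dim)

def mutual_guidance(obj_tokens, bnd_tokens, obj_from_bnd, bnd_from_obj):
    """Hypothetical exchange step: each stream receives an MLP projection
    of the other stream's tokens, added as a residual."""
    new_obj = [[o + g for o, g in zip(o_tok, mlp(b_tok, obj_from_bnd))]
               for o_tok, b_tok in zip(obj_tokens, bnd_tokens)]
    new_bnd = [[b + g for b, g in zip(b_tok, mlp(o_tok, bnd_from_obj))]
               for o_tok, b_tok in zip(obj_tokens, bnd_tokens)]
    return new_obj, new_bnd

# Toy example: 3 tokens per stream, 4 channels each (arbitrary sizes).
rng = random.Random(0)
dim, hidden_dim, num_tokens = 4, 8, 3
obj_tokens = [[rng.random() for _ in range(dim)] for _ in range(num_tokens)]
bnd_tokens = [[rng.random() for _ in range(dim)] for _ in range(num_tokens)]
obj_from_bnd = init_mlp(dim, hidden_dim, dim, rng)
bnd_from_obj = init_mlp(dim, hidden_dim, dim, rng)

new_obj, new_bnd = mutual_guidance(obj_tokens, bnd_tokens, obj_from_bnd, bnd_from_obj)
print(len(new_obj), len(new_obj[0]))
```

Both updated streams are computed from the original inputs, so the exchange is symmetric, matching the abstract's description of two symmetric sub-branches. A real decoder would apply such a step at several feature levels and end each stream with a prediction head.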
Pages: 4016-4033
Page count: 18