Adaptive Spatial Tokenization Transformer for Salient Object Detection in Optical Remote Sensing Images

被引:22
|
作者
Gao, Lina [1 ]
Liu, Bing [1 ]
Fu, Ping [1 ]
Xu, Mingzhu [2 ]
机构
[1] Harbin Inst Technol, Sch Elect & Informat Engn, Harbin 150001, Peoples R China
[2] Shandong Univ, Sch Software, Jinan 250101, Peoples R China
来源
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING | 2023年 / 61卷
基金
中国国家自然科学基金;
关键词
Transformers; Adaptation models; Object detection; Tokenization; Optical imaging; Optical sensors; Feature extraction; Adaptive tokenization; optical remote sensing images (ORSIs); salient object detection (SOD); transformer; REGION DETECTION; TARGET DETECTION; NETWORK;
D O I
10.1109/TGRS.2023.3242987
中图分类号
P3 [地球物理学]; P59 [地球化学];
学科分类号
0708 ; 070902 ;
摘要
Convolutional neural network (CNN)-based salient object detection (SOD) models have achieved promising performance in optical remote sensing images (ORSIs) in recent years. However, the restriction concerning the local sliding window operation of CNN has caused many existing CNN-based ORSI SOD models to still struggle with learning long-range relationships. To this end, a novel transformer framework is proposed for ORSI SOD, which is inspired by the powerful global dependency relationships of transformer networks. This is the first attempt to explore global and local details using transformer architecture for SOD in ORSIs. Concretely, we design an adaptive spatial tokenization transformer encoder to extract global-local features, which can accurately sparsify tokens for each input image and achieve competitive performance in ORSI SOD tasks. Then, a specific dense token aggregation decoder (DTAD) is proposed to generate saliency results, including three cascade decoders to integrate the global-local tokens and contextual dependencies. Extensive experiments indicate that the proposed model greatly surpasses 20 state-of-the-art (SOTA) SOD approaches on two standard ORSI SOD datasets under seven evaluation metrics. We also report comparison results to demonstrate the generalization capacity on the latest challenging ORSI datasets. In addition, we validate the contributions of different modules through a series of ablation analyses, especially the proposed adaptive spatial tokenization module (ASTM), which can halve the computational budget.
引用
收藏
页数:15
相关论文
共 50 条
  • [21] United Domain Cognition Network for Salient Object Detection in Optical Remote Sensing Images
    Sun, Yanguang
    Yang, Jian
    Luo, Lei
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2024, 62
  • [22] Progressive Enhancement of Foreground Features for Salient Object Detection in Optical Remote Sensing Images
    Meng, Lingbing
    Li, Haiqun
    Han, Huihui
    Xu, Meng
    Wu, Jinhua
    Hou, Shuonan
    Duan, Weiwei
    IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2025, 18 : 7572 - 7591
  • [23] Adjacent Context Coordination Network for Salient Object Detection in Optical Remote Sensing Images
    Li, Gongyang
    Liu, Zhi
    Zeng, Dan
    Lin, Weisi
    Ling, Haibin
    IEEE TRANSACTIONS ON CYBERNETICS, 2023, 53 (01) : 526 - 538
  • [24] Lightweight Salient Object Detection in Optical Remote Sensing Images via Feature Correlation
    Li, Gongyang
    Liu, Zhi
    Bai, Zhen
    Lin, Weisi
    Ling, Haibin
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60
  • [25] Heterogeneous Feature Collaboration Network for Salient Object Detection in Optical Remote Sensing Images
    Liu, Yutong
    Xu, Mingzhu
    Xiao, Tianxiang
    Tang, Haoyu
    Hu, Yupeng
    Nie, Liqiang
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2024, 62
  • [26] Adjacent Context Coordination Network for Salient Object Detection in Optical Remote Sensing Images
    Li, Gongyang
    Liu, Zhi
    Zeng, Dan
    Lin, Weisi
    Ling, Haibin
    IEEE Transactions on Cybernetics, 2023, 53 (01): : 526 - 538
  • [27] Boundary-Aware Salient Object Detection in Optical Remote-Sensing Images
    Yu, Longxuan
    Zhou, Xiaofei
    Wang, Lingbo
    Zhang, Jiyong
    ELECTRONICS, 2022, 11 (24)
  • [28] Dense Attention Fluid Network for Salient Object Detection in Optical Remote Sensing Images
    Zhang, Qijian
    Cong, Runmin
    Li, Chongyi
    Cheng, Ming-Ming
    Fang, Yuming
    Cao, Xiaochun
    Zhao, Yao
    Kwong, Sam
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2021, 30 : 1305 - 1317
  • [29] Dense Attention Fluid Network for Salient Object Detection in Optical Remote Sensing Images
    Zhang, Qijian
    Cong, Runmin
    Li, Chongyi
    Cheng, Ming-Ming
    Fang, Yuming
    Cao, Xiaochun
    Zhao, Yao
    Kwong, Sam
    IEEE Transactions on Image Processing, 2021, 30 : 1305 - 1317
  • [30] Recurrent Adaptive Graph Reasoning Network With Region and Boundary Interaction for Salient Object Detection in Optical Remote Sensing Images
    Zhao, Jie
    Jia, Yun
    Ma, Lin
    Yu, Lidan
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2024, 62