Adaptive Spatial Tokenization Transformer for Salient Object Detection in Optical Remote Sensing Images

被引：22

作者：

Gao, Lina ^{[1
]}

Liu, Bing ^{[1
]}

Fu, Ping ^{[1
]}

Xu, Mingzhu ^{[2
]}

机构：

[1] Harbin Inst Technol, Sch Elect & Informat Engn, Harbin 150001, Peoples R China

[2] Shandong Univ, Sch Software, Jinan 250101, Peoples R China

来源：

IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING | 2023年 / 61卷

基金：

中国国家自然科学基金;

关键词：

Transformers; Adaptation models; Object detection; Tokenization; Optical imaging; Optical sensors; Feature extraction; Adaptive tokenization; optical remote sensing images (ORSIs); salient object detection (SOD); transformer; REGION DETECTION; TARGET DETECTION; NETWORK;

D O I：

10.1109/TGRS.2023.3242987

中图分类号：

P3 [地球物理学]; P59 [地球化学];

学科分类号：

0708 ; 070902 ;

摘要：

Convolutional neural network (CNN)-based salient object detection (SOD) models have achieved promising performance in optical remote sensing images (ORSIs) in recent years. However, the restriction concerning the local sliding window operation of CNN has caused many existing CNN-based ORSI SOD models to still struggle with learning long-range relationships. To this end, a novel transformer framework is proposed for ORSI SOD, which is inspired by the powerful global dependency relationships of transformer networks. This is the first attempt to explore global and local details using transformer architecture for SOD in ORSIs. Concretely, we design an adaptive spatial tokenization transformer encoder to extract global-local features, which can accurately sparsify tokens for each input image and achieve competitive performance in ORSI SOD tasks. Then, a specific dense token aggregation decoder (DTAD) is proposed to generate saliency results, including three cascade decoders to integrate the global-local tokens and contextual dependencies. Extensive experiments indicate that the proposed model greatly surpasses 20 state-of-the-art (SOTA) SOD approaches on two standard ORSI SOD datasets under seven evaluation metrics. We also report comparison results to demonstrate the generalization capacity on the latest challenging ORSI datasets. In addition, we validate the contributions of different modules through a series of ablation analyses, especially the proposed adaptive spatial tokenization module (ASTM), which can halve the computational budget.

引用

页数：15

共 50 条

[31] GGRNet: Global Graph Reasoning Network for Salient Object Detection in Optical Remote Sensing Images
Liu, Xuan
Zhang, Yumo
Cong, Runmin
Zhang, Chen
Yang, Ning
Zhang, Chunjie
Zhao, Yao
PATTERN RECOGNITION AND COMPUTER VISION, PRCV 2021, PT II, 2021, 13020 : 584 - 596
[32] Semantic-Edge Interactive Network for Salient Object Detection in Optical Remote Sensing Images
Luo, Huilan
Liang, Bocheng
IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2023, 16 : 6980 - 6994
[33] Salient Object Detection in Optical Remote Sensing Images Based on Global Context Mixed Attention
Yan, Longquan
Yan, Ruixiang
Geng, Guohua
Zhou, Mingquan
Chen, Rong
JOURNAL OF THE INDIAN SOCIETY OF REMOTE SENSING, 2024, 52 (07) : 1489 - 1499
[34] Progressive Complementation Network With Semantics and Details for Salient Object Detection in Optical Remote Sensing Images
Zhao, Rundong
Zheng, Panpan
Zhang, Cui
Wang, Liejun
IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2024, 17 : 8626 - 8641
[35] Alignment Integration Network for Salient Object Detection and Its Application for Optical Remote Sensing Images
Zhang, Xiaoning
Yu, Yi
Wang, Yuqing
Chen, Xiaolin
Wang, Chenglong
SENSORS, 2023, 23 (14)
[36] Multi-Content Complementation Network for Salient Object Detection in Optical Remote Sensing Images
Li, Gongyang
Liu, Zhi
Lin, Weisi
Ling, Haibin
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60
[37] Multiscale Feature Adaptive Fusion for Object Detection in Optical Remote Sensing Images
Lv, Hao
Qian, Weixing
Chen, Tianxiao
Yang, Han
Zhou, Xuecheng
IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2022, 19
[38] Global Perception Network for Salient Object Detection in Remote Sensing Images
Liu, Yu
Zhang, Shanwen
Wang, Zhen
Zhao, Baoping
Zou, Lincheng
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60
[39] Learning to Adapt Using Test-Time Images for Salient Object Detection in Optical Remote Sensing Images
Huang, Kan
Fang, Leyuan
Tian, Chunwei
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2024, 62
[40] GINet:Graph interactive network with semantic-guided spatial refinement for salient object detection in optical remote sensing images
Zhu, Chenwei
Zhou, Xiaofei
Bao, Liuxin
Wang, Hongkui
Wang, Shuai
Zhu, Zunjie
Yan, Chenggang
Zhang, Jiyong
JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2024, 104

← 1 2 3 4 5 →