Distilling Knowledge From Super-Resolution for Efficient Remote Sensing Salient Object Detection

被引:47
作者
Liu, Yanfeng [1 ,2 ]
Xiong, Zhitong [3 ]
Yuan, Yuan [2 ]
Wang, Qi [2 ]
机构
[1] Northwestern Polytech Univ, Sch Comp Sci, Xian 710072, Peoples R China
[2] Northwestern Polytech Univ, Sch Artificial Intelligence Opt & Elect iOPEN, Xian 710072, Peoples R China
[3] Tech Univ Munich TUM, Chair Data Sci Earth Observat, D-80333 Munich, Germany
来源
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING | 2023年 / 61卷
基金
中国国家自然科学基金;
关键词
Task analysis; Superresolution; Remote sensing; Optical sensors; Optical imaging; Decoding; Convolution; Auxiliary super-resolution (SR); cross-task knowledge transfer; multitask learning (MTL); optical remote sensing image (RSI); salient object detection (SOD); NETWORK;
D O I
10.1109/TGRS.2023.3267271
中图分类号
P3 [地球物理学]; P59 [地球化学];
学科分类号
0708 ; 070902 ;
摘要
Current state-of-the-art remote sensing salient object detectors always require high-resolution spatial context to ensure excellent performance, which incurs enormous computation costs and hinders real-time efficiency. In this work, we propose a universal super-resolution-assisted learning (SRAL) framework to boost performance and accelerate the inference efficiency of existing approaches. To this end, we propose to reduce the spatial resolution of the input remote sensing images (RSIs), which is model-agnostic and can be applied to existing algorithms without extra computation cost. Specifically, a transposed saliency detection decoder (TSDD) is designed to upsample interim features progressively. On top of it, an auxiliary SR decoder (ASRD) is proposed to build a multitask learning (MTL) framework to investigate an efficient complementary paradigm of saliency detection and SR. Furthermore, a novel task-fusion guidance module (TFGM) is proposed to effectively distill domain knowledge from the SR auxiliary task to the salient object detection task in optical RSIs. The presented ASRD and TFGM can be omitted in the inference phase without any extra computational budget. Extensive experiments on three datasets show that the presented SRAL with 224 x 224 input is superior to more than 20 algorithms. Moreover, it can be successfully generalized to existing typical networks with significant accuracy improvements in a parameter-free manner. Codes and models are available at https://github.com/lyf0801/SRAL.
引用
收藏
页数:16
相关论文
共 74 条
[31]  
Liu M., 2022, IEEE T GEOSCI ELECT, V60, P1
[32]   End-to-End Multi-Task Learning with Attention [J].
Liu, Shikun ;
Johns, Edward ;
Davison, Andrew J. .
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :1871-1880
[33]   SINGLE-SHOT BALANCED DETECTOR FOR GEOSPATIAL OBJECT DETECTION [J].
Liu, Yanfeng ;
Li, Qiang ;
Yuan, Yuan ;
Wang, Qi .
2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, :2529-2533
[34]   ABNet: Adaptive Balanced Network for Multiscale Object Detection in Remote Sensing Imagery [J].
Liu, Yanfeng ;
Li, Qiang ;
Yuan, Yuan ;
Du, Qian ;
Wang, Qi .
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60
[35]   Dual-Frequency Autoencoder for Anomaly Detection in Transformed Hyperspectral Imagery [J].
Liu, Yidan ;
Xie, Weiying ;
Li, Yunsong ;
Li, Zan ;
Du, Qian .
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60
[36]   SAMNet: Stereoscopically Attentive Multi-Scale Network for Lightweight Salient Object Detection [J].
Liu, Yun ;
Zhang, Xin-Yu ;
Bian, Jia-Wang ;
Zhang, Le ;
Cheng, Ming-Ming .
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2021, 30 :3804-3814
[37]   Lightweight Salient Object Detection via Hierarchical Visual Perception Learning [J].
Liu, Yun ;
Gu, Yu-Chao ;
Zhang, Xin-Yu ;
Wang, Weiwei ;
Cheng, Ming-Ming .
IEEE TRANSACTIONS ON CYBERNETICS, 2021, 51 (09) :4439-4449
[38]   Instance-Aware Dynamic Neural Network Quantization [J].
Liu, Zhenhua ;
Wang, Yunhe ;
Han, Kai ;
Ma, Siwei ;
Gao, Wen .
2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2022, :12424-12433
[39]   Non-Local Deep Features for Salient Object Detection [J].
Luo, Zhiming ;
Mishra, Akshaya ;
Achkar, Andrew ;
Eichel, Justin ;
Li, Shaozi ;
Jodoin, Pierre-Marc .
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :6593-6601
[40]   Deep Face Super-Resolution with Iterative Collaboration between Attentive Recovery and Landmark Estimation [J].
Ma, Cheng ;
Jiang, Zhenyu ;
Rao, Yongming ;
Lu, Jiwen ;
Zhou, Jie .
2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, :5568-5577