Knowledge-Guided Multi-Task Network for Remote Sensing Imagery

被引:0
作者
Li, Meixuan [1 ]
Wang, Guoqing [1 ]
Li, Tianyu [1 ]
Yang, Yang [1 ]
Li, Wei [2 ]
Liu, Xun [2 ]
Liu, Ying [2 ]
机构
[1] Univ Elect Sci & Technol China, Ctr Future Media, Chengdu 611731, Peoples R China
[2] Beijing Inst Space Mech & Elect, Dept Imaging Technol, Beijing 100094, Peoples R China
基金
中国国家自然科学基金;
关键词
multi-task learning; remote sensing; semantic segmentation; height estimation; edge detection; SEMANTIC SEGMENTATION; HEIGHT ESTIMATION; OBJECT DETECTION;
D O I
10.3390/rs17030496
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
Semantic segmentation and height estimation tasks in remote sensing imagery exhibit distinctive characteristics, including scale sensitivity, category imbalance, and insufficient fine details. Recent approaches have leveraged multi-task learning methods to jointly predict these tasks along with auxiliary tasks, such as edge detection, to improve the accuracy of fine-grained details. However, most approaches only acquire knowledge from auxiliary tasks, disregarding the inter-task knowledge guidance across all tasks. To address these challenges, we propose KMNet, a novel architecture referred to as a knowledge-guided multi-task network, which can be applied to different primary and auxiliary task combinations. KMNet employs a multi-scale methodology to extract feature information from the input image. Subsequently, the architecture incorporates the multi-scale knowledge-guided fusion (MKF) module, which is designed to generate a comprehensive knowledge bank serving as a resource for guiding the feature fusion process. The knowledge-guided fusion feature is then utilized to generate the final predictions for the primary tasks. Comprehensive experiments conducted on two publicly available remote sensing datasets, namely the Potsdam dataset and the Vaihingen dataset, demonstrate the effectiveness of the proposed method in achieving impressive performance on both semantic segmentation and height estimation tasks. Codes, pre-trained models, and more results will be publicly available.
引用
收藏
页数:19
相关论文
共 84 条
[1]   Height estimation from single aerial images using a deep convolutional encoder-decoder network [J].
Amirkolaee, Hamed Amini ;
Arefi, Hossein .
ISPRS JOURNAL OF PHOTOGRAMMETRY AND REMOTE SENSING, 2019, 149 :50-66
[2]  
[Anonymous], 2014, Semantic Labeling Contest
[3]   SegNet: A Deep Convolutional Encoder-Decoder Architecture for Image Segmentation [J].
Badrinarayanan, Vijay ;
Kendall, Alex ;
Cipolla, Roberto .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2017, 39 (12) :2481-2495
[4]  
Bergstra J.S., 2011, ADV NEURAL INFORM PR, DOI DOI 10.5555/2986459.2986743
[5]   Digital terrain models derived from digital surface model uniform regions in urban areas [J].
Beumier, Charles ;
Idrissa, Mahamadou .
INTERNATIONAL JOURNAL OF REMOTE SENSING, 2016, 37 (15) :3477-3493
[6]   Multitask learning [J].
Caruana, R .
MACHINE LEARNING, 1997, 28 (01) :41-75
[7]   Multitask Learning of Height and Semantics From Aerial Images [J].
Carvalho, Marcela ;
Le Saux, Bertrand ;
Trouve-Peloux, Pauline ;
Champagnat, Frederic ;
Almansa, Andres .
IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2020, 17 (08) :1391-1395
[8]  
Carvalho M, 2018, IEEE IMAGE PROC, P2915, DOI 10.1109/ICIP.2018.8451312
[9]   MP3: A Unified Model to Map, Perceive, Predict and Plan [J].
Casas, Sergio ;
Sadat, Abbas ;
Urtasun, Raquel .
2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, :14398-14407
[10]   Encoder-Decoder with Atrous Separable Convolution for Semantic Image Segmentation [J].
Chen, Liang-Chieh ;
Zhu, Yukun ;
Papandreou, George ;
Schroff, Florian ;
Adam, Hartwig .
COMPUTER VISION - ECCV 2018, PT VII, 2018, 11211 :833-851