Multitask Collaborative Multi-modal Remote Sensing Target Segmentation Algorithm

被引：0

作者：

Mao, Xiuhua ^{[1
]}

Zhang, Qiang ^{[1
]}

Ruan, Hang ^{[1
]}

Yang, Yuang ^{[1
]}

机构：

[1] (Beijing Institute of Tracking and Telecommunications Technology, Beijing 100094, China) (National Key Laboratory of Space Integrated Information System

来源：

Dianzi Yu Xinxi Xuebao/Journal of Electronics and Information Technology | 2024年 / 46卷 / 08期

关键词：

Deep learning; Elevation estimation; Multi-modal data; Remote sensing images; Semantic segmentation;

D O I：

10.11999/JEIT231267

中图分类号：

学科分类号：

摘要：

The use of semantic segmentation technology to extract high-resolution remote sensing image object segmentation has important application prospects. With the rapid development of multi-sensor technology, the good complementary advantages between multimodal remote sensing images have received widespread attention, and joint analysis of them has become a research hotspot. This article analyzes both optical remote sensing images and elevation data, and proposes a multi-task collaborative model based on multimodal remote sensing data (United Refined PSPNet, UR-PSPNet) to address the issue of insufficient fusion classification accuracy of the two types of data due to insufficient fully registered elevation data in real scenarios. This model extracts deep features of optical images, predicts semantic labels and elevation values, and embeds elevation data as supervised information, to improve the accuracy of target segmentation. This article designs a comparative experiment based on ISPRS, which proves that this algorithm can better fuse multimodal data features and improve the accuracy of object segmentation in optical remote sensing images. © 2024 Science Press. All rights reserved.

引用

页码：3363 / 3371

页数：8

共 15 条

[1] LI Shutao, LI Congyu, KANG Xudong, Development status and future prospects of multi-source remote sensing image fusion[J], National Remote Sensing Bulletin, 25, 1, pp. 148-166, (2021)
[2] QIN Rongjun, FANG Wei, A hierarchical building detection method for very high resolution remotely sensed images combined with DSM using graph cut optimization[J], Photogrammetric Engineering & Remote Sensing, 80, 9, pp. 873-883, (2014)
[3] LONG J, SHELHAMER E, DARRELL T., Fully convolutional networks for semantic segmentation[C], 2015 IEEE Conference on Computer Vision and Pattern Recognition, pp. 3431-3440, (2015)
[4] ZHAO Hengshuang, SHI Jianping, QI Xiaojuan, Et al., Pyramid scene parsing network[C], 2017 IEEE Conference on Computer Vision and Pattern Recognition, pp. 6230-6239, (2017)
[5] Lichao MOU, ZHU Xiaoxiang, IM2HEIGHT: Height estimation from single monocular imagery via fully residual convolutional-deconvolutional network[J], (2018)
[6] GHAMISI P, YOKOYA N., IMG2DSM: Height simulation from single imagery using conditional generative adversarial net[J], IEEE Geoscience and Remote Sensing Letters, 15, 5, pp. 794-798, (2018)
[7] YUAN Min, REN Dingbang, FENG Qisheng, Et al., MCAFNet: A multiscale channel attention fusion network for semantic segmentation of remote sensing images, Remote Sensing, 15, 2, (2023)
[8] WENG Liguo, PANG Kai, XIA Min, Et al., Sgformer: A local and global features coupling network for semantic segmentation of land cover[J], IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing, 16, pp. 6812-6824, (2023)
[9] HAO Xuejie, YIN Lizeyan, LI Xiuhong, Et al., A multiobjective semantic segmentation algorithm based on improved U-Net networks, Remote Sensing, 15, 7, (2023)
[10] Ning LV, ZHANG Zenghui, LI Cong, Et al., A hybrid-attention semantic segmentation network for remote sensing interpretation in land-use surveillance[J], International Journal of Machine Learning and Cybernetics, 14, 2, pp. 395-406, (2023)

← 1 2 →