Multi-Stage Fusion and Multi-Source Attention Network for Multi-Modal Remote Sensing Image Segmentation

被引：17

作者：

Zhao, Jiaqi ^{[1
]}

Zhou, Yong ^{[1
]}

Shi, Boyu ^{[1
]}

Yang, Jingsong ^{[1
]}

Zhang, Di ^{[1
]}

Yao, Rui ^{[1
]}

机构：

[1] China Univ Min & Technol, Engn Res Ctr Mine Digitizat, Sch Comp Sci & Technol, Minist Educ Peoples Republ China, 1 Daxue Rd, Xuzhou, Jiangsu, Peoples R China

来源：

ACM TRANSACTIONS ON INTELLIGENT SYSTEMS AND TECHNOLOGY | 2021年 / 12卷 / 06期

关键词：

Semantic segmentation; multi-modal remote sensing images; attention; feature fusion;

D O I：

10.1145/3484440

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

With the rapid development of sensor technology, lots of remote sensing data have been collected. It effectively obtains good semantic segmentation performance by extracting feature maps based on multi-modal remote sensing images since extra modal data provides more information. How to make full use of multi-model remote sensing data for semantic segmentation is challenging. Toward this end, we propose a new network called Multi-Stage Fusion and Multi-Source Attention Network ((MS)(2)-Net) for multi-modal remote sensing data segmentation. The multi-stage fusion module fuses complementary information after calibrating the deviation information by filtering the noise from the multi-modal data. Besides, similar feature points are aggregated by the proposed multi-source attention for enhancing the discriminability of features with different modalities. The proposed model is evaluated on publicly available multi-modal remote sensing data sets, and results demonstrate the effectiveness of the proposed method.

引用

页数：20

共 45 条

[1] Abdulrahim Khairi, 2019, NEURAL COMPUT APPL, V2019, P1
[2] SegNet: A Deep Convolutional Encoder-Decoder Architecture for Image Segmentation
Badrinarayanan, Vijay
Kendall, Alex
Cipolla, Roberto
[J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2017, 39 (12) : 2481 - 2495
[3] The Lovasz-Softmax loss: A tractable surrogate for the optimization of the intersection-over-union measure in neural networks
Berman, Maxim
Triki, Amal Rannen
Blaschko, Matthew B.
[J]. 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 4413 - 4421
[4] Semantic Stereo for Incidental Satellite Images
Bosch, Marc
Foster, Kevin
Christie, Gordon
Wang, Sean
Hager, Gregory D.
Brown, Myron
[J]. 2019 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2019, : 1524 - 1532
[5] DeepDriving: Learning Affordance for Direct Perception in Autonomous Driving
Chen, Chenyi
Seff, Ari
Kornhauser, Alain
Xiao, Jianxiong
[J]. 2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, : 2722 - 2730
[6] Chen L.C., 2014, arXiv, V4, P357
[7] Encoder-Decoder with Atrous Separable Convolution for Semantic Image Segmentation
Chen, Liang-Chieh
Zhu, Yukun
Papandreou, George
Schroff, Florian
Adam, Hartwig
[J]. COMPUTER VISION - ECCV 2018, PT VII, 2018, 11211 : 833 - 851
[8] Chen LB, 2017, IEEE INT SYMP NANO, P1, DOI 10.1109/NANOARCH.2017.8053709
[9] Locality-Sensitive Deconvolution Networks with Gated Fusion for RGB-D Indoor Semantic Segmentation
Cheng, Yanhua
Cai, Rui
Li, Zhiwei
Zhao, Xin
Huang, Kaiqi
[J]. 30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 1475 - 1483
[10] Big Data for Remote Sensing: Challenges and Opportunities
Chi, Mingmin
Plaza, Antonio
Benediktsson, Jon Atli
Sun, Zhongyi
Shen, Jinsheng
Zhu, Yangyong
[J]. PROCEEDINGS OF THE IEEE, 2016, 104 (11) : 2207 - 2219

← 1 2 3 4 5 →