Multi-Stage Fusion and Multi-Source Attention Network for Multi-Modal Remote Sensing Image Segmentation

被引:17
作者
Zhao, Jiaqi [1 ]
Zhou, Yong [1 ]
Shi, Boyu [1 ]
Yang, Jingsong [1 ]
Zhang, Di [1 ]
Yao, Rui [1 ]
机构
[1] China Univ Min & Technol, Engn Res Ctr Mine Digitizat, Sch Comp Sci & Technol, Minist Educ Peoples Republ China, 1 Daxue Rd, Xuzhou, Jiangsu, Peoples R China
关键词
Semantic segmentation; multi-modal remote sensing images; attention; feature fusion;
D O I
10.1145/3484440
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
With the rapid development of sensor technology, lots of remote sensing data have been collected. It effectively obtains good semantic segmentation performance by extracting feature maps based on multi-modal remote sensing images since extra modal data provides more information. How to make full use of multi-model remote sensing data for semantic segmentation is challenging. Toward this end, we propose a new network called Multi-Stage Fusion and Multi-Source Attention Network ((MS)(2)-Net) for multi-modal remote sensing data segmentation. The multi-stage fusion module fuses complementary information after calibrating the deviation information by filtering the noise from the multi-modal data. Besides, similar feature points are aggregated by the proposed multi-source attention for enhancing the discriminability of features with different modalities. The proposed model is evaluated on publicly available multi-modal remote sensing data sets, and results demonstrate the effectiveness of the proposed method.
引用
收藏
页数:20
相关论文
共 45 条
  • [1] Abdulrahim Khairi, 2019, NEURAL COMPUT APPL, V2019, P1
  • [2] SegNet: A Deep Convolutional Encoder-Decoder Architecture for Image Segmentation
    Badrinarayanan, Vijay
    Kendall, Alex
    Cipolla, Roberto
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2017, 39 (12) : 2481 - 2495
  • [3] The Lovasz-Softmax loss: A tractable surrogate for the optimization of the intersection-over-union measure in neural networks
    Berman, Maxim
    Triki, Amal Rannen
    Blaschko, Matthew B.
    [J]. 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 4413 - 4421
  • [4] Semantic Stereo for Incidental Satellite Images
    Bosch, Marc
    Foster, Kevin
    Christie, Gordon
    Wang, Sean
    Hager, Gregory D.
    Brown, Myron
    [J]. 2019 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2019, : 1524 - 1532
  • [5] DeepDriving: Learning Affordance for Direct Perception in Autonomous Driving
    Chen, Chenyi
    Seff, Ari
    Kornhauser, Alain
    Xiao, Jianxiong
    [J]. 2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, : 2722 - 2730
  • [6] Chen L.C., 2014, arXiv, V4, P357
  • [7] Encoder-Decoder with Atrous Separable Convolution for Semantic Image Segmentation
    Chen, Liang-Chieh
    Zhu, Yukun
    Papandreou, George
    Schroff, Florian
    Adam, Hartwig
    [J]. COMPUTER VISION - ECCV 2018, PT VII, 2018, 11211 : 833 - 851
  • [8] Chen LB, 2017, IEEE INT SYMP NANO, P1, DOI 10.1109/NANOARCH.2017.8053709
  • [9] Locality-Sensitive Deconvolution Networks with Gated Fusion for RGB-D Indoor Semantic Segmentation
    Cheng, Yanhua
    Cai, Rui
    Li, Zhiwei
    Zhao, Xin
    Huang, Kaiqi
    [J]. 30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 1475 - 1483
  • [10] Big Data for Remote Sensing: Challenges and Opportunities
    Chi, Mingmin
    Plaza, Antonio
    Benediktsson, Jon Atli
    Sun, Zhongyi
    Shen, Jinsheng
    Zhu, Yangyong
    [J]. PROCEEDINGS OF THE IEEE, 2016, 104 (11) : 2207 - 2219