More Diverse Means Better: Multimodal Deep Learning Meets Remote-Sensing Imagery Classification

Cited by: 981
Authors
Hong, Danfeng [1]
Gao, Lianru [2]
Yokoya, Naoto [3, 4]
Yao, Jing [5]
Chanussot, Jocelyn [6, 7]
Du, Qian [8]
Zhang, Bing [2, 9]
Affiliations
[1] Univ Grenoble Alpes, CNRS, Grenoble INP, GIPSA Lab, F-38000 Grenoble, France
[2] Chinese Acad Sci, Aerosp Informat Res Inst, Key Lab Digital Earth Sci, Beijing 100094, Peoples R China
[3] Univ Tokyo, Grad Sch Frontier Sci, Chiba 2778561, Japan
[4] RIKEN, RIKEN Ctr Adv Intelligence Project AIP, Geoinformat Unit, Tokyo 1030027, Japan
[5] Xi An Jiao Tong Univ, Sch Math & Stat, Xian 710049, Peoples R China
[6] Univ Grenoble Alpes, INRIA, CNRS, Grenoble INP, LJK, F-38000 Grenoble, France
[7] Chinese Acad Sci, Aerosp Informat Res Inst, Beijing 100094, Peoples R China
[8] Mississippi State Univ, Dept Elect & Comp Engn, Starkville, MS 39762 USA
[9] Univ Chinese Acad Sci, Coll Resources & Environm, Beijing 100049, Peoples R China
Source
Funding
National Natural Science Foundation of China; Japan Society for the Promotion of Science;
Keywords
Classification; convolutional neural networks (CNNs); cross modality; deep learning (DL); feature learning; fusion; hyperspectral; light detection and ranging (LiDAR); multimodal; multispectral; network architecture; remote sensing (RS); synthetic aperture radar (SAR); CONVOLUTIONAL NEURAL-NETWORK; LAND-COVER; DATA FUSION; LIDAR DATA; MANIFOLD ALIGNMENT; FRAMEWORK;
DOI
10.1109/TGRS.2020.3016820
Chinese Library Classification (CLC) number
P3 [Geophysics]; P59 [Geochemistry];
Discipline classification code
0708; 070902;
Abstract
Classification and identification of the materials lying over or beneath the earth's surface have long been a fundamental but challenging research topic in geoscience and remote sensing (RS), and have attracted growing attention owing to recent advances in deep learning techniques. Although deep networks have been successfully applied in single-modality-dominated classification tasks, their performance inevitably hits a bottleneck in complex scenes that need to be finely classified, because of limited information diversity. In this work, we provide a baseline solution to this difficulty by developing a general multimodal deep learning (MDL) framework. In particular, we also investigate a special case of multi-modality learning (MML), namely cross-modality learning (CML), that exists widely in RS image classification applications. By focusing on "what," "where," and "how" to fuse, we show different fusion strategies as well as how to train deep networks and build the network architecture. Specifically, five fusion architectures are introduced and developed, and are further unified in our MDL framework. More significantly, our framework is not limited to pixel-wise classification tasks but is also applicable to spatial information modeling with convolutional neural networks (CNNs). To validate the effectiveness and superiority of the MDL framework, extensive experiments related to the settings of MML and CML are conducted on two different multimodal RS data sets. Furthermore, the codes and data sets will be available at https://github.com/danfenghong/IEEE_TGRS_MDLRS, contributing to the RS community.
Pages: 4340-4354
Number of pages: 15