More Diverse Means Better: Multimodal Deep Learning Meets Remote-Sensing Imagery Classification

Cited by: 981
Authors
Hong, Danfeng [1]
Gao, Lianru [2]
Yokoya, Naoto [3, 4]
Yao, Jing [5]
Chanussot, Jocelyn [6, 7]
Du, Qian [8]
Zhang, Bing [2, 9]
Affiliations
[1] Univ Grenoble Alpes, CNRS, Grenoble INP, GIPSA Lab, F-38000 Grenoble, France
[2] Chinese Acad Sci, Aerosp Informat Res Inst, Key Lab Digital Earth Sci, Beijing 100094, Peoples R China
[3] Univ Tokyo, Grad Sch Frontier Sci, Chiba 2778561, Japan
[4] RIKEN, RIKEN Ctr Adv Intelligence Project AIP, Geoinformat Unit, Tokyo 1030027, Japan
[5] Xi An Jiao Tong Univ, Sch Math & Stat, Xian 710049, Peoples R China
[6] Univ Grenoble Alpes, INRIA, CNRS, Grenoble INP, LJK, F-38000 Grenoble, France
[7] Chinese Acad Sci, Aerosp Informat Res Inst, Beijing 100094, Peoples R China
[8] Mississippi State Univ, Dept Elect & Comp Engn, Starkville, MS 39762 USA
[9] Univ Chinese Acad Sci, Coll Resources & Environm, Beijing 100049, Peoples R China
Source
Funding
National Natural Science Foundation of China; Japan Society for the Promotion of Science;
Keywords
Classification; convolutional neural networks (CNNs); cross modality; deep learning (DL); feature learning; fusion; hyperspectral; light detection and ranging (LiDAR); multimodal; multispectral; network architecture; remote sensing (RS); synthetic aperture radar (SAR); CONVOLUTIONAL NEURAL-NETWORK; LAND-COVER; DATA FUSION; LIDAR DATA; MANIFOLD ALIGNMENT; FRAMEWORK;
DOI
10.1109/TGRS.2020.3016820
Chinese Library Classification number
P3 [Geophysics]; P59 [Geochemistry];
Subject classification code
0708 ; 070902 ;
Abstract
Classification and identification of the materials lying over or beneath the earth's surface have long been a fundamental but challenging research topic in geoscience and remote sensing (RS), and have garnered growing attention owing to recent advances in deep learning techniques. Although deep networks have been successfully applied to single-modality-dominated classification tasks, their performance inevitably hits a bottleneck in complex scenes that need to be finely classified, owing to limited information diversity. In this work, we provide a baseline solution to the aforementioned difficulty by developing a general multimodal deep learning (MDL) framework. In particular, we also investigate a special case of multi-modality learning (MML), namely cross-modality learning (CML), which exists widely in RS image classification applications. By focusing on "what," "where," and "how" to fuse, we show different fusion strategies as well as how to train deep networks and build the network architecture. Specifically, five fusion architectures are introduced and developed, and are further unified in our MDL framework. More significantly, our framework is not limited to pixel-wise classification tasks but is also applicable to spatial information modeling with convolutional neural networks (CNNs). To validate the effectiveness and superiority of the MDL framework, extensive experiments related to the settings of MML and CML are conducted on two different multimodal RS data sets. Furthermore, the codes and data sets will be available at https://github.com/danfenghong/IEEE_TGRS_MDLRS, contributing to the RS community.
Pages: 4340 - 4354
Page count: 15
Related papers
50 in total
  • [1] A UNIFIED MULTIMODAL DEEP LEARNING FRAMEWORK FOR REMOTE SENSING IMAGERY CLASSIFICATION
    Hong, Danfeng
    Gao, Lianru
    Wu, Xin
    Yao, Jing
    Yokoya, Naoto
    Zhang, Bing
    2021 11TH WORKSHOP ON HYPERSPECTRAL IMAGING AND SIGNAL PROCESSING: EVOLUTION IN REMOTE SENSING (WHISPERS), 2021,
  • [2] Deep Feature Reconstruction Learning for Open-Set Classification of Remote-Sensing Imagery
    Sun, Hao
    Li, Qianqian
    Yu, Jie
    Zhou, Dongbo
    Chen, Wenjing
    Zheng, Xiangtao
    Lu, Xiaoqiang
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2023, 20
  • [3] SPATIAL AND SPECTRAL CLASSIFICATION OF REMOTE-SENSING IMAGERY
    FRANKLIN, SE
    WILSON, BA
    COMPUTERS & GEOSCIENCES, 1991, 17 (08) : 1151 - 1172
  • [4] Land-Cover Classification Using Deep Learning with High-Resolution Remote-Sensing Imagery
    Fayaz, Muhammad
    Nam, Junyoung
    Dang, L. Minh
    Song, Hyoung-Kyu
    Moon, Hyeonjoon
    APPLIED SCIENCES-BASEL, 2024, 14 (05):
  • [5] Transfer Representation Learning Meets Multimodal Fusion Classification for Remote Sensing Images
    Ma, Mengru
    Ma, Wenping
    Jiao, Licheng
    Liu, Xu
    Liu, Fang
    Li, Lingling
    Yang, Shuyuan
    Hou, Biao
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60
  • [6] Online Bayesian Learning for Remote-Sensing Imagery Compression
    Zhang, Zizhuo
    Li, Shaoyang
    Tao, Xiaoming
    Dong, Linhao
    Lu, Jianhua
    2017 IEEE 85TH VEHICULAR TECHNOLOGY CONFERENCE (VTC SPRING), 2017,
  • [7] Cross-Domain Transfer Learning for Natural Scene Classification of Remote-Sensing Imagery
    Akhtar, Muhammad
    Murtza, Iqbal
    Adnan, Muhammad
    Saadia, Ayesha
    APPLIED SCIENCES-BASEL, 2023, 13 (13):
  • [8] Deep-learning-based information mining from ocean remote-sensing imagery
    Xiaofeng Li
    Bin Liu
    Gang Zheng
    Yibin Ren
    Shuangshang Zhang
    Yingjie Liu
    Le Gao
    Yuhai Liu
    Bin Zhang
    Fan Wang
    National Science Review, 2020, 7 (10) : 1584 - 1605
  • [9] A Deep Transfer Learning Framework Using Teacher-Student Structure for Land Cover Classification of Remote-Sensing Imagery
    Zhang, Xiaodong
    Li, Xianwei
    Chen, Guanzhou
    Liao, Puyun
    Wang, Tong
    Yang, Haobo
    He, Chanjuan
    Zhou, Wenlin
    Sun, Yufeng
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2024, 21 : 1 - 5
  • [10] Deep-learning-based information mining from ocean remote-sensing imagery
    Li X.
    Liu B.
    Zheng G.
    Ren Y.
    Zhang S.
    Liu Y.
    Gao L.
    Liu Y.
    Zhang B.
    Wang F.
    National Science Review, 2021, 7 (10) : 1584 - 1605