More Diverse Means Better: Multimodal Deep Learning Meets Remote-Sensing Imagery Classification

被引:981
|
作者
Hong, Danfeng [1 ]
Gao, Lianru [2 ]
Yokoya, Naoto [3 ,4 ]
Yao, Jing [5 ]
Chanussot, Jocelyn [6 ,7 ]
Du, Qian [8 ]
Zhang, Bing [2 ,9 ]
机构
[1] Univ Grenoble Alpes, CNRS, Grenoble INP, GIPSA Lab, F-38000 Grenoble, France
[2] Chinese Acad Sci, Aerosp Informat Res Inst, Key Lab Digital Earth Sci, Beijing 100094, Peoples R China
[3] Univ Tokyo, Grad Sch Frontier Sci, Chiba 2778561, Japan
[4] RIKEN, RIKEN Ctr Adv Intelligence Project AIP, Geoinformat Unit, Tokyo 1030027, Japan
[5] Xi An Jiao Tong Univ, Sch Math & Stat, Xian 710049, Peoples R China
[6] Univ Grenoble Alpes, INRIA, CNRS, Grenoble INP,LJK, F-38000 Grenoble, France
[7] Chinese Acad Sci, Aerosp Informat Res Inst, Beijing 100094, Peoples R China
[8] Mississippi State Univ, Dept Elect & Comp Engn, Starkville, MS 39762 USA
[9] Univ Chinese Acad Sci, Coll Resources & Environm, Beijing 100049, Peoples R China
来源
基金
中国国家自然科学基金; 日本学术振兴会;
关键词
Classification; convolutional neural networks (CNNS); cross modality; deep learning (DL); feature learning; fusion; hyperspectral; light detection and ranging (LiDAR); multimodal; multispectral; network architecture; remote sensing (RS); synthetic aperture radar (SAR); CONVOLUTIONAL NEURAL-NETWORK; LAND-COVER; DATA FUSION; LIDAR DATA; MANIFOLD ALIGNMENT; FRAMEWORK;
D O I
10.1109/TGRS.2020.3016820
中图分类号
P3 [地球物理学]; P59 [地球化学];
学科分类号
0708 ; 070902 ;
摘要
Classification and identification of the materials lying over or beneath the earth's surface have long been a fundamental but challenging research topic in geoscience and remote sensing (RS), and have garnered a growing concern owing to the recent advancements of deep learning techniques. Although deep networks have been successfully applied in single-modality-dominated classification tasks, yet their performance inevitably meets the bottleneck in complex scenes that need to be finely classified, due to the limitation of information diversity. In this work, we provide a baseline solution to the aforementioned difficulty by developing a general multimodal deep learning (MDL) framework. In particular, we also investigate a special case of multi-modality learning (MML)-cross-modality learning (CML) that exists widely in RS image classification applications. By focusing on "what," "where," and "how" to fuse, we show different fusion strategies as well as how to train deep networks and build the network architecture. Specifically, five fusion architectures are introduced and developed, further being unified in our MDL framework. More significantly, our framework is not only limited to pixel-wise classification tasks but also applicable to spatial information modeling with convolutional neural networks (CNNs). To validate the effectiveness and superiority of the MDL framework, extensive experiments related to the settings of MML and CML are conducted on two different multimodal RS data sets. Furthermore, the codes and data sets will be available at https://github.com/danfenghong/IEEE_TGRS_MDLRS, contributing to the RS community.
引用
收藏
页码:4340 / 4354
页数:15
相关论文
共 50 条
  • [31] Classification of hyperspectral remote-sensing images based on sparse manifold learning
    Huang, Hong
    Huang, H. (hhuang.cqu@gmail.com), 1600, SPIE (07):
  • [32] Spatial contextual Gaussian process learning for remote-sensing image classification
    Hassouna, Houda
    Melgani, Farid
    Mokhtari, Zouhir
    REMOTE SENSING LETTERS, 2015, 6 (07) : 519 - 528
  • [33] Classification of hyperspectral remote-sensing images based on sparse manifold learning
    Huang, Hong
    JOURNAL OF APPLIED REMOTE SENSING, 2013, 7
  • [34] Joint Distance Transfer Metric Learning for Remote-Sensing Image Classification
    Dong, Yanni
    Liang, Tianyang
    Yang, Cong
    Luo, Hui
    Zhang, Yuxiang
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2022, 19
  • [35] Deep Symmetric Fusion Transformer for Multimodal Remote Sensing Data Classification
    Chang, Honghao
    Bi, Haixia
    Li, Fan
    Xu, Chen
    Chanussot, Jocelyn
    Hong, Danfeng
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2024, 62
  • [36] Deep learning for remote sensing image classification: A survey
    Li, Ying
    Zhang, Haokui
    Xue, Xizhe
    Jiang, Yenan
    Shen, Qiang
    WILEY INTERDISCIPLINARY REVIEWS-DATA MINING AND KNOWLEDGE DISCOVERY, 2018, 8 (06)
  • [37] Multimodal Ground-Based Remote Sensing Cloud Classification via Learning Heterogeneous Deep Features
    Liu, Shuang
    Duan, Linlin
    Zhang, Zhong
    Cao, Xiaozhong
    Durrani, Tariq S.
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2020, 58 (11): : 7790 - 7800
  • [38] More Diverse Training, Better Compositionality! Evidence from Multimodal Language Learning
    Volquardsen, Caspar
    Lee, Jae Hee
    Weber, Cornelius
    Wermter, Stefan
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING - ICANN 2022, PT III, 2022, 13531 : 417 - 428
  • [39] Machine Learning-Based Lithological Mapping from ASTER Remote-Sensing Imagery
    Bahrami, Hazhir
    Esmaeili, Pouya
    Homayouni, Saeid
    Pour, Amin Beiranvand
    Chokmani, Karem
    Bahroudi, Abbas
    MINERALS, 2024, 14 (02)
  • [40] A neural architecture for the classification of Remote Sensing imagery with advanced learning algorithms
    Goncalves, ML
    Netto, MLD
    Zullo, J
    NEURAL NETWORKS FOR SIGNAL PROCESSING VIII, 1998, : 577 - 586