More Diverse Means Better: Multimodal Deep Learning Meets Remote-Sensing Imagery Classification

被引:981
|
作者
Hong, Danfeng [1 ]
Gao, Lianru [2 ]
Yokoya, Naoto [3 ,4 ]
Yao, Jing [5 ]
Chanussot, Jocelyn [6 ,7 ]
Du, Qian [8 ]
Zhang, Bing [2 ,9 ]
机构
[1] Univ Grenoble Alpes, CNRS, Grenoble INP, GIPSA Lab, F-38000 Grenoble, France
[2] Chinese Acad Sci, Aerosp Informat Res Inst, Key Lab Digital Earth Sci, Beijing 100094, Peoples R China
[3] Univ Tokyo, Grad Sch Frontier Sci, Chiba 2778561, Japan
[4] RIKEN, RIKEN Ctr Adv Intelligence Project AIP, Geoinformat Unit, Tokyo 1030027, Japan
[5] Xi An Jiao Tong Univ, Sch Math & Stat, Xian 710049, Peoples R China
[6] Univ Grenoble Alpes, INRIA, CNRS, Grenoble INP,LJK, F-38000 Grenoble, France
[7] Chinese Acad Sci, Aerosp Informat Res Inst, Beijing 100094, Peoples R China
[8] Mississippi State Univ, Dept Elect & Comp Engn, Starkville, MS 39762 USA
[9] Univ Chinese Acad Sci, Coll Resources & Environm, Beijing 100049, Peoples R China
来源
基金
中国国家自然科学基金; 日本学术振兴会;
关键词
Classification; convolutional neural networks (CNNS); cross modality; deep learning (DL); feature learning; fusion; hyperspectral; light detection and ranging (LiDAR); multimodal; multispectral; network architecture; remote sensing (RS); synthetic aperture radar (SAR); CONVOLUTIONAL NEURAL-NETWORK; LAND-COVER; DATA FUSION; LIDAR DATA; MANIFOLD ALIGNMENT; FRAMEWORK;
D O I
10.1109/TGRS.2020.3016820
中图分类号
P3 [地球物理学]; P59 [地球化学];
学科分类号
0708 ; 070902 ;
摘要
Classification and identification of the materials lying over or beneath the earth's surface have long been a fundamental but challenging research topic in geoscience and remote sensing (RS), and have garnered a growing concern owing to the recent advancements of deep learning techniques. Although deep networks have been successfully applied in single-modality-dominated classification tasks, yet their performance inevitably meets the bottleneck in complex scenes that need to be finely classified, due to the limitation of information diversity. In this work, we provide a baseline solution to the aforementioned difficulty by developing a general multimodal deep learning (MDL) framework. In particular, we also investigate a special case of multi-modality learning (MML)-cross-modality learning (CML) that exists widely in RS image classification applications. By focusing on "what," "where," and "how" to fuse, we show different fusion strategies as well as how to train deep networks and build the network architecture. Specifically, five fusion architectures are introduced and developed, further being unified in our MDL framework. More significantly, our framework is not only limited to pixel-wise classification tasks but also applicable to spatial information modeling with convolutional neural networks (CNNs). To validate the effectiveness and superiority of the MDL framework, extensive experiments related to the settings of MML and CML are conducted on two different multimodal RS data sets. Furthermore, the codes and data sets will be available at https://github.com/danfenghong/IEEE_TGRS_MDLRS, contributing to the RS community.
引用
收藏
页码:4340 / 4354
页数:15
相关论文
共 50 条
  • [21] Remote Sensing Image Scene Classification Meets Deep Learning: Challenges, Methods, Benchmarks, and Opportunities
    Cheng, Gong
    Xie, Xingxing
    Han, Junwei
    Guo, Lei
    Xia, Gui-Song
    IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2020, 13 : 3735 - 3756
  • [22] An Improved Rotation Forest for Multi-Feature Remote-Sensing Imagery Classification
    Xiu, Yingchang
    Liu, Wenbao
    Yang, Wenjing
    REMOTE SENSING, 2017, 9 (11)
  • [23] When Deep Learning Meets Metric Learning: Remote Sensing Image Scene Classification via Learning Discriminative CNNs
    Cheng, Gong
    Yang, Ceyuan
    Yao, Xiwen
    Guo, Lei
    Han, Junwei
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2018, 56 (05): : 2811 - 2821
  • [24] Scale-aware deep reinforcement learning for high resolution remote sensing imagery classification
    Liu, Yinhe
    Zhong, Yanfei
    Shi, Sunan
    Zhang, Liangpei
    ISPRS JOURNAL OF PHOTOGRAMMETRY AND REMOTE SENSING, 2024, 209 : 296 - 311
  • [25] Enhancing land cover classification in remote sensing imagery using an optimal deep learning model
    Motwake, Abdelwahed
    Hashim, Aisha Hassan Abdalla
    Obayya, Marwa
    Eltahir, Majdy M.
    AIMS MATHEMATICS, 2024, 9 (01): : 140 - 159
  • [26] Extraction of Urban Water Bodies from High-Resolution Remote-Sensing Imagery Using Deep Learning
    Chen, Yang
    Fan, Rongshuang
    Yang, Xiucheng
    Wang, Jingxue
    Latif, Aamir
    WATER, 2018, 10 (05)
  • [27] Feature Fusion with Deep Supervision for Remote-Sensing Image Scene Classification
    Muhammad, Usman
    Wang, Weiqiang
    Hadid, Abdenour
    2018 IEEE 30TH INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE (ICTAI), 2018, : 249 - 253
  • [28] A review on remote sensing imagery augmentation using deep learning
    Lalitha, V
    Latha, B.
    MATERIALS TODAY-PROCEEDINGS, 2022, 62 : 4772 - 4778
  • [29] Weakly Supervised Deep Learning for Segmentation of Remote Sensing Imagery
    Wang, Sherrie
    Chen, William
    Xie, Sang Michael
    Azzari, George
    Lobell, David B.
    REMOTE SENSING, 2020, 12 (02)
  • [30] An incremental-learning neural network for the classification of remote-sensing images
    Bruzzone, L
    Prieto, DF
    PATTERN RECOGNITION LETTERS, 1999, 20 (11-13) : 1241 - 1248