Urban Land Cover Classification With Missing Data Modalities Using Deep Convolutional Neural Networks

被引:43
作者
Kampffmeyer, Michael [1 ]
Salberg, Arnt-Borre [2 ]
Jenssen, Robert [1 ,2 ]
机构
[1] UiT Arctic Univ Norway, Machine Learning Grp, N-9019 Tromso, Norway
[2] Norwegian Comp Ctr, N-0373 Oslo, Norway
关键词
Convolutional neural networks (CNN); deep learning; land cover classification; missing data modalities; remote sensing; DATA FUSION; IMAGES;
D O I
10.1109/JSTARS.2018.2834961
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Automatic urban land cover classification is a fundamental problem in remote sensing, e.g., for environmental monitoring. The problem is highly challenging, as classes generally have high interclass and low intraclass variances. Techniques to improve urban land cover classification performance in remote sensing include fusion of data from different sensors with different data modalities. However, such techniques require all modalities to be available to the classifier in the decision-making process, i.e., at test time, as well as in training. If a data modality is missing at test time, current state-of-the-art approaches have in general no procedure available for exploiting information from these modalities. This represents a waste of potentially useful information. We propose as a remedy a convolutional neural network (CNN) architecture for urban land cover classification which is able to embed all available training modalities in the so-called hallucination network. The network will in effect replace missing data modalities in the test phase, enabling fusion capabilities even when data modalities are missing in testing. We demonstrate the method using two datasets consisting of optical and digital surface model (DSM) images. We simulate missing modalities by assuming that DSM images are missing during testing. Our method outperforms both standard CNNs trained only on optical images as well as an ensemble of two standard CNNs. We further evaluate the potential of our method to handle situations where only some DSM images are missing during testing. Overall, we show that we can clearly exploit training time information of the missing modality during testing.
引用
收藏
页码:1758 / 1768
页数:11
相关论文
共 36 条
[1]   Land Cover Classification with Multi-Sensor Fusion of Partly Missing Data [J].
Aksoy, Selim ;
Koperski, Krzysztof ;
Tusk, Carsten ;
Marchisio, Giovanni .
PHOTOGRAMMETRIC ENGINEERING AND REMOTE SENSING, 2009, 75 (05) :577-593
[2]  
[Anonymous], 2015, ARXIV PREPRINT ARXIV
[3]  
[Anonymous], ISPRS 2D SEMANTIC LA
[4]  
[Anonymous], 2012, ARXIV E PRINTS
[5]  
[Anonymous], 2015, PROC CVPR IEEE
[6]  
[Anonymous], 2017, 2017 JOINT URBAN REM, DOI DOI 10.1109/JURSE.2017.7924566
[7]  
[Anonymous], 2017, IEEE T PATTERN ANAL, DOI DOI 10.1109/TPAMI.2016.2644615
[8]  
[Anonymous], P IEEE C COMP VIS PA
[9]   Segment-before-Detect: Vehicle Detection and Classification through Semantic Segmentation of Aerial Images [J].
Audebert, Nicolas ;
Le Saux, Bertrand ;
Lefevre, Sebastien .
REMOTE SENSING, 2017, 9 (04)
[10]   The Time Variable in Data Fusion: A Change Detection Perspective [J].
Bovolo, Francesca ;
Bruzzone, Lorenzo .
IEEE GEOSCIENCE AND REMOTE SENSING MAGAZINE, 2015, 3 (03) :8-26