Multi-Temporal Scene Classification and Scene Change Detection With Correlation Based Fusion

被引:70
作者
Ru, Lixiang [1 ,2 ]
Du, Bo [1 ]
Wu, Chen [2 ,3 ]
机构
[1] Wuhan Univ, Natl Engn Res Ctr Multimedia Software, Sch Comp Sci, Inst Artificial Intelligence, Wuhan 430072, Peoples R China
[2] Wuhan Univ, Hubei Key Lab Multimedia & Network Commun Engn, Wuhan 430072, Peoples R China
[3] Wuhan Univ, State Key Lab Informat Engn Surveying Mapping & R, Wuhan 430072, Peoples R China
基金
中国国家自然科学基金;
关键词
Feature extraction; Correlation; Remote sensing; Semantics; Task analysis; Training; Spatial resolution; Change detection; scene change detection; multi-temporal scene classification; canonical correlation analysis; convolutional neural network; IMAGES; SEGMENTATION;
D O I
10.1109/TIP.2020.3039328
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Classifying multi-temporal scene land-use categories and detecting their semantic scene-level changes for remote sensing imagery covering urban regions could straightly reflect the land-use transitions. Existing methods for scene change detection rarely focus on the temporal correlation of bi-temporal features, and are mainly evaluated on small scale scene change detection datasets. In this work, we proposed a CorrFusion module that fuses the highly correlated components in bi-temporal feature embeddings. We first extract the deep representations of the bi-temporal inputs with deep convolutional networks. Then the extracted features will be projected into a lower-dimensional space to extract the most correlated components and compute the instance-level correlation. The cross-temporal fusion will be performed based on the computed correlation in CorrFusion module. The final scene classification results are obtained with softmax layers. In the objective function, we introduced a new formulation to calculate the temporal correlation more efficiently and stably. The detailed derivation of backpropagation gradients for the proposed module is also given. Besides, we presented a much larger scale scene change detection dataset with more semantic categories and conducted extensive experiments on this dataset. The experimental results demonstrated that our proposed CorrFusion module could remarkably improve the multi-temporal scene classification and scene change detection results.
引用
收藏
页码:1382 / 1394
页数:13
相关论文
共 51 条
[1]  
Abadi M, 2016, ACM SIGPLAN NOTICES, V51, P1, DOI [10.1145/3022670.2976746, 10.1145/2951913.2976746]
[2]  
Andrew G., 2013, PMLR, V28, P1247
[3]   A Multiple-Instance Densely-Connected ConvNet for Aerial Scene Classification [J].
Bi, Qi ;
Qin, Kun ;
Li, Zhili ;
Zhang, Han ;
Xu, Kai ;
Xia, Gui-Song .
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2020, 29 :4911-4926
[4]   Scalable and Effective Deep CCA via Soft Decorrelation [J].
Chang, Xiaobin ;
Xiang, Tao ;
Hospedales, Timothy M. .
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, :1488-1497
[5]   Remote Sensing Image Scene Classification: Benchmark and State of the Art [J].
Cheng, Gong ;
Han, Junwei ;
Lu, Xiaoqiang .
PROCEEDINGS OF THE IEEE, 2017, 105 (10) :1865-1883
[6]  
Cogswell M., 2016, INT C LEARN REPR ICL
[7]   Unsupervised Scene Change Detection via Latent Dirichlet Allocation and Multivariate Alteration Detection [J].
Du, Bo ;
Wang, Yong ;
Wu, Chen ;
Zhang, Liangpei .
IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2018, 11 (12) :4676-4689
[8]   Detection of Cars in High-Resolution Aerial Images of Complex Urban Environments [J].
ElMikaty, Mohamed ;
Stathaki, Tania .
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2017, 55 (10) :5913-5924
[9]   Stacked U-Nets for Ground Material Segmentation in Remote Sensing Imagery [J].
Ghosh, Arthita ;
Ehrlich, Max ;
Shah, Sohil ;
Davis, Larry ;
Chellappa, Rama .
PROCEEDINGS 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW), 2018, :252-256
[10]   Rich feature hierarchies for accurate object detection and semantic segmentation [J].
Girshick, Ross ;
Donahue, Jeff ;
Darrell, Trevor ;
Malik, Jitendra .
2014 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2014, :580-587