CRFNet: A Deep Convolutional Network to Learn the Potentials of a CRF for the Semantic Segmentation of Remote Sensing Images

被引:4
作者
Pastorino, Martina [1 ,2 ]
Moser, Gabriele [1 ]
Serpico, Sebastiano B. [1 ]
Zerubia, Josiane [2 ]
机构
[1] Univ Genoa, Dept Elect Elect Telecommun Engn & Naval Architect, I-16145 Genoa, Italy
[2] Univ Cote Azur, Inst Natl Rech Informat & Automat INRIA, F-06902 Sophia Antipolis, France
来源
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING | 2024年 / 62卷
关键词
Convolutional neural networks; Remote sensing; Semantic segmentation; Semantics; Task analysis; Computer architecture; Conditional random fields; Conditional random fields (CRFs); convolutional neural network (CNN); fully convolutional network (FCN); remote sensing; semantic segmentation; HIGH-RESOLUTION LIDAR; FUSION CONTEST-PART; DOMAIN ADAPTATION; NEURAL-NETWORKS; RGB; CLASSIFICATION; RESTORATION;
D O I
10.1109/TGRS.2024.3452631
中图分类号
P3 [地球物理学]; P59 [地球化学];
学科分类号
0708 ; 070902 ;
摘要
This article presents a method for the automatic learning of the potentials of a stochastic model, in particular a conditional random field (CRF), in a non-parametric fashion. The proposed model is based on a neural architecture, in order to leverage the modeling capabilities of deep learning (DL) approaches to directly learn semantic and spatial information from the input data. Specifically, the methodology is based on fully convolutional networks and fully connected neural networks. The idea is to access the multiscale information intrinsically extracted in the intermediate layers of a fully convolutional network through the integration of fully connected neural networks at different scales, while favoring the interpretability of the hidden layers as posterior probabilities. The potentials of the CRF are learned through an additional convolutional layer, whose kernel models the local spatial information considered. The loss function is computed as a linear combination of cross-entropy losses, accounting for the multiscale and the spatial information. To evaluate the capabilities of the proposed approach for the semantic segmentation of remote sensing images, the experimental validation was conducted with the ISPRS 2-D semantic labeling challenge Vaihingen and Potsdam datasets and with the IEEE GRSS data fusion contest Zeebruges dataset. As the ground truths of these benchmark datasets are spatially exhaustive, they have been modified to approximate the spatially sparse ground truths common in real remote sensing applications. The results are significant, as the proposed approach obtains higher average classification accuracies than recent state-of-the-art techniques considered in this article. The code is available at https://github.com/Ayana-Inria/CRFNet-RS.
引用
收藏
页数:19
相关论文
共 80 条
[1]  
Abouqora Y, 2020, COLLOQ INF SCI TECH, P296, DOI [10.1109/CIST49399.2021.9357275, 10.1109/CiSt49399.2021.9357275]
[2]   Conditional Random Field and Deep Feature Learning for Hyperspectral Image Classification [J].
Alam, Fahim Irfan ;
Zhou, Jun ;
Liew, Alan Wee-Chung ;
Jia, Xiuping ;
Chanussot, Jocelyn ;
Gao, Yongsheng .
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2019, 57 (03) :1612-1628
[3]   CRF LEARNING WITH CNN FEATURES FOR HYPERSPECTRAL IMAGE SEGMENTATION [J].
Alam, Fahim Irfan ;
Zhou, Jun ;
Liew, Alan Wee-Chung ;
Jia, Xiuping .
2016 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM (IGARSS), 2016, :6890-6893
[4]  
[Anonymous], 2009, Markov Random Field Modeling in Image Analysis
[5]  
[Anonymous], 2010, JMLR WORKSHOP C P
[6]   Conditional Random Fields Meet Deep Neural Networks for Semantic Segmentation Combining probabilistic graphical models with deep learning for structured prediction [J].
Arnab, Anurag ;
Zheng, Shuai ;
Jayasumana, Sadeep ;
Romera-Paredes, Bernardino ;
Larsson, Mans ;
Kirillov, Alexander ;
Savchynskyy, Bogdan ;
Rother, Carsten ;
Kahl, Fredrik ;
Torr, Philip H. S. .
IEEE SIGNAL PROCESSING MAGAZINE, 2018, 35 (01) :37-52
[7]   Higher Order Conditional Random Fields in Deep Neural Networks [J].
Arnab, Anurag ;
Jayasumana, Sadeep ;
Zheng, Shuai ;
Torr, Philip H. S. .
COMPUTER VISION - ECCV 2016, PT II, 2016, 9906 :524-540
[8]   Beyond RGB: Very high resolution urban remote sensing with multimodal deep networks [J].
Audebert, Nicolas ;
Le Saux, Bertrand ;
Lefevre, Sebastien .
ISPRS JOURNAL OF PHOTOGRAMMETRY AND REMOTE SENSING, 2018, 140 :20-32
[9]   Unsupervised Domain Adaptation Using Generative Adversarial Networks for Semantic Segmentation of Aerial Images [J].
Benjdira, Bilel ;
Bazi, Yakoub ;
Koubaa, Anis ;
Ouni, Kais .
REMOTE SENSING, 2019, 11 (11)
[10]  
Blake A, 2011, MARKOV RANDOM FIELDS FOR VISION AND IMAGE PROCESSING, P1