Attention-Based Pyramid Network for Segmentation and Classification of High-Resolution and Hyperspectral Remote Sensing Images

被引:21
作者
Xu, Qingsong [1 ,2 ]
Yuan, Xin [3 ]
Ouyang, Chaojun [1 ,2 ,4 ]
Zeng, Yue [5 ]
机构
[1] Chinese Acad Sci, Inst Mt Hazards & Environm, Key Lab Mt Hazards & Surface Proc, Chengdu 610041, Peoples R China
[2] Univ Chinese Acad Sci, Beijing 100049, Peoples R China
[3] Bell Labs, Murray Hill, NJ 07974 USA
[4] Chinese Acad Sci, CAS Ctr Excellence Tibetan Plateau Earth Sci, Beijing 100101, Peoples R China
[5] Southwest Jiao Tong Univ, Sch Econ & Management, Chengdu 610031, Peoples R China
关键词
high-resolution and hyperspectral images; spatial object distribution diversity; spectral information extraction; attention-based pyramid network; heavy-weight spatial feature fusion pyramid network (FFPNet); spatial-spectral FFPNet; SEMANTIC SEGMENTATION; NEURAL-NETWORKS; FUSION;
D O I
10.3390/rs12213501
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
Unlike conventional natural (RGB) images, the inherent large scale and complex structures of remote sensing images pose major challenges such as spatial object distribution diversity and spectral information extraction when existing models are directly applied for image classification. In this study, we develop an attention-based pyramid network for segmentation and classification of remote sensing datasets. Attention mechanisms are used to develop the following modules: (i) a novel and robust attention-based multi-scale fusion method effectively fuses useful spatial or spectral information at different and same scales; (ii) a region pyramid attention mechanism using region-based attention addresses the target geometric size diversity in large-scale remote sensing images; and (iii) cross-scale attention in our adaptive atrous spatial pyramid pooling network adapts to varied contents in a feature-embedded space. Different forms of feature fusion pyramid frameworks are established by combining these attention-based modules. First, a novel segmentation framework, called the heavy-weight spatial feature fusion pyramid network (FFPNet), is proposed to address the spatial problem of high-resolution remote sensing images. Second, an end-to-end spatial-spectral FFPNet is presented for classifying hyperspectral images. Experiments conducted on ISPRS Vaihingen and ISPRS Potsdam high-resolution datasets demonstrate the competitive segmentation accuracy achieved by the proposed heavy-weight spatial FFPNet. Furthermore, experiments on the Indian Pines and the University of Pavia hyperspectral datasets indicate that the proposed spatial-spectral FFPNet outperforms the current state-of-the-art methods in hyperspectral image classification.
引用
收藏
页码:1 / 34
页数:34
相关论文
共 67 条
[1]   Feature selection and classification of hyperspectral images, with support vector machines [J].
Archibald, Rick ;
Fann, George .
IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2007, 4 (04) :674-677
[2]   Deep Learning for Classification of Hyperspectral Data [J].
Audebert, Nicolas ;
Le Saux, Bertrand ;
Lefevre, Sebastien .
IEEE GEOSCIENCE AND REMOTE SENSING MAGAZINE, 2019, 7 (02) :159-173
[3]   Beyond RGB: Very high resolution urban remote sensing with multimodal deep networks [J].
Audebert, Nicolas ;
Le Saux, Bertrand ;
Lefevre, Sebastien .
ISPRS JOURNAL OF PHOTOGRAMMETRY AND REMOTE SENSING, 2018, 140 :20-32
[4]   SegNet: A Deep Convolutional Encoder-Decoder Architecture for Image Segmentation [J].
Badrinarayanan, Vijay ;
Kendall, Alex ;
Cipolla, Roberto .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2017, 39 (12) :2481-2495
[5]   Encoder-Decoder with Atrous Separable Convolution for Semantic Image Segmentation [J].
Chen, Liang-Chieh ;
Zhu, Yukun ;
Papandreou, George ;
Schroff, Florian ;
Adam, Hartwig .
COMPUTER VISION - ECCV 2018, PT VII, 2018, 11211 :833-851
[6]   DeepLab: Semantic Image Segmentation with Deep Convolutional Nets, Atrous Convolution, and Fully Connected CRFs [J].
Chen, Liang-Chieh ;
Papandreou, George ;
Kokkinos, Iasonas ;
Murphy, Kevin ;
Yuille, Alan L. .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2018, 40 (04) :834-848
[7]   Hyperspectral Image Classification Using Dictionary-Based Sparse Representation [J].
Chen, Yi ;
Nasrabadi, Nasser M. ;
Tran, Trac D. .
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2011, 49 (10) :3973-3985
[8]   Deep Feature Extraction and Classification of Hyperspectral Images Based on Convolutional Neural Networks [J].
Chen, Yushi ;
Jiang, Hanlu ;
Li, Chunyang ;
Jia, Xiuping ;
Ghamisi, Pedram .
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2016, 54 (10) :6232-6251
[9]   Context Aggregation Network for Semantic Labeling in Aerial Images [J].
Cheng, Wensheng ;
Yang, Wen ;
Wang, Min ;
Wang, Gang ;
Chen, Jinyong .
REMOTE SENSING, 2019, 11 (10)
[10]  
Deng J, 2009, PROC CVPR IEEE, P248, DOI 10.1109/CVPRW.2009.5206848