Unsupervised Clustering and Active Learning of Hyperspectral Images With Nonlinear Diffusion

被引:62
作者
Murphy, James M. [1 ]
Maggioni, Mauro [2 ,3 ]
机构
[1] Tufts Univ, Dept Math, Medford, MA 02155 USA
[2] Johns Hopkins Univ, Dept Math, Dept Appl Math & Stat, Inst Data Intens Engn & Sci, Baltimore, MD 21218 USA
[3] Johns Hopkins Univ, Math Inst Data Sci, Baltimore, MD 21218 USA
来源
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING | 2019年 / 57卷 / 03期
关键词
Clustering algorithms; graph theory; harmonic analysis; hyperspectral imaging; machine learning; unsupervised learning; INDEPENDENT COMPONENT ANALYSIS; SPECTRAL-SPATIAL CLASSIFICATION; DIMENSIONALITY REDUCTION; LOGISTIC-REGRESSION; ALGORITHMS; SUBSPACE; FUSION;
D O I
10.1109/TGRS.2018.2869723
中图分类号
P3 [地球物理学]; P59 [地球化学];
学科分类号
0708 ; 070902 ;
摘要
The problem of unsupervised learning and segmentation of hyperspectral images is a significant challenge in remote sensing. The high dimensionality of hyperspectral data, presence of substantial noise, and overlap of classes all contribute to the difficulty of automatically clustering and segmenting hyperspectral images. We propose an unsupervised learning technique called spectral-spatial diffusion learning (DLSS) that combines a geometric estimation of class modes with a diffusion-inspired labeling that incorporates both spectral and spatial information. The mode estimation incorporates the geometry of the hyperspectral data by using diffusion distance to promote learning a unique mode from each class. These class modes are then used to label all the points by a joint spectral-spatial nonlinear diffusion process. A related variation of DLSS is also discussed, which enables active learning by requesting labels for a very small number of well-chosen pixels, dramatically boosting overall clustering results. Extensive experimental analysis demonstrates the efficacy of the proposed methods against benchmark and state-of-the-art hyperspectral analysis techniques on a variety of real data sets, their robustness to choices of parameters, and their low computational complexity.
引用
收藏
页码:1829 / 1845
页数:17
相关论文
共 75 条
[1]  
Acito N, 2003, INT GEOSCI REMOTE SE, P3745
[2]  
[Anonymous], LEARNING UNSUPERVISE
[3]  
[Anonymous], 2001, SPRINGER SERIES STAT, DOI DOI 10.1007/978-0-387-21606-5
[4]  
[Anonymous], 2016, NEURAL NETWORK FEATU
[5]  
[Anonymous], J BIOMED OPT
[6]  
[Anonymous], 2011, J CHEM PHYS
[7]  
[Anonymous], P 9 WORKSH HYP IM SI
[8]   Beyond kappa: A review of interrater agreement measures [J].
Banerjee, M .
CANADIAN JOURNAL OF STATISTICS-REVUE CANADIENNE DE STATISTIQUE, 1999, 27 (01) :3-23
[9]   Laplacian eigenmaps for dimensionality reduction and data representation [J].
Belkin, M ;
Niyogi, P .
NEURAL COMPUTATION, 2003, 15 (06) :1373-1396
[10]   Integration of heterogeneous data for classification in hyperspectral satellite imagery [J].
Benedetto, J. ;
Czaja, W. ;
Dobrosotskaya, J. ;
Doster, T. ;
Duke, K. ;
Gillis, D. .
ALGORITHMS AND TECHNOLOGIES FOR MULTISPECTRAL, HYPERSPECTRAL, AND ULTRASPECTRAL IMAGERY XVIII, 2012, 8390