Principal Component Analysis on Spatial Data: An Overview

被引:294
作者
Demsar, Urska [1 ]
Harris, Paul [2 ]
Brunsdon, Chris [3 ]
Fotheringham, A. Stewart [1 ]
McLoone, Sean [4 ]
机构
[1] Univ St Andrews, Sch Geog & Geosci, Ctr GeoInformat, St Andrews KY16 9AL, Fife, Scotland
[2] Natl Univ Ireland Maynooth, Natl Ctr Geocomputat, Maynooth, Kildare, Ireland
[3] Univ Liverpool, Sch Environm Sci, Liverpool L69 3BX, Merseyside, England
[4] Natl Univ Ireland Maynooth, Dept Elect Engn, Maynooth, Kildare, Ireland
基金
爱尔兰科学基金会;
关键词
dimensionality reduction; multivariate statistics; principal components analysis; spatial analysis and mathematical modeling; spatial data; FACTORIAL KRIGING ANALYSIS; GEOGRAPHICALLY WEIGHTED REGRESSION; COMPOSITIONAL DATA-ANALYSIS; CANCER-MORTALITY-RATES; GEOSTATISTICAL ANALYSIS; OUTLIER DETECTION; R-PACKAGE; TEMPORAL DISTRIBUTION; GIS; VEGETATION;
D O I
10.1080/00045608.2012.689236
中图分类号
P9 [自然地理学]; K9 [地理];
学科分类号
0705 ; 070501 ;
摘要
This article considers critically how one of the oldest and most widely applied statistical methods, principal components analysis (PCA), is employed with spatial data. We first provide a brief guide to how PCA works: This includes robust and compositional PCA variants, links to factor analysis, latent variable modeling, and multilevel PCA. We then present two different approaches to using PCA with spatial data. First we look at the nonspatial approach, which avoids challenges posed by spatial data by using a standard PCA on attribute space only. Within this approach we identify four main methodologies, which we define as (1) PCA applied to spatial objects, (2) PCA applied to raster data, (3) atmospheric science PCA, and (4) PCA on flows. In the second approach, we look at PCA adapted for effects in geographical space by looking at PCA methods adapted for first-order nonstationary effects (spatial heterogeneity) and second-order stationary effects (spatial autocorrelation). We also describe how PCA can be used to investigate multiple scales of spatial autocorrelation. Furthermore, we attempt to disambiguate a terminology confusion by clarifying which methods are specifically termed "spatial PCA" in the literature and how this term has different meanings in different areas. Finally, we look at a further three variations of PCA that have not been used in a spatial context but show considerable potential in this respect: simple PCA, sparse PCA, and multilinear PCA.
引用
收藏
页码:106 / 128
页数:23
相关论文
共 189 条
[51]   Investigation of geographical and temporal distribution of tropospheric ozone in Catalonia (North-East Spain) during the period 2000-2004 using multivariate data analysis methods [J].
Felipe-Sotelo, M. ;
Gustems, L. ;
Hernandez, I. ;
Terrado, M. ;
Tauler, R. .
ATMOSPHERIC ENVIRONMENT, 2006, 40 (38) :7421-7436
[52]  
Filzmoser P, 1999, ENVIRONMETRICS, V10, P363, DOI 10.1002/(SICI)1099-095X(199907/08)10:4<363::AID-ENV362>3.0.CO
[53]  
2-0
[54]   Multivariate outlier detection in exploration geochemistry [J].
Filzmoser, P ;
Garrett, RG ;
Reimann, C .
COMPUTERS & GEOSCIENCES, 2005, 31 (05) :579-587
[55]   Outlier identification in high dimensions [J].
Filzmoser, Peter ;
Maronna, Ricardo ;
Werner, Mark .
COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2008, 52 (03) :1694-1711
[56]   Principal component analysis for compositional data with outliers [J].
Filzmoser, Peter ;
Hron, Karel ;
Reimann, Clemens .
ENVIRONMETRICS, 2009, 20 (06) :621-632
[57]  
Floras S.A., 1999, INT J APPL EARTH OBS, V1, P68
[58]  
Fodor I.K, 2002, SURVEY DIMENSION RED, DOI DOI 10.2172/15002155
[59]   How to get rid of W: a latent variables approach to modelling spatially lagged variables [J].
Folmer, Henk ;
Oud, Johan .
ENVIRONMENT AND PLANNING A-ECONOMY AND SPACE, 2008, 40 (10) :2526-2538
[60]  
Fotheringham A. S., 2002, Geographically weighted regression: The analysis of spatially varying relationships