Principal Component Analysis on Spatial Data: An Overview

被引:294
作者
Demsar, Urska [1 ]
Harris, Paul [2 ]
Brunsdon, Chris [3 ]
Fotheringham, A. Stewart [1 ]
McLoone, Sean [4 ]
机构
[1] Univ St Andrews, Sch Geog & Geosci, Ctr GeoInformat, St Andrews KY16 9AL, Fife, Scotland
[2] Natl Univ Ireland Maynooth, Natl Ctr Geocomputat, Maynooth, Kildare, Ireland
[3] Univ Liverpool, Sch Environm Sci, Liverpool L69 3BX, Merseyside, England
[4] Natl Univ Ireland Maynooth, Dept Elect Engn, Maynooth, Kildare, Ireland
基金
爱尔兰科学基金会;
关键词
dimensionality reduction; multivariate statistics; principal components analysis; spatial analysis and mathematical modeling; spatial data; FACTORIAL KRIGING ANALYSIS; GEOGRAPHICALLY WEIGHTED REGRESSION; COMPOSITIONAL DATA-ANALYSIS; CANCER-MORTALITY-RATES; GEOSTATISTICAL ANALYSIS; OUTLIER DETECTION; R-PACKAGE; TEMPORAL DISTRIBUTION; GIS; VEGETATION;
D O I
10.1080/00045608.2012.689236
中图分类号
P9 [自然地理学]; K9 [地理];
学科分类号
0705 ; 070501 ;
摘要
This article considers critically how one of the oldest and most widely applied statistical methods, principal components analysis (PCA), is employed with spatial data. We first provide a brief guide to how PCA works: This includes robust and compositional PCA variants, links to factor analysis, latent variable modeling, and multilevel PCA. We then present two different approaches to using PCA with spatial data. First we look at the nonspatial approach, which avoids challenges posed by spatial data by using a standard PCA on attribute space only. Within this approach we identify four main methodologies, which we define as (1) PCA applied to spatial objects, (2) PCA applied to raster data, (3) atmospheric science PCA, and (4) PCA on flows. In the second approach, we look at PCA adapted for effects in geographical space by looking at PCA methods adapted for first-order nonstationary effects (spatial heterogeneity) and second-order stationary effects (spatial autocorrelation). We also describe how PCA can be used to investigate multiple scales of spatial autocorrelation. Furthermore, we attempt to disambiguate a terminology confusion by clarifying which methods are specifically termed "spatial PCA" in the literature and how this term has different meanings in different areas. Finally, we look at a further three variations of PCA that have not been used in a spatial context but show considerable potential in this respect: simple PCA, sparse PCA, and multilinear PCA.
引用
收藏
页码:106 / 128
页数:23
相关论文
共 189 条
[1]  
Afifi A., 2003, COMPUTER AIDED MULTI
[2]   Statistical shape analysis via principal factor analysis [J].
Aguirre, Mauricio Reyes ;
Linguraru, Marius George ;
Marias, Kostas ;
Ayache, Nicholas ;
Nolte, Lutz-Peter ;
Ballester, Miguel Angel Gonzalez .
2007 4TH IEEE INTERNATIONAL SYMPOSIUM ON BIOMEDICAL IMAGING : MACRO TO NANO, VOLS 1-3, 2007, :1216-+
[3]  
AITCHISON J, 1982, J ROY STAT SOC B, V44, P139
[4]   Compositional data analysis: Where are we and where should we be heading? [J].
Aitchison, J ;
Egozcue, JJ .
MATHEMATICAL GEOLOGY, 2005, 37 (07) :829-850
[5]  
AITCHISON J, 1983, BIOMETRIKA, V70, P57
[6]   Identifying the boundaries of the historical site of Persepolis using remote sensing [J].
Aminzadeh, Behnaz ;
Samani, Firuzeh .
REMOTE SENSING OF ENVIRONMENT, 2006, 102 (1-2) :52-62
[7]   ESTIMATING LINEAR STATISTICAL RELATIONSHIPS [J].
ANDERSON, TW .
ANNALS OF STATISTICS, 1984, 12 (01) :1-45
[8]   The role of the Iceland Ice Sheet in the North Atlantic during the late Quaternary: a review and evidence from Denmark Strait [J].
Andrews, John T. .
JOURNAL OF QUATERNARY SCIENCE, 2008, 23 (01) :3-20
[9]  
[Anonymous], 2002, Principal components analysis
[10]  
[Anonymous], 1967, Applied Statistics, DOI DOI 10.1038/S41598-017-00047-5