Distance metric choice can both reduce and induce collinearity in geographically weighted regression

被引:20
作者
Comber, Alexis [1 ]
Chi, Khanh [2 ]
Huy, Man Q. [3 ]
Nguyen, Quan [4 ]
Lu, Binbin [5 ]
Phe, Hoang H. [6 ]
Harris, Paul [7 ]
机构
[1] Univ Leeds, Spatial Data Analyt, Leeds, W Yorkshire, England
[2] GeoViet Consulting Co Ltd, Hanoi, Vietnam
[3] Vietnam Natl Univ, Hanoi, Vietnam
[4] Natl Univ Civil Engn, Fac Construct Econ, Hanoi, Vietnam
[5] Wuhan Univ, Sch Remote Sensing & Informat Engn, Wuhan, Hubei, Peoples R China
[6] Vinaconex R&D, Hanoi, Vietnam
[7] Rothamsted Res, Harpenden, Herts, England
基金
英国生物技术与生命科学研究理事会;
关键词
Geographically weighted regression; distance metrics; model fit; collinearity; GENERAL FRAMEWORK; CROSS-VALIDATION; MONTE-CARLO; MODELS; SELECTION; COEFFICIENTS; ASSOCIATION; INFERENCE;
D O I
10.1177/2399808318784017
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
This paper explores the impact of different distance metrics on collinearity in local regression models such as geographically weighted regression. Using a case study of house price data collected in Ha Noi, Vietnam, and by fully varying both power and rotation parameters to create different Minkowski distances, the analysis shows that local collinearity can be both negatively and positively affected by distance metric choice. The Minkowski distance that maximised collinearity in a geographically weighted regression was approximate to a Manhattan distance with (power = 0.70) with a rotation of 30 degrees, and that which minimised collinearity was parameterised with power = 0.05 and a rotation of 70 degrees. The results indicate that distance metric choice can provide a useful extra tuning component to address local collinearity issues in spatially varying coefficient modelling and that understanding the interaction of distance metric and collinearity can provide insight into the nature and structure of the data relationships. The discussion considers first, the exploration and selection of different distance metrics to minimise collinearity as an alternative to localised ridge regression, lasso and elastic net approaches. Second, it discusses the how distance metric choice could extend the methods that additionally optimise local model fit (lasso and elastic net) by selecting a distance metric that further helped minimise local collinearity. Third, it identifies the need to investigate the relationship between kernel bandwidth, distance metrics and collinearity as an area of further work.
引用
收藏
页码:489 / 507
页数:19
相关论文
共 50 条
  • [1] Akaike H., 1973, P 2 INT S INF THEOR, P267, DOI 10.1007/978-1-4612-1694-0
  • [2] [Anonymous], 2005, Regression Diagnostics: Identifying Influential Data and Sources of Collinearity
  • [3] Space varying coefficient models for small area data
    Assunçao, RM
    [J]. ENVIRONMETRICS, 2003, 14 (05) : 453 - 473
  • [4] Alleviating the effect of collinearity in geographically weighted regression
    Barcena, M. J.
    Menendez, P.
    Palacios, M. B.
    Tusell, F.
    [J]. JOURNAL OF GEOGRAPHICAL SYSTEMS, 2014, 16 (04) : 441 - 466
  • [5] BOWMAN AW, 1984, BIOMETRIKA, V71, P353
  • [6] Some notes on parametric significance tests for geographically weighted regression
    Brunsdon, C
    Fotheringham, AS
    Charlton, M
    [J]. JOURNAL OF REGIONAL SCIENCE, 1999, 39 (03) : 497 - 524
  • [7] Geographically weighted regression: A method for exploring spatial nonstationarity
    Brunsdon, C
    Fotheringham, AS
    Charlton, ME
    [J]. GEOGRAPHICAL ANALYSIS, 1996, 28 (04) : 281 - 298
  • [8] Brunsdon C., 2012, ACC 2012 10 INT S SP
  • [9] GENERATING MODELS BY EXPANSION METHOD - APPLICATIONS TO GEOGRAPHICAL RESEARCH
    CASETTI, E
    [J]. GEOGRAPHICAL ANALYSIS, 1972, 4 (01) : 81 - 91
  • [10] Charlton M., 2003, GWR 3 SOFTWARE GEOGR