Support Vector Machine for Spatial Variation

Cited by: 14
Authors
Andris, Clio [1]
Cowen, David [2]
Wittenbach, Jason [3]
Affiliations
[1] MIT, Cambridge, MA 02139 USA
[2] Univ S Carolina, Columbia, SC 29208 USA
[3] Penn State Univ, University Pk, PA 16802 USA
Keywords
GEOGRAPHICALLY WEIGHTED REGRESSION; VISUALIZATION
DOI
10.1111/j.1467-9671.2012.01354.x
Chinese Library Classification (CLC) number
P9 [Physical Geography]; K9 [Geography]
Discipline classification code
0705; 070501
Abstract
Large, multivariate geographic datasets have been used to characterize geographic space with the help of spatial data mining tools. In our study, we explore the sufficiency of the Support Vector Machine (SVM), a popular machine-learning technique for unsupervised classification and clustering, to help recognize hidden patterns in a college admissions dataset. Our college admissions dataset holds over 10,000 students applying to an undisclosed university during one undisclosed year. Students are qualified almost exclusively by their standardized test scores and school records, and a known admissions decision is rendered based on these criteria. Given that the university has a number of political, social and geographic econometric factors in its admissions decisions, we use SVM to find implicit spatial patterns that may favor students from certain geographic regions. We first explore the characteristics of the applicants in the college admissions case study. Next, we explain the SVM technique and our unique 'threshold line' methodology for both discrete (regional) and continuous (k-neighbors) space. We then analyze the results of the regional and k-neighbor tests in order to respond to the methodological and geographic research questions.
Pages: 41-61
Page count: 21