Analyzing driving factors of land values in urban scale based on big data and non-linear machine learning techniques

被引:99
作者
Ma, Jun [1 ]
Cheng, Jack C. P. [1 ]
Jiang, Feifeng [2 ]
Chen, Weiwei [1 ]
Zhang, Jingcheng [3 ]
机构
[1] Hong Kong Univ Sci & Technol, Dept Civil & Environm Engn, Hong Kong, Peoples R China
[2] City Univ Hong Kong, Dept Architecture & Civil Engn, Hong Kong, Peoples R China
[3] Hong Kong Univ Sci & Technol, Sch Engn, Hong Kong, Peoples R China
关键词
Big data; Land values per square foot; Machine learning; Place of interest; Recursive feature elimination; ENERGY USE INTENSITY; RANDOM FOREST; PRICE; CREDITS; SELECTION; IMPACT; CITY;
D O I
10.1016/j.landusepol.2020.104537
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
Land value plays a vital role in the real estate market. It is a critical reference for urban planners to reallocate land resources and introduce valid policies. Studying the influential factors on land value can help better understand the spatial-temporal variation of land values and design effective control policies. This attracted a number of scholars to study the spatial and temporal relationships between land value and its possible influential factors from the perspective of macro and micro. However, the majority of the existing studies have the problems of linear assumption and multicollinearity in research models. Limited features and the lack of feature selection procedure are another two commonly seen limitations. To overcome the gaps, this paper adopts non-linear machine learning (ML) methods to investigate the influential factors on land values per square foot based on "big data" in New York City. More than one thousand potential factors are considered, covering from the land attribute, point of interest, demographics, housing, to economic, education, and social. They are further selected using a feature extraction model named Recursive Feature Elimination (RFE). Six ML algorithms, including Random Forest (RF), Gradient Boosting Decision Tree (GBDT), Multi Linear Regression (MLR), Linear Support Vector Regression (SVR), Multilayer Perceptron (MLP) Regression, and K-Nearest Neighbor (KNN) Regression are evaluated and compared. The optimal one with an R-square value of 0.933 is used to calculate the feature importance further. Several important impact features are disclosed, including the number of newsstands, and the vacant housing percentage.
引用
收藏
页数:13
相关论文
共 39 条
[1]   Classification and Recognition of 3D Image of Charlier moments using a Multilayer Perceptron Architecture [J].
Amakdouf, Hicham ;
El Mallahi, Mostafa ;
Zouhri, Amal ;
Tahiri, Ahmed ;
Qjidaa, Hassan .
PROCEEDINGS OF THE FIRST INTERNATIONAL CONFERENCE ON INTELLIGENT COMPUTING IN DATA SCIENCES (ICDS2017), 2018, 127 :226-235
[2]   Random forests [J].
Breiman, L .
MACHINE LEARNING, 2001, 45 (01) :5-32
[3]   THE FUNDAMENTALS OF LAND PRICES AND URBAN-GROWTH [J].
CAPOZZA, DR ;
HELSLEY, RW .
JOURNAL OF URBAN ECONOMICS, 1989, 26 (03) :295-306
[4]   A non-linear case-based reasoning approach for retrieval of similar cases and selection of target credits in LEED projects [J].
Cheng, Jack C. P. ;
Ma, Lucky J. .
BUILDING AND ENVIRONMENT, 2015, 93 :349-361
[5]   A data-driven study of important climate factors on the achievement of LEED-EB credits [J].
Cheng, Jack C. P. ;
Ma, Lucky J. .
BUILDING AND ENVIRONMENT, 2015, 90 :232-244
[6]   The value of open spaces in residential land use [J].
Geoghegan, J .
LAND USE POLICY, 2002, 19 (01) :91-98
[7]   Recursive feature elimination with random forest for PTR-MS analysis of agroindustrial products [J].
Granitto, Pablo M. ;
Furlanello, Cesare ;
Biasioli, Franco ;
Gasperi, Flavia .
CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS, 2006, 83 (02) :83-90
[8]  
Hillier J., 2002, Shadows of power: An allegory of prudence in land-use planning, The RTPI library series
[9]   Spatially non-stationary relationships between urban residential land price and impact factors in Wuhan city, China [J].
Hu, Shougeng ;
Yang, Shengfu ;
Li, Weidong ;
Zhang, Chuanrong ;
Xu, Feng .
APPLIED GEOGRAPHY, 2016, 68 :48-56
[10]   Modeling land price distribution using multifractal IDW interpolation and fractal filtering method [J].
Hu, Shougeng ;
Cheng, Qiuming ;
Wang, Le ;
Xu, Deyi .
LANDSCAPE AND URBAN PLANNING, 2013, 110 :25-35