Monitoring housing rental prices based on social media:An integrated approach of machine-learning algorithms and hedonic modeling to inform equitable housing policies

被引:169
作者
Hu, Lirong [1 ]
He, Shenjing [2 ]
Han, Zixuan [1 ]
Xiao, He [3 ]
Su, Shiliang [1 ,4 ]
Weng, Min [1 ]
Cai, Zhongliang [1 ]
机构
[1] Wuhan Univ, Sch Resource & Environm Sci, Wuhan, Hubei, Peoples R China
[2] Univ Hong Kong, Shenzhen Inst Res & Innovat, Shenzhen, Peoples R China
[3] Chongqing Geomat Ctr, Chongqing, Peoples R China
[4] Wuhan Univ, Collaborat Innovat Ctr Geospatial Technol, Wuhan, Hubei, Peoples R China
基金
美国国家科学基金会;
关键词
Housing rental price; Social media; Hedonic model; Multilevel determinants; Machine learning; Equitable housing policies; LOCAL MORANS I; LAND-USE; SPATIAL PREDICTION; VARIABLE SELECTION; MIGRANT WORKERS; PUBLIC-HEALTH; TEA EXPANSION; CHINA; DETERMINANTS; REGRESSION;
D O I
10.1016/j.landusepol.2018.12.030
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
National land use policies and strategies worldwide have attempted to establish a healthy housing rental market towards urban sustainability. Monitoring fine-scale housing rental prices should provide essential implications for equitable housing policies. However, doing so remains a challenge because aggregated data were traditionally collected at a coarse scale through census or social surveys. On-line housing rental websites (OHRWs) have become popular social media platforms in the housing studies. This paper attempts to demonstrate how to monitor fine-scale housing rental prices based on OHRWs using the case of Shenzhen in China. Employing hedonic model, a set of housing rental determinants are initially selected from three characteristics (neighborhood, location and structure) and at three levels (nearest accessibility, 15-minute walking distance availability and sub-district availability). Housing rent prediction models are then established (respectively for October 2017 and February 2018) using the training samples collected from the OHRWs and six machine learning algorithms, including random forest regression (RFR), extra-trees regression (ETR), gradient-boosting regression (GBR), support vector regression (SVR), multi-layer perceptron neural network (MLP-NN) and k-nearest neighbor algorithm (k-NN). Thereafter, the relative importance of the determinants is calculated and visualized using partial dependence plots. Finally, the models are used to monitor housing rental price dynamics for all of the communities within Shenzhen. Results show that all of the algorithms except SVR generally present good performance. Among them, RFR and ETR are the best one in October 2017 and February 2018, respectively. Concerning the spatial pattern of housing rental, the high-high clusters merge in the central districts, whereas the low-low clusters are located in the outskirts, and the growth rate is the greatest in the farthest outskirts from the central districts. Each determinant affects the housing rent across different scale and subdistrict availability and nearest accessibility are more important than 15-minute walking distance availability. The two most influential determinants are sub-district job opportunity and nearest accessibility to health care facilities. The case of Shenzhen shows that the demonstrated framework, which integrates machine-learning algorithms and the hedonic modeling, is practical and efficient. The approach is believed to provide an essential tool to inform equitable housing policies.
引用
收藏
页码:657 / 673
页数:17
相关论文
共 120 条
[1]   The impact of employment on housing prices: Detailed evidence from FDI in Ireland [J].
Agnew, Kerri ;
Lyons, Ronan C. .
REGIONAL SCIENCE AND URBAN ECONOMICS, 2018, 70 :174-189
[3]   Land, urban form, and politics: A study on Dubai's housing landscape and rental affordability [J].
Alawadi, Khaled ;
Khana, Asim ;
Almulla, Ahmed .
CITIES, 2018, 81 :115-130
[4]   Partitioning the variation among spatial, temporal and environmental components in a multivariate data set [J].
Anderson, MJ ;
Gribble, NA .
AUSTRALIAN JOURNAL OF ECOLOGY, 1998, 23 (02) :158-167
[5]  
[Anonymous], 2004, Introduction to machine learning
[6]   GeoDa:: An introduction to spatial data analysis [J].
Anselin, L ;
Syabri, I ;
Kho, Y .
GEOGRAPHICAL ANALYSIS, 2006, 38 (01) :5-22
[7]   LOCAL INDICATORS OF SPATIAL ASSOCIATION - LISA [J].
ANSELIN, L .
GEOGRAPHICAL ANALYSIS, 1995, 27 (02) :93-115
[8]   Accidental, open and everywhere: Emerging data sources for the understanding of cities [J].
Arribas-Bel, Daniel .
APPLIED GEOGRAPHY, 2014, 49 :45-53
[9]   Gentrification, suburban decline, and the financialization of multi-family rental housing: The case of Toronto [J].
August, Martine ;
Walks, Alan .
GEOFORUM, 2018, 89 :124-136
[10]   A hybrid fuzzy regression-fuzzy cognitive map algorithm for forecasting and optimization of housing market fluctuations [J].
Azadeh, A. ;
Ziaei, B. ;
Moghaddam, M. .
EXPERT SYSTEMS WITH APPLICATIONS, 2012, 39 (01) :298-315