Understanding house price appreciation using multi-source big geo-data and machine learning

被引:119
|
作者
Kang, Yuhao [1 ,2 ]
Zhang, Fan [1 ]
Peng, Wenzhe [3 ]
Gao, Song [2 ]
Rao, Jinmeng [2 ]
Duarte, Fabio [1 ,4 ]
Ratti, Carlo [1 ]
机构
[1] MIT, Dept Urban Studies & Planning, Senseable City Lab, Cambridge, MA 02139 USA
[2] Univ Wisconsin, Dept Geog, Geospatial Data Sci Lab, Madison, WI 53703 USA
[3] MIT, Dept Architecture, 77 Massachusetts Ave, Cambridge, MA 02139 USA
[4] PUCPR, Urban Management Program, BR-80215910 Curitiba, Parana, Brazil
基金
中国国家自然科学基金;
关键词
House price appreciation rate; Street view images; House photos; Human mobility patterns; Geographically weighted regression; STREET VIEW; NEIGHBORHOODS; IMAGERY; MARKET;
D O I
10.1016/j.landusepol.2020.104919
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
Understanding house price appreciation benefits place-based decision makings and real estate market analyses. Although large amounts of interests have been paid in the house price modeling, limited work has focused on evaluating the price appreciation rate. In this study, we propose a data-fusion framework to examine how well house price appreciation potentials can be predicted by combining multiple data sources. We used data sets including house structural attributes, house photos, locational amenities, street view images, transportation accessibility, visitor patterns, and socioeconomic attributes of neighborhoods to enrich our understanding of the real estate appreciation and its predictive modeling. As a case study, we investigate more than 20,000 houses in the Greater Boston Area, and discuss the spatial dependency of house price appreciations, influential variables and their relationships. In detail, we extract deep features from street view images and house photos using a deep learning model, merging features from multi-source data and modeling house price appreciation using machine learning models and the geographically weighted regression at two spatial scales: fine-scale point level and aggregated neighborhood level. Results show that the house price appreciation rate can be modeled with high accuracy using the proposed framework (R-2 = 0.74 for gradient boosting machine at neighborhood-scale). We discovered that houses with low house prices and small house areas may have a higher house appreciation potential. Our results provide insights into how multi-source big geo-data can be employed in machine learning frameworks to characterize real estate price trends and help understand human settlements for policy-making.
引用
收藏
页数:11
相关论文
共 50 条
  • [1] Measuring Community Greening Merging Multi-Source Geo-Data
    Gu, Weiying
    Chen, Yiyong
    Dai, Muye
    SUSTAINABILITY, 2019, 11 (04)
  • [2] Measuring Housing Vitality from Multi-Source Big Data and Machine Learning
    Zhou, Yang
    Xue, Lirong
    Shi, Zhengyu
    Wu, Libo
    Fan, Jianqing
    JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2022, 117 (539) : 1045 - 1059
  • [3] House Price Prediction: A Multi-Source Data Fusion Perspective
    Zhao, Yaping
    Zhao, Jichang
    Lam, Edmund Y.
    BIG DATA MINING AND ANALYTICS, 2024, 7 (03): : 603 - 620
  • [4] Comments on "Measuring Housing Vitality from Multi-Source Big Data and Machine Learning"
    Tu, Wei
    Jiang, Bei
    Kong, Linglong
    JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2022, 117 (539) : 1060 - 1062
  • [5] Discussion of "Measuring Housing Vitality from Multi-Source Big Data and Machine Learning"
    Banerjee, Sudipto
    JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2022, 117 (539) : 1063 - 1065
  • [6] Delineating Peri-Urban Areas Using Multi-Source Geo-Data: A Neural Network Approach and SHAP Explanation
    Sun, Xiaomeng
    Liu, Xingjian
    Zhou, Yang
    REMOTE SENSING, 2023, 15 (16)
  • [7] Mapping Himalayan leucogranites by machine learning using multi-source data
    Wang Z.
    Zuo R.
    Earth Science Frontiers, 2023, 30 (05) : 216 - 226
  • [8] Geo-Object-Based Soil Organic Matter Mapping Using Machine Learning Algorithms With Multi-Source Geo-Spatial Data
    Wu, Tianjun
    Luo, Jiancheng
    Dong, Wen
    Sun, Yingwei
    Xia, Liegang
    Zhang, Xuejian
    IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2019, 12 (04) : 1091 - 1106
  • [9] A Machine Learning Approach for Convective Initiation Detection Using Multi-source Data
    Liu, Xuan
    Chen, Haonan
    Han, Lei
    Ge, Yurong
    2022 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM (IGARSS 2022), 2022, : 6518 - 6521
  • [10] Dynamic Maize Yield Predictions Using Machine Learning on Multi-Source Data
    Croci, Michele
    Impollonia, Giorgio
    Meroni, Michele
    Amaducci, Stefano
    REMOTE SENSING, 2023, 15 (01)