Understanding house price appreciation using multi-source big geo-data and machine learning

Cited by: 119
Authors
Kang, Yuhao [1, 2]
Zhang, Fan [1 ]
Peng, Wenzhe [3 ]
Gao, Song [2 ]
Rao, Jinmeng [2 ]
Duarte, Fabio [1, 4]
Ratti, Carlo [1 ]
Affiliations
[1] MIT, Dept Urban Studies & Planning, Senseable City Lab, Cambridge, MA 02139 USA
[2] Univ Wisconsin, Dept Geog, Geospatial Data Sci Lab, Madison, WI 53703 USA
[3] MIT, Dept Architecture, 77 Massachusetts Ave, Cambridge, MA 02139 USA
[4] PUCPR, Urban Management Program, BR-80215910 Curitiba, Parana, Brazil
Funding
National Natural Science Foundation of China
Keywords
House price appreciation rate; Street view images; House photos; Human mobility patterns; Geographically weighted regression; STREET VIEW; NEIGHBORHOODS; IMAGERY; MARKET;
DOI
10.1016/j.landusepol.2020.104919
Chinese Library Classification
X [Environmental Science, Safety Science]
Subject classification code
08; 0830
Abstract
Understanding house price appreciation benefits place-based decision-making and real estate market analyses. Although considerable attention has been paid to house price modeling, limited work has focused on evaluating the price appreciation rate. In this study, we propose a data-fusion framework to examine how well house price appreciation potential can be predicted by combining multiple data sources. We use data sets including house structural attributes, house photos, locational amenities, street view images, transportation accessibility, visitor patterns, and socioeconomic attributes of neighborhoods to enrich our understanding of real estate appreciation and its predictive modeling. As a case study, we investigate more than 20,000 houses in the Greater Boston Area and discuss the spatial dependency of house price appreciation, the influential variables, and their relationships. Specifically, we extract deep features from street view images and house photos using a deep learning model, merge features from the multi-source data, and model house price appreciation using machine learning models and geographically weighted regression at two spatial scales: the fine-scale point level and the aggregated neighborhood level. Results show that the house price appreciation rate can be modeled with high accuracy using the proposed framework (R² = 0.74 for the gradient boosting machine at the neighborhood scale). We find that houses with low prices and small house areas may have higher appreciation potential. Our results provide insights into how multi-source big geo-data can be employed in machine learning frameworks to characterize real estate price trends and help understand human settlements for policy-making.
Pages: 11