Understanding house price appreciation using multi-source big geo-data and machine learning

被引:119
|
作者
Kang, Yuhao [1 ,2 ]
Zhang, Fan [1 ]
Peng, Wenzhe [3 ]
Gao, Song [2 ]
Rao, Jinmeng [2 ]
Duarte, Fabio [1 ,4 ]
Ratti, Carlo [1 ]
机构
[1] MIT, Dept Urban Studies & Planning, Senseable City Lab, Cambridge, MA 02139 USA
[2] Univ Wisconsin, Dept Geog, Geospatial Data Sci Lab, Madison, WI 53703 USA
[3] MIT, Dept Architecture, 77 Massachusetts Ave, Cambridge, MA 02139 USA
[4] PUCPR, Urban Management Program, BR-80215910 Curitiba, Parana, Brazil
基金
中国国家自然科学基金;
关键词
House price appreciation rate; Street view images; House photos; Human mobility patterns; Geographically weighted regression; STREET VIEW; NEIGHBORHOODS; IMAGERY; MARKET;
D O I
10.1016/j.landusepol.2020.104919
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
Understanding house price appreciation benefits place-based decision makings and real estate market analyses. Although large amounts of interests have been paid in the house price modeling, limited work has focused on evaluating the price appreciation rate. In this study, we propose a data-fusion framework to examine how well house price appreciation potentials can be predicted by combining multiple data sources. We used data sets including house structural attributes, house photos, locational amenities, street view images, transportation accessibility, visitor patterns, and socioeconomic attributes of neighborhoods to enrich our understanding of the real estate appreciation and its predictive modeling. As a case study, we investigate more than 20,000 houses in the Greater Boston Area, and discuss the spatial dependency of house price appreciations, influential variables and their relationships. In detail, we extract deep features from street view images and house photos using a deep learning model, merging features from multi-source data and modeling house price appreciation using machine learning models and the geographically weighted regression at two spatial scales: fine-scale point level and aggregated neighborhood level. Results show that the house price appreciation rate can be modeled with high accuracy using the proposed framework (R-2 = 0.74 for gradient boosting machine at neighborhood-scale). We discovered that houses with low house prices and small house areas may have a higher house appreciation potential. Our results provide insights into how multi-source big geo-data can be employed in machine learning frameworks to characterize real estate price trends and help understand human settlements for policy-making.
引用
收藏
页数:11
相关论文
共 50 条
  • [41] A novel framework to predict chlorophyll-a concentrations in water bodies through multi-source big data and machine learning algorithms
    Hamed Karimian
    Jinhuang Huang
    Youliang Chen
    Zhaoru Wang
    Jinsong Huang
    Environmental Science and Pollution Research, 2023, 30 : 79402 - 79422
  • [42] Research on Medical Multi-Source Data Fusion Based on Big Data
    Hu S.
    Recent Advances in Computer Science and Communications, 2022, 15 (03) : 376 - 387
  • [43] Incorporating Multi-Source Market Sentiment and Price Data for Stock Price Prediction
    Fu, Kui
    Zhang, Yanbin
    MATHEMATICS, 2024, 12 (10)
  • [44] Multi-Source Neural Machine Translation With Missing Data
    Nishimura, Yuta
    Sudoh, Katsuhito
    Neubig, Graham
    Nakamura, Satoshi
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2020, 28 (28) : 569 - 580
  • [45] Multi-Source Neural Machine Translation with Missing Data
    Nishimura, Yuta
    Sudoh, Katsuhito
    Neubig, Graham
    Nakamura, Satoshi
    NEURAL MACHINE TRANSLATION AND GENERATION, 2018, : 92 - 99
  • [46] Multi-source data fusion using deep learning for smart refrigerators
    Zhang, Weishan
    Zhang, Yuanjie
    Zhai, Jia
    Zhao, Dehai
    Xu, Liang
    Zhou, Jiehan
    Li, Zhongwei
    Yang, Su
    COMPUTERS IN INDUSTRY, 2018, 95 : 15 - 21
  • [47] Construction of Gazetteers from Geo Big Data Using Machine Learning Technique on Hadoop
    Pradeepa, S.
    Manjula, K. R.
    PROCEEDINGS OF THE 10TH INDIACOM - 2016 3RD INTERNATIONAL CONFERENCE ON COMPUTING FOR SUSTAINABLE GLOBAL DEVELOPMENT, 2016, : 1619 - 1622
  • [48] Multi-source data ensemble for energy price trend forecasting
    Braz, Douglas Donizeti de Castilho
    dos Santos, Moises Rocha
    de Paula, Marcos Basile Saviano
    da Silva Filho, Donato
    Guarnier, Ewerton
    Alipio, Lucas Penido
    Tinos, Renato
    Carvalho, Andre C. P. L. F.
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2024, 133
  • [49] Person Localization using Machine Learning in Multi-Source Camera Surveillance System
    Nabil, Mahmoud
    Sherif, Ahmed
    Mahmoud, Mohamed
    Alsmary, Waleed
    Alsabaan, Maazen
    SOUTHEASTCON 2022, 2022, : 110 - 116
  • [50] Multi-source data integration for soil mapping using deep learning
    Wadoux, Alexandre M. J-C
    Padarian, Jose
    Minasny, Budiman
    SOIL, 2019, 5 (01) : 107 - 119