Explicable Machine Learning Models Using Rich Geospatial Data

被引:0
|
作者
Bramson, Aaron [1 ]
Mita, Masayoshi [1 ]
机构
[1] GA Technol Inc, AI Strategy Ctr, Tokyo, Japan
来源
2024 IEEE 48TH ANNUAL COMPUTERS, SOFTWARE, AND APPLICATIONS CONFERENCE, COMPSAC 2024 | 2024年
关键词
machine learning; geospatial data; explainable AI; evidence-based decision-making; price prediction;
D O I
10.1109/COMPSAC61105.2024.00382
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Machine learning includes a variety of analysis tools with improved predictive performance compared to traditional statistical approaches. However, that performance comes at a cost of transparency and explainability because much of the models' details are not accessible or interpretable by humans. This problem is compounded by the ability of AI to make use of any correlated data to improve predictions, and practitioners' willingness to sacrifice insight for accuracy. However, there is growing push for "explainable AI", especially in applications where people's lives, safety, or well-being are affected. Here we explore an intermediate solution suited to policy decision making: leverage the power of tree-based machine learning methods (such as LightGBM) but restrict oneself to variables with plausible causal influence on the target feature. We explore this "explicable AI" approach with an application to housing price prediction and show that using rich geospatial data can replace implicitly spatial variables (such as coordinates and neighborhood names) and achieve similar or better accuracy. And because the explanatory features are naturally interpretable, feature importance analyses provide genuine insights into contributing factors in a way that is similar to hedonic pricing models.
引用
收藏
页码:2381 / 2386
页数:6
相关论文
共 50 条
  • [1] Enhanced machine learning models development for flash flood mapping using geospatial data
    Hasnaoui, Yacine
    Tachi, Salah Eddine
    Bouguerra, Hamza
    Benmamar, Saadia
    Gilja, Gordon
    Szczepanek, Robert
    Navarro-Pedreno, Jose
    Yaseen, Zaher Mundher
    EURO-MEDITERRANEAN JOURNAL FOR ENVIRONMENTAL INTEGRATION, 2024, 9 (03) : 1087 - 1107
  • [2] Groundwater Quality Prediction and Analysis Using Machine Learning Models and Geospatial Technology
    Rammohan, Bommi
    Partheeban, Pachaivannan
    Ranganathan, Ranihemamalini
    Balaraman, Sundarambal
    SUSTAINABILITY, 2024, 16 (22)
  • [3] An ensemble framework for explainable geospatial machine learning models
    Liu, Lingbo
    INTERNATIONAL JOURNAL OF APPLIED EARTH OBSERVATION AND GEOINFORMATION, 2024, 132
  • [4] Flood vulnerability assessment of buildings using geospatial data and machine learning classifiers
    Tam, Tze Huey
    Abd Rahman, Muhammad Zulkarnain
    Harun, Sobri
    Kaoje, Ismaila Usman
    Salleh, Mohd Radhie Mohd
    Asmadi, Mohd Asraff
    ACTA GEOPHYSICA, 2025, : 2879 - 2907
  • [5] Production Machine Learning Frameworks for Geospatial Big Data
    Ntouskos, Valsamis
    Iliopoulou, Chrysa
    Karantzalos, Konstantinos
    2021 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2021, : 5972 - 5974
  • [6] Geospatial data extraction algorithm based on machine learning
    Zhu, Xiao-Long
    Xie, Zhong
    Jilin Daxue Xuebao (Gongxueban)/Journal of Jilin University (Engineering and Technology Edition), 2021, 51 (03): : 1011 - 1016
  • [7] Machine learning algorithms for analysis and modeling of GeoSpatial data
    Pozdnoukhov, Alexei
    Kanevski, Mikhail
    PROCEEDINGS OF THE IAMG '07: GEOMATHEMATICS AND GIS ANALYSIS OF RESOURCES, ENVIRONMENT AND HAZARDS, 2007, : 216 - +
  • [8] Identification and classification of landslide susceptible zone using geospatial techniques and machine learning models
    Ansar, Shameem A.
    Sudha, S.
    Francis, Suresh
    GEOCARTO INTERNATIONAL, 2022, 37 (27) : 18328 - 18355
  • [9] Analytics of Epidemiological Data using Machine Learning Models
    Barapatre, Harshita
    Jangir, Jatin
    Bajpai, Sudhanshu
    Chawla, Bhavesh
    Keswani, Gunjan
    INTERNATIONAL JOURNAL OF NEXT-GENERATION COMPUTING, 2023, 14 (01): : 255 - 262
  • [10] The relationship between multiple hazards and deprivation using open geospatial data and machine learning
    Kabiru, Priscilla
    Kuffer, Monika
    Sliuzas, Richard
    Vanhuysse, Sabine
    NATURAL HAZARDS, 2023, 119 (02) : 907 - 941