Examining nonlinearity in population inflow estimation using big data: An empirical comparison of explainable machine learning models

被引:26
|
作者
Hu, Songhua [1 ]
Xiong, Chenfeng [2 ]
Chen, Peng [3 ]
Schonfeld, Paul [1 ]
机构
[1] Univ Maryland, Dept Civil & Environm Engn, College Pk, MD 20742 USA
[2] Villanova Univ, Dept Civil & Environm Engn, Villanova, PA 19085 USA
[3] Univ S Florida, Sch Publ Affairs, Tampa, FL 33620 USA
关键词
Explainable machine learning; Interpretability; Nonlinearity; Mobile device location data; Travel demand; BUILT ENVIRONMENT; TRAVEL; BEHAVIOR; CHOICE;
D O I
10.1016/j.tra.2023.103743
中图分类号
F [经济];
学科分类号
02 ;
摘要
Mobile device location data (MDLD) contain population-representative, fine-grained travel de-mand information, facilitating opportunities to validate established relations between travel de-mand and underlying factors from a big data perspective. Using the nationwide census block group (CBG)-level population inflow derived from MDLD as the proxy of travel demand, this study examines its relations with various factors including socioeconomics, demographics, land use, and CBG attributes. A host of tree-based machine learning (ML) models and interpretation techniques (feature importance, partial dependence plot (PDP), accumulated local effect (ALE), SHapley Additive exPlanations (SHAP)) are extensively compared to determine the best model architecture and justify interpretation robustness. Empirical results show that: 1) Boosting trees perform the best among all models, followed by bagging trees, single trees, and linear regressions. (2) Feature importance holds consistently among different tree-based models but is influenced by measures of importance and hyperparameter settings. 3) Pronounced nonlinearities, threshold effects, and interaction effects are observed in relations among population inflow and most of its determinants. 4) Compared with PDP, ALE and SHAP plots are more reliable in the presence of outliers, feature dependency, and local heterogeneity. Taken together, techniques introduced in this study can either be integrated into customary travel demand models to enhance model ac-curacy or serve as interpretation tools that offer a comprehensive understanding of intricate relations.
引用
收藏
页数:20
相关论文
共 43 条
  • [1] Advancing subseasonal reservoir inflow forecasts using an explainable machine learning method
    Fan, Ming
    Liu, Siyan
    Lu, Dan
    JOURNAL OF HYDROLOGY-REGIONAL STUDIES, 2023, 50
  • [2] Explainable machine learning models for corn yield prediction using UAV multispectral data☆
    Kumar, Chandan
    Dhillon, Jagman
    Huang, Yanbo
    Reddy, Krishna
    COMPUTERS AND ELECTRONICS IN AGRICULTURE, 2025, 231
  • [3] Comparison of traffic accident injury severity prediction models with explainable machine learning
    Cicek, Elif
    Akin, Murat
    Uysal, Furkan
    Topcu Aytas, ReyhanMerve
    TRANSPORTATION LETTERS-THE INTERNATIONAL JOURNAL OF TRANSPORTATION RESEARCH, 2023, 15 (09): : 1043 - 1054
  • [4] Understanding predictions of drug profiles using explainable machine learning models
    Konig, Caroline
    Vellido, Alfredo
    BIODATA MINING, 2024, 17 (01):
  • [5] Evaluating fire resistance of timber columns using explainable machine learning models
    Esteghamati, Mohsen Zaker
    Gernay, Thomas
    Banerji, Srishti
    ENGINEERING STRUCTURES, 2023, 296
  • [6] Enhanced SpO2 estimation using explainable machine learning and neck photoplethysmography
    Zhong, Yuhao
    Jatav, Ashish
    Afrin, Kahkashan
    Shivaram, Tejaswini
    Bukkapatnam, Satish T. S.
    ARTIFICIAL INTELLIGENCE IN MEDICINE, 2023, 145
  • [7] Comparison and use of explainable machine learning-based survival models for heart failure patients
    Shi, Tao
    Yang, Jianping
    Zhang, Ningli
    Rong, Wei
    Gao, Lusha
    Xia, Ping
    Zou, Jie
    Zhu, Na
    Yang, Fazhi
    Chen, Lixing
    DIGITAL HEALTH, 2024, 10
  • [8] Generation of Automatic Data-Driven Feedback to Students Using Explainable Machine Learning
    Afzaal, Muhammad
    Nouri, Jalal
    Zia, Aayesha
    Papapetrou, Panagiotis
    Fors, Uno
    Wu, Yongchao
    Li, Xiu
    Weegar, Rebecka
    ARTIFICIAL INTELLIGENCE IN EDUCATION (AIED 2021), PT II, 2021, 12749 : 37 - 42
  • [9] Shedding Light on the Black Box: Integrating Prediction Models and Explainability Using Explainable Machine Learning
    Zhang, Yucheng
    Zheng, Yuyan
    Wang, Dan
    Gu, Xiaowei
    Zyphur, Michael J.
    Xiao, Lin
    Liao, Shudi
    Deng, Yangyang
    ORGANIZATIONAL RESEARCH METHODS, 2025,
  • [10] Big Data Mining and Classification of Intelligent Material Science Data Using Machine Learning
    Chittam, Swetha
    Gokaraju, Balakrishna
    Xu, Zhigang
    Sankar, Jagannathan
    Roy, Kaushik
    APPLIED SCIENCES-BASEL, 2021, 11 (18):