Predicting and analyzing ferry transit delays using open data and machine learning

被引:0
作者
Sarhani, Malek [1 ]
Nourmohammadzadeh, Abtin [2 ]
Voss, Stefan [2 ,4 ]
EL Amrani, Mohammed [3 ]
机构
[1] Al Akhawayn Univ Ifrane, Sch Business Adm, Ave Hassan II,POB 104, Ifrane 53000, Morocco
[2] Univ Hamburg, Inst Informat Syst, Von Melle Pk 5, D-20146 Hamburg, Germany
[3] Mohammed V Univ Rabat, Fac Sci, Rabat, Morocco
[4] Pontificia Univ Catolica Valparaiso, Escuela Ingn Ind, Valparaiso, Chile
关键词
Ferry transit; Machine learning; Delay prediction; Open data; MODEL; TIME;
D O I
10.1016/j.jpubtr.2025.100124
中图分类号
U [交通运输];
学科分类号
08 ; 0823 ;
摘要
The utilization of public transport data has evolved rapidly in recent decades. Ferries, with their unique characteristics and sensitivity to weather conditions, pose significant challenges for delay prediction. Given their pivotal role in the transportation systems of numerous cities, accurately predicting ferry delays is crucial for synchronizing transit services. This paper demonstrates the value of open data for improving ferry delay predictions through machine learning, focusing on two case studies. Our approach leverages General Transit Feed Specification (GTFS) data, ridership and vessel information, and hourly weather data, combined with SHAP explainable artificial intelligence analysis to assess key delay determinants. While support vector regression and deep neural networks showed high accuracy in individual case studies, gradient boosting consistently offered the best balance between prediction accuracy and computational efficiency. Moreover, SHAP analysis reveals that operational and temporal features-such as stop sequence, trip start time, headway, and vehicle label-are the dominant drivers of delays, with weather-related factors exerting only a modest influence.
引用
收藏
页数:14
相关论文
共 42 条
[1]   Computer vision for transit travel time prediction: an end-to-end framework using roadside urban imagery [J].
Abdelhalim, Awad ;
Zhao, Jinhua .
PUBLIC TRANSPORT, 2025, 17 (01) :221-246
[2]  
Barbeau S.J., 2018, 97 ANN M TRANSP RES
[3]   Access to transit? Validating local transit accessibility measures using transit ridership [J].
Bree, Sarah ;
Fuller, Daniel ;
Diab, Ehab .
TRANSPORTATION RESEARCH PART A-POLICY AND PRACTICE, 2020, 141 :430-442
[4]   Random forests [J].
Breiman, L .
MACHINE LEARNING, 2001, 45 (01) :5-32
[5]   Feature Engineering and Selection: A Practical Approach for Predictive Models [J].
Butcher, Brandon ;
Smith, Brian J. .
AMERICAN STATISTICIAN, 2020, 74 (03) :308-309
[6]  
Cammin P., 2020, Case Studies in Public and Personal Interactive Systems, P31, DOI [10.1007/978-3-030-49757-6_3, DOI 10.1007/978-3-030-49757-6_3]
[7]  
Chondrodima E., 2022, International Journal of Information Management Data Insights, V2, DOI [10.1016/j.jjimei.2022.100086, DOI 10.1016/J.JJIMEI.2022.100086]
[8]   Smart "Predict, then Optimize" [J].
Elmachtoub, Adam N. ;
Grigas, Paul .
MANAGEMENT SCIENCE, 2022, 68 (01) :9-26
[9]   Robustness and disturbances in public transport [J].
Ge, Liping ;
Voss, Stefan ;
Xie, Lin .
PUBLIC TRANSPORT, 2022, 14 (01) :191-261
[10]   Revisiting the richness of integrated vehicle and crew scheduling [J].
Ge, Liping ;
Kliewer, Natalia ;
Nourmohammadzadeh, Abtin ;
Voss, Stefan ;
Xie, Lin .
PUBLIC TRANSPORT, 2022, 16 (3) :775-801