Understanding Ridesplitting Behavior with Interpretable Machine Learning Models Using Chicago Transportation Network Company Data

Cited by: 11
Authors
Abkarian, Hoseb [1]
Chen, Ying [1]
Mahmassani, Hani S. [1]
Affiliations
[1] Northwestern Univ, Transportat Ctr, Evanston, IL 60208 USA
Keywords
data and data science; artificial intelligence and advanced computing applications; machine learning (artificial intelligence); support vector machines; planning and analysis; transportation demand forecasting; ridership estimation modeling; public transportation; innovative public transportation services and technologies; information technologies (cellphone based apps); ride hailing; ridesharing; shared; transportation network companies (TNC); transformative trends in transit data
DOI
10.1177/03611981211036363
Chinese Library Classification number
TU [Building Science]
Discipline classification code
0813
Abstract
As congestion levels increase in cities, it is important to analyze people's choices among the different services provided by transportation network companies (TNCs). Using machine learning techniques in conjunction with large TNC data, this paper focuses on uncovering the complex relationships underlying ridesplitting market share. A real-world dataset provided by TNCs in Chicago is used to analyze ridesourcing trips from November 2018 to December 2019 and to understand trends in the city. Aggregated origin-destination trip-level characteristics, such as mean cost, mean time, and travel time reliability, are extracted and combined with origin-destination community-level characteristics. Three tree-based algorithms are then used to model the market share of ridesplitting trips. The most significant factors and their marginal effects on ridesplitting behavior are extracted, using partial dependence plots to interpret the machine learning model results. The results suggest that, overall, community-level factors are as important as, or more important than, trip-level characteristics. Additionally, the percentage of White people, the percentage of bachelor's degree holders, and the percentage of two-person households strongly affect ridesplitting market share. Travel time reliability and cost variability are also found to be more important than travel time and cost savings. Finally, the potential role of taxes, crime, cultural differences, and comfort in driving market share is discussed, and suggestions are presented for future research and data collection efforts.
Pages: 83-99
Page count: 17