Machine Learning Travel Mode Choices: Comparing the Performance of an Extreme Gradient Boosting Model with a Multinomial Logit Model

Cited by: 140
Authors
Wang, Fangru [1]
Ross, Catherine L. [2]
Affiliations
[1] Georgia Inst Technol, Sch City & Reg Planning, Atlanta, GA 30332 USA
[2] Georgia Inst Technol, Sch City & Reg Planning & Civil & Environm Engn, Atlanta, GA 30332 USA
Keywords
NEURAL-NETWORKS; URBAN FORM; HOUSEHOLD; COMPLEXITY; TIME
DOI
10.1177/0361198118773556
Chinese Library Classification
TU [Architectural Science]
Discipline classification code
0813
Abstract
The multinomial logit (MNL) model and its variations have dominated travel mode choice modeling for decades. The MNL model's advantages include its elegant closed-form mathematical structure and its interpretable estimation results, which are grounded in random utility theory; its main limitation is its strict statistical assumptions. Recent computational advances have made it easier to apply machine learning models to travel behavior analysis, though research in this field is not yet thorough or conclusive. In this paper, we explore the application of the extreme gradient boosting (XGB) model to travel mode choice modeling and compare the results with those of an MNL model, using the Delaware Valley 2012 regional household travel survey data. The XGB model is an ensemble method based on the decision-tree algorithm, and it has recently received a great deal of attention and use because of its strong performance in machine learning tasks. The modeling and prediction results of the XGB and MNL models are compared by examining their multi-class predictive errors. We found that the XGB model has higher overall prediction accuracy than the MNL model, especially when the dataset is not extremely unbalanced. The MNL model has great explanatory power and also displays strong consistency between training and testing errors. Multiple trip characteristics, socio-demographic traits, and built-environment variables are found to be significantly associated with people's mode choices in the region, but mode-specific travel time is found to be the most decisive factor for mode choice.
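
The two ideas the abstract relies on, the closed-form MNL choice probability and the comparison of multi-class predictive errors, can be illustrated with a minimal Python sketch. It is not the authors' implementation: the utility values, mode labels, and the two prediction lists are hypothetical, and the per-class error used here (the share of trips of each true mode that were misclassified) is one common definition that the paper may or may not share.

```python
import math
from collections import Counter


def mnl_probabilities(utilities):
    """Closed-form MNL probabilities: P_i = exp(V_i) / sum_j exp(V_j)."""
    # Subtract the maximum utility before exponentiating for numerical stability;
    # this leaves the resulting probabilities unchanged.
    v_max = max(utilities.values())
    exps = {mode: math.exp(v - v_max) for mode, v in utilities.items()}
    total = sum(exps.values())
    return {mode: e / total for mode, e in exps.items()}


def per_class_error(y_true, y_pred):
    """Return (misclassification share per true mode, overall error rate)."""
    totals = Counter(y_true)
    wrong = Counter(t for t, p in zip(y_true, y_pred) if t != p)
    errors = {mode: wrong[mode] / totals[mode] for mode in totals}
    overall = sum(wrong.values()) / len(y_true)
    return errors, overall


# Hypothetical systematic utilities for one trip (not estimates from the paper).
utilities = {"car": -0.5, "transit": -1.2, "walk": -2.0}
probs = mnl_probabilities(utilities)

# Hypothetical observed modes and predictions from two models, "a" and "b".
y_true = ["car", "car", "car", "transit", "transit", "walk"]
y_pred_a = ["car", "car", "car", "car", "transit", "car"]
y_pred_b = ["car", "car", "transit", "transit", "transit", "walk"]

errors_a, overall_a = per_class_error(y_true, y_pred_a)
errors_b, overall_b = per_class_error(y_true, y_pred_b)

print(probs)
print("model a:", errors_a, overall_a)
print("model b:", errors_b, overall_b)
```

Reporting errors per mode rather than a single accuracy figure matters when the mode shares are unbalanced, because a model that always predicts the dominant mode can look accurate overall while failing entirely on the rare modes.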
Pages: 35-45
Page count: 11