Determination of the influence factors on household vehicle ownership patterns in Phnom Penh using statistical and machine learning methods

被引:13
作者
Tran Vinh Ha [1 ,2 ]
Asada, Takumi [1 ]
Arimura, Mikiharu [1 ]
机构
[1] Muroran Inst Technol, Div Sustainable & Environm Engn, T 050-8585,27-1 Mizumoto Cho, Muroran, Hokkaido, Japan
[2] Hanoi Architectural Univ, Fac Urban Environm & Infrastruct Engn, Km 10 Nguyen Trai, Hanoi, Vietnam
关键词
Vehicle ownership; Phnom Penh; Features ranking; Multinomial logit model; Neural networks; Random forests; ARTIFICIAL NEURAL-NETWORKS; TRAVEL MODE CHOICE; CAR OWNERSHIP; VARIABLE IMPORTANCE; BUILT ENVIRONMENT; CROSS-VALIDATION; RANDOM FOREST; URBAN; MOTORCYCLE; PREDICTION;
D O I
10.1016/j.jtrangeo.2019.05.015
中图分类号
F [经济];
学科分类号
02 ;
摘要
Vehicle ownership patterns and their determinants play an important role in transportation policy-making. This issue has been paid even greater attention in developing countries that aspire to reach sustainable transportation development goals in the era of urbanization and globalization. In this study, the multinomial logit model, neural networks and random forest were applied to examine the features' impact level and to also predict vehicle ownership patterns in Phnom Penh city. The empirical results indicate that household income is the most powerful variable affecting motorization in Phnom Penh. Supplementation of individual trip characteristics such as total number of trips made, number of trips made for work purposes and overall travel distance all make effective contributions as classifiers. Furthermore, it is acknowledged that the machine-learning approach outperformed not only in terms of predicting accuracy, but also in dealing with unbalanced categories when compared with the statistical approach. This detection supplies the advantages of applying machine learning techniques in terms of, but not limited to, the field of vehicle ownership.
引用
收藏
页码:70 / 86
页数:17
相关论文
共 67 条
[1]   Daily activity pattern recognition by using support vector machines with multiple classes [J].
Allahviranloo, Mandieh ;
Recker, Will .
TRANSPORTATION RESEARCH PART B-METHODOLOGICAL, 2013, 58 :16-43
[2]  
[Anonymous], 2004, The Measurement of Interrater Agreement. Statistical Methods for Rates and Proportions
[3]   Empirical characterization of random forest variable importance measures [J].
Archer, Kelfie J. ;
Kirnes, Ryan V. .
COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2008, 52 (04) :2249-2260
[4]   NeuralNetTools: Visualization and Analysis Tools for Neural Networks [J].
Beck, Marcus W. .
JOURNAL OF STATISTICAL SOFTWARE, 2018, 85 (11)
[5]   Comparison of classification accuracy using Cohen's Weighted Kappa [J].
Ben-David, Arie .
EXPERT SYSTEMS WITH APPLICATIONS, 2008, 34 (02) :825-832
[6]   A comparison of two alternative behavioral choice mechanisms for household auto ownership decisions [J].
Bhat, CR ;
Pulugurta, V .
TRANSPORTATION RESEARCH PART B-METHODOLOGICAL, 1998, 32 (01) :61-75
[7]   Measuring the prediction error. A comparison of cross-validation, bootstrap and covariance penalty methods [J].
Borra, Simone ;
Di Ciaccio, Agostino .
COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2010, 54 (12) :2976-2989
[8]   Random forests [J].
Breiman, L .
MACHINE LEARNING, 2001, 45 (01) :5-32
[9]   Random forests [J].
Breiman, L .
MACHINE LEARNING, 2001, 45 (01) :5-32
[10]   Statistical modeling: The two cultures [J].
Breiman, L .
STATISTICAL SCIENCE, 2001, 16 (03) :199-215