Trip purpose prediction using travel survey data with POI information via gradient boosting decision trees

被引:2
|
作者
Zhao, De [1 ]
Zhou, Wei [1 ,2 ]
Wang, Wei [1 ]
Hua, Xuedong [1 ]
机构
[1] Southeast Univ, Sch Transportat, Jiangsu Prov Collaborat Innovat Ctr Modern Urban T, Jiangsu Key Lab Urban ITS, Nanjing, Jiangsu, Peoples R China
[2] Jiangsu Prov Collaborat Innovat Ctr Modern Urban T, Sch Transportat, Key Lab Urban ITS, 2 Southeast Univ Rd, Nanjing 211189, Jiangsu, Peoples R China
关键词
behavioural sciences; data mining; decision trees; demand forecasting; traveller information; GPS DATA; IMPUTATION; PATTERNS;
D O I
10.1049/itr2.12450
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
At present, data obtained from the Global Positioning System (GPS) is significantly valuable in mobility research. However, GPS-based data lacks include trip purpose information. Consequently, many researchers have endeavoured to predict or impute these missing attributes. Existing studies have focused on constructing more features to improve prediction accuracy, but paid less attention to the model's applicability and transferability. In this study, five trip purposes are extracted, including education, recreation, personal, shopping, and transportation, from Chengdu Household Travel Survey (HTS) data. The individual and trip characteristics that are common and can be easily derived from GPS data are carefully selected and extracted. Point of Interest (POI) data of the trip destination are also collected to enhance input characteristics. To obtain more accurate results, an ensemble learning model, Gradient Boosting Decision Trees (GBDT), is employed to predict trip purposes. grid search and cross-validation techniques are used to optimize the hyper-parameters. Empirical results show that the proposed model achieves 0.788 accuracy, which is 22.17%, 14.53%, 10.36%, and 6.77% higher than Multinominal Logit (MNL), Artificial Neural Network (ANN), Random Forest (RF), and Deep Belief Network (DBN), respectively. It is also found that although increasing trip features improve the model's accuracy, it simultaneously impairs model's transferability and generalizability. This study explores the trip purpose prediction problem using 2016 Household Travel Survey (HTS) data and POI data in Chengdu, China.image
引用
收藏
页码:269 / 289
页数:21
相关论文
共 5 条
  • [1] Varying Built Environment Contexts and Trip Chain Decisions: A Multinomial-Choice Gradient Boosting Decision Trees Analysis
    Zhang, Wenjia
    Lu, Daming
    Liu, Hongjin
    Li, Boyang
    TRAVEL BEHAVIOUR AND SOCIETY, 2024, 34
  • [2] Inclusion of genetic variants in an ensemble of gradient boosting decision trees does not improve the prediction of citalopram treatment response
    Shumake, Jason
    Mallard, Travis T.
    McGeary, John E.
    Beevers, Christopher G.
    SCIENTIFIC REPORTS, 2021, 11 (01)
  • [3] Predicting the Gas Storage Capacity in Shale Formations Using the Extreme Gradient Boosting Decision Trees Method
    Wang, Jiaheng
    Li, Nong
    Huo, Xiangyu
    Yang, Mingli
    Zhang, Li
    ENERGY TECHNOLOGY, 2024, 12 (10)
  • [4] Predicting Bus Passenger Flow and Prioritizing Influential Factors Using Multi-Source Data: Scaled Stacking Gradient Boosting Decision Trees
    Wu, Weitiao
    Xia, Yisong
    Jin, Wenzhou
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2021, 22 (04) : 2510 - 2523
  • [5] Comparison of the Prediction Model of Adolescents' Suicide Attempt Using Logistic Regression and Decision Tree: Secondary Data Analysis of the 2019 Youth Health Risk Behavior Web-Based Survey
    Lee, Yoonju
    Kim, Heejin
    Lee, Yesul
    Jeong, Hyesun
    JOURNAL OF KOREAN ACADEMY OF NURSING, 2021, 51 (01) : 40 - 53