Prediction in Traffic Accident Duration Based on Heterogeneous Ensemble Learning

被引:29
作者
Zhao, Yuexu [1 ]
Deng, Wei [1 ]
机构
[1] Hangzhou Dianzi Univ, Coll Econ, Hangzhou, Peoples R China
关键词
BAYESIAN-NETWORKS; INCIDENT; METHODOLOGY;
D O I
10.1080/08839514.2021.2018643
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Based on millions of traffic accident data in the United States, we build an accident duration prediction model based on heterogeneous ensemble learning to study the problem of accident duration prediction in the initial stage of the accident. First, we focus on the earlier stage of the accident development, and select some effective information from five aspects of traffic, location, weather, points of interest and time attribute. Then, we improve data quality by means of data cleaning, outlier processing and missing value processing. In addition, we encode category features for high-frequency category variables and extract deeper information from the limited initial information through feature extraction. A pre-processing scheme of accident duration data is established. Finally, from the perspective of model, sample and parameter diversity, we use XGBoost, LightGBM, CatBoost, stacking and elastic network to build a heterogeneous ensemble learning model to predict the accident duration. The results show that the model not only has good prediction accuracy but can synthesize multiple models to give a comprehensive degree of importance of influencing factors, and the feature importance of the model shows that the time, location, weather and relevant historical statistics of the accident are important to the accident duration.
引用
收藏
页数:24
相关论文
共 50 条
  • [31] Hazards-Based Duration Time Model with Priorities Considering Unobserved Heterogeneity Using Real-Time Traffic and Weather Big Data
    Lee, Songha
    Park, Juneyoung
    Abdel-Aty, Mohamed
    TRANSPORTATION RESEARCH RECORD, 2024, : 1695 - 1707
  • [32] Two-stage aging trajectory prediction of LFP lithium-ion battery based on transfer learning with the cycle life prediction
    Zhou, Ziyou
    Liu, Yonggang
    You, Mingxing
    Xiong, Rui
    Zhou, Xuan
    GREEN ENERGY AND INTELLIGENT TRANSPORTATION, 2022, 1 (01):
  • [33] Machine learning-based prediction of cognitive outcomes in de novo Parkinson's disease
    Harvey, Joshua
    Reijnders, Rick A.
    Cavill, Rachel
    Duits, Annelien
    Kohler, Sebastian
    Eijssen, Lars
    Rutten, Bart P. F.
    Shireby, Gemma
    Torkamani, Ali
    Creese, Byron
    Leentjens, Albert F. G.
    Lunnon, Katie
    Pishva, Ehsan
    NPJ PARKINSONS DISEASE, 2022, 8 (01)
  • [34] Development of shale gas production prediction models based on machine learning using early data
    Niu, Wente
    Lu, Jialiang
    Sun, Yuping
    ENERGY REPORTS, 2022, 8 : 1229 - 1237
  • [35] Markov-Based Time Series Modeling Framework for Traffic-Network State Prediction under Various External Conditions
    Li, Tao
    Ma, Jiaqi
    Lee, Changju
    JOURNAL OF TRANSPORTATION ENGINEERING PART A-SYSTEMS, 2020, 146 (06)
  • [36] Dual learning-based online ensemble regression approach for adaptive soft sensor modeling of nonlinear time-varying processes
    Jin, Huaiping
    Chen, Xiangguang
    Wang, Li
    Yang, Kai
    Wu, Lei
    CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS, 2016, 151 : 228 - 244
  • [37] Prediction error-based clustering approach for multiple-model learning using statistical testing
    Sivadurgaprasad, Chinta
    Sivaram, Abhishek
    Rengaswamy, Raghunathan
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2019, 77 : 125 - 135
  • [38] Prediction of hotel booking cancellations: Integration of machine learning and probability model based on interpretable feature interaction
    Chen, Shuixia
    Ngai, Eric W. T.
    Ku, Yaoyao
    Xu, Zeshui
    Gou, Xunjie
    Zhang, Chenxi
    DECISION SUPPORT SYSTEMS, 2023, 170
  • [39] A novel GDP prediction technique based on transfer learning using CO2 emission dataset
    Kumar, Sandeep
    Muhuri, Pranab K.
    APPLIED ENERGY, 2019, 253
  • [40] A Local Weighted Linear Regression (LWLR) Ensemble of Surrogate Models Based on Stacking Strategy: Application to Hydrodynamic Response Prediction for Submerged Floating Tunnel (SFT)
    Xu, Guoji
    Wei, Huan
    Wang, Jinsheng
    Chen, Xuebin
    Zhu, Bing
    APPLIED OCEAN RESEARCH, 2022, 125