Train delays prediction based on feature selection and random forest

被引:2
|
作者
Ji, Yuanyuan [1 ]
Zheng, Wei [1 ,2 ]
Dong, Hairong [3 ]
Gao, Pengfei [1 ,2 ]
机构
[1] Beijing Jiaotong Univ, Natl Res Ctr Railway Safety Assessment, Beijing 100044, Peoples R China
[2] Beijing Jiaotong Univ, Beijing Key Lab Intelligent Traff Data Secur & Pr, Beijing 100044, Peoples R China
[3] Beijing Jiaotong Univ, State Key Lab Rail Traff Control & Safety, Beijing 100044, Peoples R China
基金
中国国家自然科学基金;
关键词
D O I
10.1109/itsc45102.2020.9294653
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Although trains are more efficient and convenient than other transportation, delays often occur. Accurately predicting the delay time of trains is of great significance to both dispatchers and passengers. The method for predicting the arrival delay time of trains is based on feature selection algorithm and machine learning. First, we collect train delay cases to sort out the delay factors. In addition to internal factors, external factors such as weather and signal failure are also considered. Then, an improved max-relevance and min-redundancy method (mRMR) is used for feature selection. Finally, we apply the method of weighted random forest (wRF) to predict the delay time. The results demonstrate that the feature selection algorithm has a prominent effect on improving the accuracy of the model, and the mean square error based on the weighted random forest has an improvement potential in forecast precision.
引用
收藏
页数:6
相关论文
共 50 条
  • [1] TEHRAN AIR POLLUTANTS PREDICTION BASED ON RANDOM FOREST FEATURE SELECTION METHOD
    Shamsoddini, A.
    Aboodi, M. R.
    Karami, J.
    ISPRS INTERNATIONAL JOINT CONFERENCES OF THE 2ND GEOSPATIAL INFORMATION RESEARCH (GI RESEARCH 2017); THE 4TH SENSORS AND MODELS IN PHOTOGRAMMETRY AND REMOTE SENSING (SMPR 2017); THE 6TH EARTH OBSERVATION OF ENVIRONMENTAL CHANGES (EOEC 2017), 2017, 42-4 (W4): : 483 - 488
  • [2] Feature selection algorithm based on random forest
    Yao, Deng-Ju
    Yang, Jing
    Zhan, Xiao-Juan
    Jilin Daxue Xuebao (Gongxueban)/Journal of Jilin University (Engineering and Technology Edition), 2014, 44 (01): : 137 - 141
  • [3] Prediction of Protein Cleavage Site with Feature Selection by Random Forest
    Li, Bi-Qing
    Cai, Yu-Dong
    Feng, Kai-Yan
    Zhao, Gui-Jun
    PLOS ONE, 2012, 7 (09):
  • [4] Prediction with Random Forest Involving Sampling and Feature Selection Strategies
    Cao, Min
    Zhang, Xiaolong
    Li, Bo
    Zhao, Jiafu
    PROCEEDINGS OF THE 2018 13TH IEEE CONFERENCE ON INDUSTRIAL ELECTRONICS AND APPLICATIONS (ICIEA 2018), 2018, : 600 - 605
  • [5] Research on Feature Selection Methods based on Random Forest
    Wang, Zhuo
    TEHNICKI VJESNIK-TECHNICAL GAZETTE, 2023, 30 (02): : 623 - 633
  • [6] Software Defect Prediction using Feature Selection and Random Forest Algorithm
    Ibrahim, Dyana Rashid
    Ghnemat, Rawan
    Hudaib, Amjad
    2017 INTERNATIONAL CONFERENCE ON NEW TRENDS IN COMPUTING SCIENCES (ICTCS), 2017, : 252 - 257
  • [7] Hidden AS link prediction based on random forest feature selection and GWO-XGBoost model
    Wang, Zekang
    Yuan, Fuxiang
    Li, Ruixiang
    Zhang, Meng
    Luo, Xiangyang
    COMPUTER NETWORKS, 2025, 262
  • [8] An Innovative NOx Emissions Prediction Model Based on Random Forest Feature Selection and Evolutionary Reformer
    Meng, Xianyu
    Li, Xi
    Chen, Jialei
    Fu, Yongyan
    Zhang, Chu
    Nazir, Muhammad Shahzad
    Peng, Tian
    PROCESSES, 2025, 13 (01)
  • [9] A New Noisy Random Forest Based Method for Feature Selection
    Akhiat, Yassine
    Manzali, Youness
    Chahhou, Mohamed
    Zinedine, Ahmed
    CYBERNETICS AND INFORMATION TECHNOLOGIES, 2021, 21 (02) : 10 - 28
  • [10] Microgrid fault classification based on random forest feature selection
    Wang, Changhong
    Gao, Yanjie
    Tang, Min
    REVIEWS OF ADHESION AND ADHESIVES, 2023, 11 (02): : 220 - 237