Using data mining techniques for bike sharing demand prediction in metropolitan city

被引:75
|
作者
Sathishkumar, V. E. [1 ]
Park, Jangwoo [1 ]
Cho, Yongyun [1 ]
机构
[1] Sunchon Natl Univ, Dept Informat & Commun Engn, Suncheon Si, South Korea
关键词
Data mining; Predictive analytics; Public bikes; Regression; Bike sharing demand; FORESTS; TRENDS;
D O I
10.1016/j.comcom.2020.02.007
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Currently Rental bikes are introduced in many urban cities for the enhancement of mobility comfort. It is important to make the rental bike available and accessible to the public at the right time as it lessens the waiting time. Eventually, providing the city with a stable supply of rental bikes becomes a major concern. The crucial part is the prediction of bike count required at each hour for the stable supply of rental bikes. A Data mining technique is employed for overcoming the hurdles for the prediction of hourly rental bike demand. This paper discusses the models for hourly rental bike demand prediction. Data used include weather information (Temperature, Humidity, Windspeed, Visibility, Dewpoint, Solar radiation, Snowfall, Rainfall), the number of bikes rented per hour and date information. The paper also explores an filtering of features approach to eliminate the parameters which are not predictive and ranks the features based on its prediction performance. Five Statistical regression models were trained with their best hyperparameters using repeated cross-validation and the performance is evaluated using a testing set (a) Linear Regression (b) Gradient Boosting Machine (c) Support Vector Machine (Radial Basis Function Kernel) (d) Boosted Trees, and (e) Extreme Gradient Boosting Trees. When all the predictors are employed, the best model Gradient Boosting Machine can give the best and highest R-2 value of 0.96 in the training set and 0.92 in the test set. Furthermore, several analyzes are carried out in Gradient Boosting Machine with different combinations of predictors to identify the most significant predictors and the relationships between them.
引用
收藏
页码:353 / 366
页数:14
相关论文
共 50 条
  • [31] A Review on Consumer Behavior Prediction using Data Mining Techniques
    Kareena
    Kapoor, Nitika
    PROCEEDINGS OF THE 2019 6TH INTERNATIONAL CONFERENCE ON COMPUTING FOR SUSTAINABLE GLOBAL DEVELOPMENT (INDIACOM), 2019, : 1089 - 1093
  • [32] Prediction of Traffic-Violation Using Data Mining Techniques
    Amiruzzaman, Md
    PROCEEDINGS OF THE FUTURE TECHNOLOGIES CONFERENCE (FTC) 2018, VOL 1, 2019, 880 : 283 - 297
  • [33] Improvement of the Prediction of Drugs Demand Using Spatial Data Mining Tools
    M. Isabel Ramos
    Juan José Cubillas
    Francisco R. Feito
    Journal of Medical Systems, 2016, 40
  • [34] Characterising and Predicting Urban Mobility Dynamics By Mining Bike Sharing System Data
    Purnama, Ida Bagus Irawan
    Bergmann, Neil
    Jurdak, Raja
    Zhao, Kun
    IEEE 12TH INT CONF UBIQUITOUS INTELLIGENCE & COMP/IEEE 12TH INT CONF ADV & TRUSTED COMP/IEEE 15TH INT CONF SCALABLE COMP & COMMUN/IEEE INT CONF CLOUD & BIG DATA COMP/IEEE INT CONF INTERNET PEOPLE AND ASSOCIATED SYMPOSIA/WORKSHOPS, 2015, : 159 - 167
  • [35] Improvement of the Prediction of Drugs Demand Using Spatial Data Mining Tools
    Isabel Ramos, M.
    Jose Cubillas, Juan
    Feito, Francisco R.
    JOURNAL OF MEDICAL SYSTEMS, 2016, 40 (01) : 1 - 9
  • [36] Demand Forecasting of Short Life Cycle Products Using Data Mining Techniques
    Afifi, Ashraf A.
    ARTIFICIAL INTELLIGENCE APPLICATIONS AND INNOVATIONS, AIAI 2020, PT I, 2020, 583 : 151 - 162
  • [37] Prediction of rate of penetration in directional drilling using data mining techniques
    Shaygan, Kaveh
    Jamshidi, Saeid
    GEOENERGY SCIENCE AND ENGINEERING, 2023, 221
  • [38] Prediction of students' performance in elective subject using data mining techniques
    Sulaiman, S.
    Shibghatullah, A. S.
    Rahman, N. A.
    PROCEEDINGS OF MECHANICAL ENGINEERING RESEARCH DAY 2017 (MERD), 2017, : 222 - 224
  • [39] Software Fault Prediction Using Data Mining Techniques on Software Metrics
    Kumar, Rakesh
    Chaturvedi, Amrita
    MACHINE LEARNING AND BIG DATA ANALYTICS (PROCEEDINGS OF INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND BIG DATA ANALYTICS (ICMLBDA) 2021), 2022, 256 : 304 - 313
  • [40] Prediction of Academic Performance of Alcoholic Students Using Data Mining Techniques
    Sasikala, T.
    Rajesh, M.
    Sreevidya, B.
    COGNITIVE INFORMATICS AND SOFT COMPUTING, 2020, 1040 : 141 - 148