Analysis of motorcycle accidents using association rule mining-based framework with parameter optimization and GIS technology

被引:43
作者
Jiang, Feifeng [1 ]
Yuen, Kwok Kit Richard [1 ]
Lee, Eric Wai Ming [1 ]
机构
[1] City Univ Hong Kong, Dept Architecture & Civil Engn, Hong Kong, Peoples R China
关键词
Motorcycle Accidents; Association Rule Mining (ARM); threshold determination; Accurate and Efficient Classification Based on Multiple Class-Association Rules (CMAR); Key Factors; Geographic Information System (GIS); DRIVER INJURY SEVERITY; TRAFFIC CRASHES; SELECTION; CREDITS;
D O I
10.1016/j.jsr.2020.09.004
中图分类号
TB18 [人体工程学];
学科分类号
1201 ;
摘要
Introduction: Analyzing key factors of motorcycle accidents is an effective method to reduce fatalities and improve road safety. Association Rule Mining (ARM) is an efficient data mining method to identify critical factors associated with injury severity. However, the existing studies have some limitations in applying ARM: (a) Most studies determined parameter thresholds of ARM subjectively, which lacks objectiveness and efficiency; (b) Most studies only listed rules with high parameter thresholds, while lacking in-depth analysis of multiple-item rules. Besides, the existing studies seldom conducted a spatial analysis of motorcycle accidents, which can provide intuitive suggestions for policymakers. Method: To address these limitations, this study proposes an ARM-based framework to identify critical factors related to motorcycle injury severity. A method for parameter optimization is proposed to objectively determine parameter thresholds in ARM. A method of factor extraction is proposed to identify individual key factors from 2-item rules and boosting factors from multiple-item rules. Geographic information system (GIS) is adopted to explore the spatial relationship between key factors and motorcycle injury severity. Results and conclusions: The framework is applied to a case study of motorcycle accidents in Victoria, Australia. Fifteen attributes are selected after data preprocessing. 0.03 and 0.7 are determined as the best thresholds of support and confidence in ARM. Five individual key factors and four boosting factors are identified to be related to fatal injury. Spatial analysis is conducted by GIS to present hot spots of motorcycle accidents. The proposed framework has been validated to have better performance on parameter optimization and rule analysis in ARM. Practical applications: The hot spots of motorcycle accidents related to fatal factors are presented in GIS maps. Policymakers can refer to those maps straightforwardly when decision making. This framework can be applied to various kinds of traffic accidents to improve the performance of severity analysis. (C) 2020 National Safety Council and Elsevier Ltd. All rights reserved.
引用
收藏
页码:292 / 309
页数:18
相关论文
共 52 条
[2]  
Agrawal R., 1993, SIGMOD Record, V22, P207, DOI 10.1145/170036.170072
[3]   An introduction to modern missing data analyses [J].
Baraldi, Amanda N. ;
Enders, Craig K. .
JOURNAL OF SCHOOL PSYCHOLOGY, 2010, 48 (01) :5-37
[4]   DATA-DRIVEN VERSIONS OF PEARSONS CHISQUARE TEST FOR UNIFORMITY [J].
BOGDAN, M .
JOURNAL OF STATISTICAL COMPUTATION AND SIMULATION, 1995, 52 (03) :217-237
[5]  
Cheng Jack C. P., 2013, International Journal of Engineering and Technology, V5, P438, DOI 10.7763/IJET.2013.V5.592
[6]   A non-linear case-based reasoning approach for retrieval of similar cases and selection of target credits in LEED projects [J].
Cheng, Jack C. P. ;
Ma, Lucky J. .
BUILDING AND ENVIRONMENT, 2015, 93 :349-361
[7]   A data-driven study of important climate factors on the achievement of LEED-EB credits [J].
Cheng, Jack C. P. ;
Ma, Lucky J. .
BUILDING AND ENVIRONMENT, 2015, 90 :232-244
[8]  
Das Subasish, 2018, International Journal of Transportation Science and Technology, V7, P114, DOI 10.1016/j.ijtst.2018.02.001
[9]   Secure Transmission for Downlink NOMA Visible Light Communication Networks [J].
Du, Chun ;
Zhang, Fan ;
Ma, Shuai ;
Tang, Yixiao ;
Li, Hang ;
Wang, Hongmei ;
Li, Shiyin .
IEEE ACCESS, 2019, 7 (65332-65341) :65332-65341
[10]  
Garcia S., 2016, Big Data Anal, P9, DOI 10.1186/s41044-016-0014-0