KNOWLEDGE DISCOVERY FROM ROAD TRAFFIC ACCIDENT DATA IN ETHIOPIA: DATA QUALITY, ENSEMBLING AND TREND ANALYSIS FOR IMPROVING ROAD SAFETY

被引:5
作者
Beshah, Tibebe [1 ]
Ejigu, Dejene [1 ]
Abraham, Ajith [2 ,3 ]
Kroemer, Pavel [2 ,3 ]
Snasel, Vaclav [2 ,3 ]
机构
[1] Univ Addis Ababa, IT Doctoral Program, Addis Ababa, Ethiopia
[2] VSB Tech Univ Ostrava, Fac Elect Engn & Comp Sci, Dept Comp Sci, Ostrava 70833, Czech Republic
[3] European Ctr Excelence, IT4Innovat, Ostrava 70833, Czech Republic
关键词
Road safety; road accident; CART; Random Forest; Tree Net; data quality; INJURY SEVERITY;
D O I
10.14311/NNW.2012.22.013
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Descriptive analysis of the magnitude and situation of road safety in general and road accidents in particular is important, but understanding of data quality, factors related with dangerous situations and various interesting patterns in data is of even greater importance. Under the umbrella of information architecture research for road safety in developing countries, the objective of this machine learning experimental research is to explore data quality issues, analyze trends and predict the role of road users on possible injury risks. The research employed Tree Net, Classification and Adaptive Regression Trees (CART), Random Forest (RF) and hybrid ensemble approach. To identify relevant patterns and illustrate the performance of the techniques for the road safety domain, road accident data collected from Addis Ababa Traffic Office is subject to several analyses. Empirical results illustrate that data quality is a major problem that needs architectural guideline and the prototype models could classify accidents with promising accuracy. In addition, an ensemble technique proves to be better in terms of predictive accuracy in the domain under study.
引用
收藏
页码:215 / 244
页数:30
相关论文
共 43 条
[1]   An empirical assessment of fixed and random parameter logit models using crash- and non-crash-specific injury data [J].
Anastasopoulos, Panagiotis Ch. ;
Mannering, Fred .
ACCIDENT ANALYSIS AND PREVENTION, 2011, 43 (03) :1140-1147
[2]  
[Anonymous], 2011, SALFORD SYSTEMS DATA
[3]  
[Anonymous], 2005, Int. J. Simul.
[4]  
[Anonymous], 2011, INT J COMPUTER APPL
[5]  
Beshah T., 2011, Proceedings of the 2011 World Congress on Information and Communication Technologies (WICT), P1241, DOI 10.1109/WICT.2011.6141426
[6]  
Beshah T., 2010, AAAI Spring Symposium: Artificial Intelligence for Development, P14
[7]   Analysis of traffic injury severity: An application of non-parametric classification tree techniques [J].
Chang, Li-Yen ;
Wang, Hsiu-Wen .
ACCIDENT ANALYSIS AND PREVENTION, 2006, 38 (05) :1019-1027
[8]   Analysis and Solution of Data Quality in Data Warehouse of Chinese Materia Medica [J].
Chen, Bing ;
Wang, Beizhan ;
Weng, Xuchu ;
Hu, Xueqin .
ICCSSE 2009: PROCEEDINGS OF 2009 4TH INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE & EDUCATION, 2009, :823-+
[9]  
Daigavane Prema, 2009, 2009 2nd International Conference on Emerging Trends in Engineering and Technology (ICETET 2009), P576, DOI 10.1109/ICETET.2009.31
[10]   Analysis of traffic accident injury severity on Spanish rural highways using Bayesian networks [J].
de Ona, Juan ;
Oqab Mujalli, Randa ;
Calvo, Francisco J. .
ACCIDENT ANALYSIS AND PREVENTION, 2011, 43 (01) :402-411