Investigating macro-level hotzone identification and variable importance using big data: A random forest models approach

被引:44
作者
Jiang, Ximiao [1 ]
Abdel-Aty, Mohamed [2 ]
Hu, Jia [1 ]
Lee, Jaeyoung [2 ]
机构
[1] Fed Highway Adm, Off Operat R&D, Mclean, VA 22101 USA
[2] Univ Cent Florida, Dept Civil Environm & Construct Engn, Orlando, FL 32816 USA
关键词
Hotzone identification; Big data; Connected Vehicle; Variable importance; Random forest; Wilcoxon test; TRAFFIC ACCIDENTS; SPATIAL-ANALYSIS; INJURY SEVERITY; SAFETY ANALYSIS; ROAD CRASHES; LAND-USE; CLASSIFICATION; LEVEL; HETEROGENEITY; COLLISIONS;
D O I
10.1016/j.neucom.2015.08.097
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
As Connected Vehicle technologies begin to be deployed along roadway networks, they will be providing massive amount of data. This big data can be useful in identifying safety hazardous zones, which can be complicated and unreliable today. Without sufficient data, past studies had to focus mostly on the micro level networks. Research on macro-level hotzone identification is limited, and until this point, the contribution of various macroscopic features on the macro-level crash risks is still in dispute. This paper, with the help of massive amount of data, investigates the feasibility of using random forest for hotzone identification at macro-level- the Traffic Analysis Zone (TAZ) level. At the same time, the most influential macro-level crash risk determinants were identified by applying a series of random forest models in combination with the cross validation methods. The differences of all features between hotzones and normal TAZs were also recognized through Wilcoxon tests. Crash data of three counties in Florida during 2008 and 2009 were employed. Crash risks by different injury levels and collision types were investigated separately. Finally, the significance of various macroscopic variables was determined by different types of crash risks using variable importance analysis. The research results suggest that the distribution of road network and socio-economics are the two most important factors when proactively alleviating traffic safety issues. For developed urban areas, it is desirable to formulate specific traffic safety management strategies that accounts for zone-level socioeconomics and development of road infrastructure. For zones with a higher percentage of school enrollment, pedestrian and bicycle friendly roadway system design are most beneficial. It is also desirable to take efficient countermeasures such as law enforcement and driving school training to regulate young drivers' behavior in school zones. For areas with high minority residence, there might be a need to use awareness campaigns in multiple languages to relieve pedestrian safety issues. Finally, additional attention should be paid to improve intersection design and management during the planning and operation processes. Published by Elsevier B.V.
引用
收藏
页码:53 / 63
页数:11
相关论文
共 50 条
[21]   Macrolevel accident prediction models for evaluating safety of urban transportation systems [J].
Hadayeghi, A ;
Shalaby, AS ;
Persaud, HN .
STATISTICAL METHODS AND MODELING AND SAFETY DATA, ANALYSIS, AND EVALUATION: SAFETY AND HUMAN PERFORMANCE, 2003, (1840) :87-95
[22]   Development of planning level transportation safety tools using Geographically Weighted Poisson Regression [J].
Hadayeghi, Alireza ;
Shalaby, Amer S. ;
Persaud, Bhagwant N. .
ACCIDENT ANALYSIS AND PREVENTION, 2010, 42 (02) :676-688
[23]   Exploring precrash maneuvers using classification trees and random forests [J].
Harb, Rami ;
Yan, Xuedong ;
Radwan, Essam ;
Su, Xiaogang .
ACCIDENT ANALYSIS AND PREVENTION, 2009, 41 (01) :98-107
[24]   Analysis of circumstances and injuries in 217 pedestrian traffic fatalities [J].
Harruff, RC ;
Avery, A ;
Alter-Pandya, AS .
ACCIDENT ANALYSIS AND PREVENTION, 1998, 30 (01) :11-20
[25]   County-Level Crash Risk Analysis in Florida Bayesian Spatial Modeling [J].
Huang, Helai ;
Abdel-Aty, Mohamed A. ;
Darwiche, Ali Lotfi .
TRANSPORTATION RESEARCH RECORD, 2010, (2148) :27-37
[26]   Heterogeneity considerations in accident modeling [J].
Karlaftis, MG ;
Tarko, AP .
ACCIDENT ANALYSIS AND PREVENTION, 1998, 30 (04) :425-433
[27]  
Khondakar B., 2009, TRANSP RES BOARD ANN
[28]   Influence of land use, population, employment, and economic activity on accident's [J].
Kim, Karl ;
Brunner, I. Made ;
Yamashita, Eric Y. .
SAFETY DATA, ANALYSIS, AND EVALUATION, 2006, (1953) :56-64
[29]   SPATIAL-ANALYSIS OF HONOLULU MOTOR-VEHICLE CRASHES .2. ZONAL GENERATORS [J].
LEVINE, N ;
KIM, KE ;
NITZ, LH .
ACCIDENT ANALYSIS AND PREVENTION, 1995, 27 (05) :675-685
[30]   Death on the crosswalk - A study of pedestrian-automobile collisions in Los Angeles [J].
Loukaitou-Sideris, Anastasia ;
Liggett, Robin ;
Sung, Hyun-Gun .
JOURNAL OF PLANNING EDUCATION AND RESEARCH, 2007, 26 (03) :338-351