Investigating macro-level hotzone identification and variable importance using big data: A random forest models approach

Cited by: 44
Authors
Jiang, Ximiao [1]
Abdel-Aty, Mohamed [2]
Hu, Jia [1]
Lee, Jaeyoung [2]
Affiliations
[1] Fed Highway Adm, Off Operat R&D, Mclean, VA 22101 USA
[2] Univ Cent Florida, Dept Civil Environm & Construct Engn, Orlando, FL 32816 USA
Keywords
Hotzone identification; Big data; Connected Vehicle; Variable importance; Random forest; Wilcoxon test; TRAFFIC ACCIDENTS; SPATIAL-ANALYSIS; INJURY SEVERITY; SAFETY ANALYSIS; ROAD CRASHES; LAND-USE; CLASSIFICATION; LEVEL; HETEROGENEITY; COLLISIONS;
DOI
10.1016/j.neucom.2015.08.097
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
As Connected Vehicle technologies are deployed along roadway networks, they will provide massive amounts of data. This big data can be useful in identifying hazardous zones, a task that is complicated and unreliable today. Lacking sufficient data, past studies focused mostly on micro-level networks. Research on macro-level hotzone identification is limited, and the contribution of various macroscopic features to macro-level crash risk is still in dispute. Drawing on a massive amount of data, this paper investigates the feasibility of using random forest for hotzone identification at the macro level, specifically the Traffic Analysis Zone (TAZ) level. The most influential macro-level crash risk determinants were identified by applying a series of random forest models in combination with cross-validation methods. The differences in all features between hotzones and normal TAZs were also examined through Wilcoxon tests. Crash data from three counties in Florida for 2008 and 2009 were employed. Crash risks were investigated separately by injury level and by collision type. Finally, the significance of various macroscopic variables was determined for each type of crash risk using variable importance analysis. The results suggest that the distribution of the road network and socioeconomics are the two most important factors when proactively alleviating traffic safety issues. For developed urban areas, it is desirable to formulate specific traffic safety management strategies that account for zone-level socioeconomics and the development of road infrastructure. For zones with a higher percentage of school enrollment, pedestrian- and bicycle-friendly roadway system design is most beneficial. It is also desirable to take efficient countermeasures, such as law enforcement and driving school training, to regulate young drivers' behavior in school zones. For areas with high minority residence, there might be a need for awareness campaigns in multiple languages to relieve pedestrian safety issues. Finally, additional attention should be paid to improving intersection design and management during the planning and operation processes. Published by Elsevier B.V.
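The abstract does not say which Wilcoxon variant was used. Because hotzones and normal TAZs form two independent groups, the sketch below assumes the two-sample rank-sum form with a normal approximation. The function name `rank_sum_test` and the intersection-density values are hypothetical illustrations, not data or code from the paper.

```python
import math


def rank_sum_test(x, y):
    """Two-sided Wilcoxon rank-sum test (normal approximation, tie-corrected).

    Returns (W, z, p), where W is the rank sum of sample x.
    """
    # Pool both samples, remembering which group each value came from.
    pooled = sorted((v, g) for g, sample in enumerate((x, y)) for v in sample)
    n1, n2 = len(x), len(y)
    n = n1 + n2

    ranks = [0.0] * n
    tie_term = 0  # sum of (t^3 - t) over groups of tied values
    i = 0
    while i < n:
        j = i
        while j + 1 < n and pooled[j + 1][0] == pooled[i][0]:
            j += 1
        avg_rank = (i + j) / 2 + 1  # mean of the 1-based ranks i+1 .. j+1
        for k in range(i, j + 1):
            ranks[k] = avg_rank
        t = j - i + 1
        tie_term += t ** 3 - t
        i = j + 1

    w = sum(r for r, (_, g) in zip(ranks, pooled) if g == 0)
    mean_w = n1 * (n + 1) / 2
    var_w = n1 * n2 / 12 * ((n + 1) - tie_term / (n * (n - 1)))
    z = (w - mean_w) / math.sqrt(var_w)
    p = math.erfc(abs(z) / math.sqrt(2))  # two-sided p-value
    return w, z, p


# Hypothetical intersection densities (per square mile) for two TAZ groups.
hotzones = [12.1, 9.8, 14.3, 11.0, 13.5, 10.7]
normal_zones = [6.2, 7.9, 5.4, 8.8, 6.9, 7.1]

w, z, p = rank_sum_test(hotzones, normal_zones)
print(f"W = {w}, z = {z:.3f}, p = {p:.4f}")
```

A small p-value here would indicate that the feature's distribution differs between the two groups. With samples this small an exact test would normally be preferred over the normal approximation.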
Pages: 53-63
Page count: 11