Incorporating twitter-based human activity information in spatial analysis of crashes in urban areas

被引:64
作者
Bao, Jie [1 ,2 ]
Liu, Pan [1 ,2 ]
Yu, Hao [1 ,2 ]
Xu, Chengcheng [1 ,2 ]
机构
[1] Southeast Univ, Jiangsu Key Lab Urban ITS, Si Pai Lou 2, Nanjing 210096, Jiangsu, Peoples R China
[2] Jiangsu Prov Collaborat Innovat Ctr Modern Urban, Si Pai Lou 2, Nanjing 210096, Jiangsu, Peoples R China
基金
中国国家自然科学基金; 英国工程与自然科学研究理事会;
关键词
Big data; Human activity; Twitter; Safety; Spatial analysis; HUMAN MOBILITY; SAFETY; LEVEL; INFRASTRUCTURE; HETEROGENEITY; PREDICTION; FATALITIES; REGRESSION; MODELS;
D O I
10.1016/j.aap.2017.06.012
中图分类号
TB18 [人体工程学];
学科分类号
1201 ;
摘要
The primary objective of this study was to investigate how to incorporate human activity information in spatial analysis of crashes in urban areas using Twitter check-in data. This study used the data collected from the City of Los Angeles in the United States to illustrate the procedure. The following five types of data were collected: crash data, human activity data, traditional traffic exposure variables, road network attributes and social-demographic data. A web crawler by Python was developed to collect the venue type information from the Twitter check-in data automatically. The human activities were classified into seven categories by the obtained venue types. The collected data were aggregated into 896 Traffic Analysis Zones (TAZ). Geographically weighted regression (GWR) models were developed to establish a relationship between the crash counts reported in a TAZ and various contributing factors. Comparative analyses were conducted to compare the performance of GWR models which considered traditional traffic exposure variables only, Twitter-based human activity variables only, and both traditional traffic exposure and Twitter-based human activity variables. The model specification results suggested that human activity variables significantly affected the crash counts in a TAZ. The results of comparative analyses suggested that the models which considered both traditional traffic exposure and human activity variables had the best goodness-of-fit in terms of the highest R-2 and lowest AICc values. The finding seems to confirm the benefits of incorporating human activity information in spatial analysis of crashes using Twitter check-in data.
引用
收藏
页码:358 / 369
页数:12
相关论文
共 36 条
[1]   Geographical unit based analysis in the context of transportation safety planning [J].
Abdel-Aty, Mohamed ;
Lee, Jaeyoung ;
Siddiqui, Chowdhury ;
Choi, Keechoo .
TRANSPORTATION RESEARCH PART A-POLICY AND PRACTICE, 2013, 49 :62-75
[2]   Spatial analysis of fatal and injury crashes in Pennsylvania [J].
Aguero-Valverde, J ;
Jovanis, PP .
ACCIDENT ANALYSIS AND PREVENTION, 2006, 38 (03) :618-625
[3]  
[Anonymous], 2009, National household travel survey
[4]  
[Anonymous], 2013, Bayesian Disease Mapping: Hierarchical Modeling in Spatial Epidemiology
[5]   Social Network Sites: Definition, History, and Scholarship [J].
Boyd, Danah M. ;
Ellison, Nicole B. .
JOURNAL OF COMPUTER-MEDIATED COMMUNICATION, 2007, 13 (01) :210-230
[6]   The promises of big data and small data for travel behavior (aka human mobility) analysis [J].
Chen, Cynthia ;
Ma, Jingtao ;
Susilo, Yusak ;
Liu, Yu ;
Wang, Menglin .
TRANSPORTATION RESEARCH PART C-EMERGING TECHNOLOGIES, 2016, 68 :285-299
[7]   Sensitivity of location-sharing services data: evidence from American travel pattern [J].
Chen, Zhenhua ;
Schintler, Laurie A. .
TRANSPORTATION, 2015, 42 (04) :669-682
[8]  
Cheng Z, 2011, P 5 INT C WEBL SOC M
[9]   Support vector machine in crash prediction at the level of traffic analysis zones: Assessing the spatial proximity effects [J].
Dong, Ni ;
Huang, Helai ;
Zheng, Liang .
ACCIDENT ANALYSIS AND PREVENTION, 2015, 82 :192-198
[10]   The effect of daily-activity patterns on crash involvement [J].
Elias, Wafa ;
Toledo, Tomer ;
Shiftan, Yoram .
ACCIDENT ANALYSIS AND PREVENTION, 2010, 42 (06) :1682-1688