Mitigating Bias in Big Data for Transportation

被引:22
作者
Greg P. Griffin
Megan Mulhall
Chris Simek
William W. Riggs
机构
[1] The University of Texas at San Antonio,
[2] Michigan Economic Development Corporation,undefined
[3] Texas A&M Transportation Institute,undefined
[4] University of San Francisco,undefined
来源
Journal of Big Data Analytics in Transportation | 2020年 / 2卷 / 1期
关键词
Big data; Bias; Transportation; Safety; Interview;
D O I
10.1007/s42421-020-00013-0
中图分类号
学科分类号
摘要
Emerging big data resources and practices provide opportunities to improve transportation safety planning and outcomes. However, researchers and practitioners recognise that big data from mobile phones, social media, and on-board vehicle systems include biases in representation and accuracy, related to transportation safety statistics. This study examines both the sources of bias and approaches to mitigate them through a review of published studies and interviews with experts. Coding of qualitative data enabled topical comparisons and reliability metrics. Results identify four categories of bias and mitigation approaches that concern transportation researchers and practitioners: sampling, measurement, demographics, and aggregation. This structure for understanding and working with bias in big data supports research with practical approaches for rapidly evolving transportation data sources.
引用
收藏
页码:49 / 59
页数:10
相关论文
共 120 条
[1]  
Abdel-Aty M(2013)Geographical unit based analysis in the context of transportation safety planning Transp Res Part A Policy Pract 49 62-75
[2]  
Lee J(2019)A perspective on the challenges and opportunities for privacy-aware big transportation data J Big Data Anal Transp 1 1-23
[3]  
Siddiqui C(2017)Incorporating twitter-based human activity information in spatial analysis of crashes in urban areas Accid Anal Prev 106 358-369
[4]  
Choi K(2016)Big data and the city Built Environ 42 321-337
[5]  
Badu-Marfo G(2013)Exploring gendered cycling behaviours within a large-scale behavioural data-set Transp Plan Technol 37 83-97
[6]  
Farooq B(2016)Conflation of OpenStreetMap and mobile sports tracking data for automatic bicycle routing Trans GIS 20 848-868
[7]  
Patterson Z(2015)ScienceDirect workshop synthesis: comparing and combining survey modes Transp Res Procedia 11 108-117
[8]  
Bao J(2018)Using crowdsourced data to monitor change in spatial patterns of bicycle ridership J Transp Health 1 25-36
[9]  
Liu P(2019)Visualizing and evaluating interdependent regional traffic congestion and system resiliency, a case study using big data from probe vehicles J Big Data Anal Transp 21 7-18
[10]  
Yu H(2018)Can public transportation compete with automated and connected cars? J Public Transp 68 285-299