Big data in public transportation: a review of sources and methods

Cited by: 82
Authors
Welch, Timothy F. [1]
Widita, Alyas [1]
Affiliations
[1] Georgia Inst Technol, Sch City & Reg Planning, Atlanta, GA 30332 USA
Keywords
Big data; public transportation; transport analysis; transit planning; planning methods; statistics; SMART CARD; TIME DISTRIBUTIONS; PASSENGER FLOW; DATA ANALYTICS; TRAVEL; RIDERSHIP; MODEL; TRAJECTORIES; GPS; INFORMATION;
DOI
10.1080/01441647.2019.1616849
Chinese Library Classification number
U [Transportation]
Discipline classification code
08; 0823
Abstract
The collection of big data, as an alternative to traditional resource-intensive manual data collection approaches, has become significantly more feasible over the past decade. The availability of such data, coupled with more sophisticated predictive statistical techniques, has contributed to an increase in attention towards the application of these data, particularly for transportation analysis. Within the transportation literature, there is a growing emphasis on developing sources of commonly collected public transportation data into more powerful analytical tools. A commonly held belief is that application of big data to transportation problems will yield new insights previously unattainable through traditional transportation data sets. However, there exist many ambiguities related to what constitutes big data, the ethical implications of big data collection and application, and how to best utilize the emerging data sets. The existing literature exploring big data provides no clear and consistent definition. While the collection of big data has grown and its application in both research and practice continues to expand, there is a significant disparity between methods of analysis applied to such data. This paper summarizes the recent literature on sources of big data and commonly applied methods used in its application to public transportation problems. We assess predominant big data sources, most frequently studied topics, and methodologies employed. The literature suggests smart card and automated data are the two big data sources most frequently used by researchers to conduct public transit analyses. The studies reviewed indicate that big data has largely been used to understand transit users' travel behavior and to assess public transit service quality. The techniques reported in the literature largely mirror those used with smaller data sets. 
The application of more advanced statistical methods, commonly associated with big data, has been limited to a small number of studies. In order to fully capture the value of big data, new approaches to analysis will be necessary.
Pages: 795-818
Page count: 24
Related papers
50 in total
  • [41] Biological big-data sources, problems of storage, computational issues, and applications: a comprehensive review
    Chaudhari, Jyoti Kant
    Pant, Shubham
    Jha, Richa
    Pathak, Rajesh Kumar
    Singh, Dev Bukhsh
    KNOWLEDGE AND INFORMATION SYSTEMS, 2024, 66 (06) : 3159 - 3209
  • [42] A Cyberinfrastructure for Big Data Transportation Engineering
    Md Johirul Islam
    Anuj Sharma
    Hridesh Rajan
    Journal of Big Data Analytics in Transportation, 2019, 1 (1) : 83 - 94
  • [43] Big Geospatial Data or Geospatial Big Data? A Systematic Narrative Review on the Use of Spatial Data Infrastructures for Big Geospatial Sensing Data in Public Health
    Koh, Keumseok
    Hyder, Ayaz
    Karale, Yogita
    Boulos, Maged N. Kamel
    REMOTE SENSING, 2022, 14 (13)
  • [44] A Novel Approach for Big Data Classification and Transportation in Rail Networks
    Saki, Mahdi
    Abolhasan, Mehran
    Lipman, Justin
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2020, 21 (03) : 1239 - 1249
  • [45] Big Data Analytics on Twitter A Systematic Review of Applications and Methods
    Pradyumn, Mudit
    Kapoor, Akshat
    Tabrizi, Nasseh
    BIG DATA - BIGDATA 2018, 2018, 10968 : 326 - 333
  • [46] Deep learning methods in transportation domain: a review
    Hoang Nguyen
    Le-Minh Kieu
    Wen, Tao
    Cai, Chen
    IET INTELLIGENT TRANSPORT SYSTEMS, 2018, 12 (09) : 998 - 1004
  • [47] Public transportation and sustainability: A review
    Miller, Patrick
    de Barros, Alexandre G.
    Kattan, Lina
    Wirasinghe, S. C.
    KSCE JOURNAL OF CIVIL ENGINEERING, 2016, 20 (03) : 1076 - 1083
  • [48] Feature selection methods and genomic big data: a systematic review
    Tadist, Khawla
    Najah, Said
    Nikolov, Nikola S.
    Mrabti, Fatiha
    Zahi, Azeddine
    JOURNAL OF BIG DATA, 2019, 6 (01)
  • [49] Apache Spark Methods and Techniques in Big Data-A Review
    Sahana, H. P.
    Sanjana, M. S.
    Muddasir, N. Mohammed
    Vidyashree, K. P.
    INVENTIVE COMMUNICATION AND COMPUTATIONAL TECHNOLOGIES, ICICCT 2019, 2020, 89 : 721 - 726
  • [50] Explicating the mapping between big data and knowledge management: a systematic literature review and future directions
    Goswami, Anil Kumar
    Sinha, Anamika
    Goswami, Meghna
    Kumar, Prashant
    BENCHMARKING-AN INTERNATIONAL JOURNAL, 2024, : 1224 - 1266