Generating public transport data based on population distributions for RDF benchmarking

被引:3
作者
Taelman, Ruben [1 ]
Colpaert, Pieter [1 ]
Mannens, Erik [1 ]
Verborgh, Ruben [1 ]
机构
[1] Univ Ghent, IMEC, IDLab, Technol pk Zwijnaarde 15, B-9052 Ghent, Belgium
基金
欧盟地平线“2020”;
关键词
Public Transport; dataset generator; benchmarking; RDF; linked data;
D O I
10.3233/SW-180319
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
When benchmarking RDF data management systems such as public transport route planners, system evaluation needs to happen under various realistic circumstances, which requires a wide range of datasets with different properties. Real-world datasets are almost ideal, as they offer these realistic circumstances, but they are often hard to obtain and inflexible for testing. For these reasons, synthetic dataset generators are typically preferred over real-world datasets due to their intrinsic flexibility. Unfortunately, many synthetic dataset that are generated within benchmarks are insufficiently realistic, raising questions about the generalizability of benchmark results to real-world scenarios. In order to benchmark geospatial and temporal RDF data management systems such as route planners with sufficient external validity and depth, we designed PODiGG, a highly configurable generation algorithm for synthetic public transport datasets with realistic geospatial and temporal characteristics comparable to those of their real-world variants. The algorithm is inspired by real-world public transit network design and scheduling methodologies. This article discusses the design and implementation of PODiGG and validates the properties of its generated datasets. Our findings show that the generator achieves a sufficient level of realism, based on the existing coherence metric and new metrics we introduce specifically for the public transport domain. Thereby, PODiGG provides a flexible foundation for benchmarking RDF data management systems with geospatial and temporal data.
引用
收藏
页码:305 / 328
页数:24
相关论文
共 50 条
  • [31] mStore: Schema Mining based-RDF Data Storage
    Zheng, Guopeng
    Ren, Tenglong
    Yang, Lulu
    Zhang, Xiaowang
    Feng, Zhiyong
    COMPANION OF THE WORLD WIDE WEB CONFERENCE, WWW 2023, 2023, : 168 - 171
  • [32] A Benchmarking Strategy for Delhi Transport Corporation: An Application of Data Envelopment Analysis
    Saxena, Punita
    INTERNATIONAL JOURNAL OF MATHEMATICAL ENGINEERING AND MANAGEMENT SCIENCES, 2019, 4 (01) : 232 - 244
  • [33] Process Evaluation of Public Project Management Performance Based on Benchmarking
    Yin Yilin
    Du Yaling
    2008 4TH INTERNATIONAL CONFERENCE ON WIRELESS COMMUNICATIONS, NETWORKING AND MOBILE COMPUTING, VOLS 1-31, 2008, : 7488 - +
  • [34] Development of a methodology for benchmarking public transportation organisations: a practical tool based on an industry sound methodology
    Geerlings, H
    Klementschitz, R
    Mulley, C
    JOURNAL OF CLEANER PRODUCTION, 2006, 14 (02) : 113 - 123
  • [35] Determining an efficient and precise choice set for public transport based on tracking data
    Marra, Alessio Daniele
    Corman, Francesco
    TRANSPORTATION RESEARCH PART A-POLICY AND PRACTICE, 2020, 142 : 168 - 186
  • [36] Estimating the steps made by public transport commuters using a synthetic population enriched with smart card data
    Del Rosario, Lauren
    Laffan, Shawn W.
    Pettit, Christopher J.
    JOURNAL OF TRANSPORT & HEALTH, 2022, 27
  • [37] Public Transport IC Card Data Analysis and Operation Strategy Research Based on Data Mining Technology
    Zhu Qing
    Wang Yingzhe
    Li Jiankou
    2009 INTERNATIONAL FORUM ON COMPUTER SCIENCE-TECHNOLOGY AND APPLICATIONS, VOL 3, PROCEEDINGS, 2009, : 459 - +
  • [38] Benchmarking Explanatory Models for Inertia Forecasting using Public Data of the Nordic Area
    Graham, Jemima
    Heylen, Evelyn
    Bian, Yuankai
    Teng, Fei
    2022 17TH INTERNATIONAL CONFERENCE ON PROBABILISTIC METHODS APPLIED TO POWER SYSTEMS (PMAPS), 2022,
  • [39] Using routine data for benchmarking and performance measurement of public hospitals in New Zealand
    Stevanovic, Vladimir
    Feek, Colin
    Kay, Rebecca
    BENCHMARKING-AN INTERNATIONAL JOURNAL, 2005, 12 (06) : 498 - 507
  • [40] The potential of public transport smart card data
    Bagchi, M
    White, PR
    TRANSPORT POLICY, 2005, 12 (05) : 464 - 474