The CASE histogram: privacy-aware processing of trajectory data using aggregates

被引:5
|
作者
Fanaeepour, Maryam [1 ,2 ]
Kulik, Lars [1 ,2 ]
Tanin, Egemen [1 ,2 ]
Rubinstein, Benjamin I. P. [1 ]
机构
[1] Univ Melbourne, Dept Comp & Informat Syst, Parkville, Vic 3010, Australia
[2] Natl ICT Australia, Sydney, NSW, Australia
基金
美国国家卫生研究院;
关键词
Aggregate data; Count information; Differential privacy; Distinct counting problem; Euler histograms; Location privacy; Spatial databases; Spatial data analytics; EXPLORING SPATIAL DATASETS; LOCATION PRIVACY;
D O I
10.1007/s10707-015-0228-8
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Due to the high uptake of location-based services (LBSs), large spatio-temporal datasets of moving objects' trajectories are being created every day. An important task in spatial data analytics is to service range queries by returning trajectory counts within a queried region. The question of how to keep an individual user's data private whilst enabling spatial data analytics by third parties has become an urgent research direction. Indeed, it is increasingly becoming a concern for users. To preserve privacy we discard individual trajectories and aggregate counts over a spatial and temporal partition. However the privacy gained comes at a cost to utility: trajectories passing through multiple cells and re-entering a query region, lead to inaccurate query responses. This is known as the distinct counting problem. We propose the Connection Aware Spatial Euler (CASE) histogram to address this long-standing problem. The CASE histogram maintains the connectivity of a moving object path, but does not require the ID of an object to distinguish multiple entries into an arbitrary query region. Our approach is to process trajectories offline into aggregate counts which are sent to third parties, rather than the original trajectories. We also explore modifications of our aggregate counting approach that preserve differential privacy. Theoretically and experimentally we demonstrate that our method provides a high level of accuracy compared to the best known methods for the distinct counting problem, whilst preserving privacy. We conduct our experiments on both synthetic and real datasets over two competitive Euler histogram-based methods presented in the literature. Our methods enjoy improvements to accuracy from 10 % up to 70 % depending on trip data and query region size, with the greatest increase seen on the Microsoft T-Drive real dataset, representing a more than tripling of accuracy.
引用
收藏
页码:747 / 798
页数:52
相关论文
共 20 条
  • [1] The CASE histogram: privacy-aware processing of trajectory data using aggregates
    Maryam Fanaeepour
    Lars Kulik
    Egemen Tanin
    Benjamin I. P. Rubinstein
    GeoInformatica, 2015, 19 : 747 - 798
  • [2] A Privacy-Aware Remapping Mechanism for Location Data
    Duarte, Guilherme
    Cunha, Mariana
    Vilela, Joao P.
    39TH ANNUAL ACM SYMPOSIUM ON APPLIED COMPUTING, SAC 2024, 2024, : 1433 - 1440
  • [3] Privacy-aware task data management using TPR*-Tree for trajectory-based crowdsourcing
    Li, Yan
    Shin, Byeong-Seok
    JOURNAL OF SUPERCOMPUTING, 2018, 74 (12): : 6976 - 6987
  • [4] Privacy-aware task data management using TPR*-Tree for trajectory-based crowdsourcing
    Yan Li
    Byeong-Seok Shin
    The Journal of Supercomputing, 2018, 74 : 6976 - 6987
  • [5] Privacy-Aware Location Data Publishing
    Hu, Haibo
    Xu, Jianliang
    On, Sai Tung
    Du, Jing
    Ng, Joseph Kee-Yin
    ACM TRANSACTIONS ON DATABASE SYSTEMS, 2010, 35 (03):
  • [6] A New Privacy-Aware Model Proposal and Application on Trajectory Data Publishing
    Akin, Murat
    Canbay, Yavuz
    Sagiroglu, Seref
    JOURNAL OF POLYTECHNIC-POLITEKNIK DERGISI, 2021, 24 (03): : 1275 - 1286
  • [7] DP-TrajGAN: A privacy-aware trajectory generation model with differential privacy
    Zhang, Jing
    Huang, Qihan
    Huang, Yirui
    Ding, Qian
    Tsai, Pei-Wei
    FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2023, 142 : 25 - 40
  • [8] Privacy-Aware Time-Series Data Sharing With Deep Reinforcement Learning
    Erdemir, Ecenaz
    Dragotti, Pier Luigi
    Gunduz, Deniz
    IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY, 2021, 16 : 389 - 401
  • [9] Optimal Data Acquisition with Privacy-Aware Agents
    Cummings, Rachel
    Elzayn, Hadi
    Pountourakis, Emmanouil
    Gkatzelis, Vasilis
    Ziani, Juba
    2023 IEEE CONFERENCE ON SECURE AND TRUSTWORTHY MACHINE LEARNING, SATML, 2023, : 210 - 224
  • [10] Privacy-aware Methods for Data Sharing between Autonomous Vehicles
    Alekszejenko, Levente
    Dobrowiecki, Tadeusz
    IFAC PAPERSONLINE, 2022, 55 (14): : 7 - 13