Streaming Histogram Publication Over Weighted Sliding Windows Under Differential Privacy

被引:0
作者
Wang, Xiujun [1 ,2 ]
Mo, Lei [3 ]
Zheng, Xiao [1 ,2 ]
Dang, Zhe [4 ]
机构
[1] Anhui Univ Technol, Sch Comp Sci & Technol, Maanshan 243032, Peoples R China
[2] Inst Artificial Intelligence, Hefei Comprehens Natl Sci Ctr, Hefei 230088, Peoples R China
[3] Baosight Software Anhui Co Ltd, Maanshan 243000, Peoples R China
[4] Washington State Univ, Sch Elect Engn & Comp Sci, Pullman, WA 99164, Australia
来源
TSINGHUA SCIENCE AND TECHNOLOGY | 2024年 / 29卷 / 06期
基金
中国国家自然科学基金;
关键词
Histograms; Differential privacy; Costs; Publishing; Noise; Clustering algorithms; Reinforcement learning; differential privacy; randomized algorithm; streaming data publication; weighted sliding window; approximate statistics; data usability; computational complexity; BIG DATA; THINGS IOT; INTERNET; CHALLENGES; MANAGEMENT; FUTURE;
D O I
10.26599/TST.2023.9010083
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Continuously publishing histograms in data streams is crucial to many real-time applications, as it provides not only critical statistical information, but also reduces privacy leaking risk. As the importance of elements usually decreases over time in data streams, in this paper we model a data stream by a sequence of weighted sliding windows, and then study how to publish histograms over these windows continuously. The existing literature can hardly solve this problem in a real-time way, because they need to buffer all elements in each sliding window, resulting in high computational overhead and prohibitive storage burden. In this paper, we overcome this drawback by proposing an online algorithm denoted by Efficient Streaming Histogram Publishing (ESHP) to continuously publish histograms over weighted sliding windows. Specifically, our method first creates a novel sketching structure, called Approximate-Estimate Sketch (AESketch), to maintain the counting information of each histogram interval at every time instance; then, it creates histograms that satisfy the differential privacy requirement by smartly adding appropriate noise values into the sketching structure. Extensive experimental results and rigorous theoretical analysis demonstrate that the ESHP method can offer equivalent data utility with significantly lower computational overhead and storage costs when compared to other existing methods.
引用
收藏
页码:1674 / 1693
页数:20
相关论文
共 51 条
  • [1] Healthcare systems integration using Real Time Publish Subscribe (RTPS) middleware
    Almadani, Basem
    Saeed, Bilal
    Alroubaiy, Anas
    [J]. COMPUTERS & ELECTRICAL ENGINEERING, 2016, 50 : 67 - 78
  • [2] Trajectory Big Data Processing Based on Frequent Activity
    Belhassena, Amina
    Wang, Hongzhi
    [J]. TSINGHUA SCIENCE AND TECHNOLOGY, 2019, 24 (03) : 317 - 332
  • [3] Differential Privacy via Haar Wavelet Transform and Gaussian Mechanism for Range Query
    Chen, Dong
    Li, Yanjuan
    Chen, Jiaquan
    Bi, Hongbo
    Ding, Xiajun
    [J]. COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE, 2022, 2022
  • [4] Chuan Xu, 2009, Proceedings of the 2009 IEEE 15th International Conference on Parallel and Distributed Systems (ICPADS 2009), P886, DOI 10.1109/ICPADS.2009.64
  • [5] Differential privacy: A survey of results
    Dwork, Cynthia
    [J]. THEORY AND APPLICATIONS OF MODELS OF COMPUTATION, PROCEEDINGS, 2008, 4978 : 1 - 19
  • [6] Experimental queueing analysis with long-range dependent packet traffic
    Erramilli, A
    Narayan, O
    Willinger, W
    [J]. IEEE-ACM TRANSACTIONS ON NETWORKING, 1996, 4 (02) : 209 - 223
  • [7] Esponda F., 2006, arXiv
  • [8] An Adaptive Approach to Real-Time Aggregate Monitoring with Differential Privacy
    Fan, Liyue
    Xiong, Li
    [J]. IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2014, 26 (09) : 2094 - 2106
  • [9] Golatkar A, 2019, ADV NEUR IN, V32
  • [10] Internet of Things (IoT): A vision, architectural elements, and future directions
    Gubbi, Jayavardhana
    Buyya, Rajkumar
    Marusic, Slaven
    Palaniswami, Marimuthu
    [J]. FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2013, 29 (07): : 1645 - 1660