A Clustering Approach for Anonymizing Distributed Data Streams

被引:0
|
作者
Mohamed, Mona A. [1 ]
Nagi, Magdy H. [1 ]
Ghanem, Sahar M. [1 ]
机构
[1] Univ Alexandria, Comp & Syst Engn Dept, Fac Engn, Alexandria, Egypt
来源
PROCEEDINGS OF 2016 11TH INTERNATIONAL CONFERENCE ON COMPUTER ENGINEERING & SYSTEMS (ICCES) | 2016年
关键词
Privacy; anonymization; data stream; clustering;
D O I
暂无
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Privacy preserving data mining have been studied widely on static data. Static algorithms are not suitable for streaming data. This imposes the study of new algorithms for privacy preserving that cope with data streams characteristics. Recently, effective anonymization algorithms have been studied on centralized data streams. In this paper we propose an approach for anonymizing distributed data streams based on clustering. First, anonymization is performed locally at each site by clustering a single stream, then local clusters are exchanged between sites through a global server to construct global clusters. The algorithm is shown to be effective when compared to a centralized algorithm and to the case where no communication is exchanged between sites. In addition, empirical results on real and synthetic data sets have shown that the proposed algorithm gives better information loss when compared to the without communication case and close results to the centralized case. Moreover, the algorithm is shown to be efficient in terms of communication and scalable with increasing number of sites.
引用
收藏
页码:9 / 16
页数:8
相关论文
共 50 条
  • [1] An Adaptive Clustering Approach for Distributed Outlier Detection in Data Streams
    Della Monaca, Andrea
    Cafaro, Massimo
    Pulimeno, Marco
    Epicoco, Italo
    19TH INTERNATIONAL SYMPOSIUM ON DISTRIBUTED COMPUTING AND ARTIFICIAL INTELLIGENCE, 2023, 583 : 86 - 99
  • [2] Distributed clustering of ubiquitous data streams
    Rodrigues, Pedro Pereira
    Gama, Joao
    WILEY INTERDISCIPLINARY REVIEWS-DATA MINING AND KNOWLEDGE DISCOVERY, 2014, 4 (01) : 38 - 54
  • [3] Clustering Distributed Sensor Data Streams
    Rodrigues, Pedro Pereira
    Gama, Joao
    Lopes, Luis
    MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES, PART II, PROCEEDINGS, 2008, 5212 : 282 - +
  • [4] Approximate clustering on distributed data streams
    Zhang, Qi
    Liu, Jinze
    Wang, Wei
    2008 IEEE 24TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING, VOLS 1-3, 2008, : 1131 - +
  • [5] CASTLE: Continuously Anonymizing Data Streams
    Cao, Jianneng
    Carminati, Barbara
    Ferrari, Elena
    Tan, Kian-Lee
    IEEE TRANSACTIONS ON DEPENDABLE AND SECURE COMPUTING, 2011, 8 (03) : 337 - 352
  • [6] Monitoring Distributed Data Streams through Node Clustering
    Barouti, Maria
    Keren, Daniel
    Kogan, Jacob
    Malinovsky, Yaakov
    MACHINE LEARNING AND DATA MINING IN PATTERN RECOGNITION, MLDM 2014, 2014, 8556 : 149 - 162
  • [7] Conquering the divide: Continuous clustering of distributed data streams
    Cormode, Graham
    Muthukrishnan, S.
    Zhuang, Wei
    2007 IEEE 23RD INTERNATIONAL CONFERENCE ON DATA ENGINEERING, VOLS 1-3, 2007, : 1011 - 1020
  • [8] Clustering Data Streams: A Complex Network Approach
    Porto, Sandy
    Quiles, Marcos G.
    COMPUTATIONAL SCIENCE AND ITS APPLICATIONS - ICCSA 2019, PT I: 19TH INTERNATIONAL CONFERENCE, SAINT PETERSBURG, RUSSIA, JULY 1-4, 2019, PROCEEDINGS, PT I, 2019, 11619 : 52 - 65
  • [9] Toward Anonymizing IoT Data Streams via Partitioning
    Otgonbayar, Ankhbayar
    Pervez, Zeeshan
    Dahal, Keshav
    PROCEEDINGS 2016 IEEE 13TH INTERNATIONAL CONFERENCE ON MOBILE AD HOC AND SENSOR SYSTEMS (MASS 2016), 2016, : 331 - 336
  • [10] Gossip-based Spectral Clustering of Distributed Data Streams
    Talistu, Matt
    Moh, Teng-Sheng
    Moh, Melody
    PROCEEDINGS OF THE 2015 INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPUTING & SIMULATION (HPCS 2015), 2015, : 325 - 333