A compression strategy for an efficient TSP-based microaggregation

被引:1
作者
Maya-Lopez, Armando [1 ]
Martinez-Balleste, Antoni [1 ]
Casino, Fran [1 ,2 ]
机构
[1] Univ Rovira & Virgili, Dept Comp Engn & Math, Avinguda Paisos Catalans 26, Tarragona 43007, Spain
[2] Athena Res Ctr, Informat Management Syst Inst, Artemidos 6, Maroussi 15125, Greece
关键词
Statistical disclosure control; Microaggregation; Data privacy; Travelling Salesman Problem; Data protection; k-anonymity; STATISTICAL DISCLOSURE CONTROL; DATA-ORIENTED MICROAGGREGATION; ALGORITHM;
D O I
10.1016/j.eswa.2022.118980
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The advent of decentralised systems and the continuous collection of personal data managed by public and private entities require the application of measures to guarantee the privacy of individuals. Due to the necessity to preserve both the privacy and the utility of such data, different techniques have been proposed in the literature. Microaggregation, a family of data perturbation methods, relies on the principle of k-anonymity to aggregate personal data records. While several microaggregation heuristics exist, those based on the Travelling Salesman Problem (TSP) have been shown to outperform the state of the art when considering the trade-off between privacy protection and data utility. However, TSP-based heuristics suffer from scalability issues. Intuitively, methods that may reduce the computational time of TSP-based heuristics may incur a higher information loss. Nevertheless, in this article, we propose a method that improves the performance of TSP-based heuristics and can be used in both small and large datasets effectively. Moreover, instead of focusing only on the computational time perspective, our method can preserve and sometimes reduce the information loss resulting from the microaggregation. Extensive experiments with different benchmarks show how our method is able to outperform the current state of the art, considering the trade-off between information loss and computational time.
引用
收藏
页数:9
相关论文
共 50 条
[21]   Energy Efficient Data Compression in Cloud Based IoT [J].
Al-Kadhim, Halah Mohammed ;
Al-Raweshidy, Hamed S. .
IEEE SENSORS JOURNAL, 2021, 21 (10) :12212-12219
[22]   An Efficient Scheduling Strategy for Containers Based on Kubernetes [J].
Zhang, Xurong ;
Wang, Xiaofeng ;
Liu, Yuan ;
Deng, Zhaohong .
COLLABORATIVE COMPUTING: NETWORKING, APPLICATIONS AND WORKSHARING, COLLABORATECOM 2022, PT I, 2022, 460 :326-342
[23]   An efficient JPEG-2000 based multimodal compression scheme [J].
Brahimi, Tahar ;
Khelifi, Fouad ;
Kacha, Abdellah .
MULTIMEDIA TOOLS AND APPLICATIONS, 2021, 80 (14) :21241-21260
[24]   EFFICIENT MEDICAL IMAGE COMPRESSION BASED ON INTEGER WAVELET TRANSFORM [J].
Krishnaswamy, R. ;
NirmalaDevi, S. .
2020 SIXTH INTERNATIONAL CONFERENCE ON BIO SIGNALS, IMAGES, AND INSTRUMENTATION (ICBSII), 2020,
[25]   New Efficient Fractal based Compression Method for Electrocardiogram Signals [J].
Khalaj, A. ;
Naimi, H. Miar .
2009 IEEE 22ND CANADIAN CONFERENCE ON ELECTRICAL AND COMPUTER ENGINEERING, VOLS 1 AND 2, 2009, :160-163
[26]   TBM, a transformation based method for microaggregation of large volume mixed data [J].
Mostafa Salari ;
Saeed Jalili ;
Reza Mortazavi .
Data Mining and Knowledge Discovery, 2017, 31 :65-91
[27]   Efficient EPE based Thresholding and Adaptive Coding for Wavelet based ECG Compression [J].
Pisipati, Abhishek ;
Rakshit, Manas ;
Das, Susmita .
2017 2ND IEEE INTERNATIONAL CONFERENCE ON RECENT TRENDS IN ELECTRONICS, INFORMATION & COMMUNICATION TECHNOLOGY (RTEICT), 2017, :987-991
[28]   Using the TSP Solution Strategy for Cloudlet Scheduling in Cloud Computing [J].
Nasr, Aida A. ;
El-Bahnasawy, Nirmeen A. ;
Attiya, Gamal ;
El-Sayed, Ayman .
JOURNAL OF NETWORK AND SYSTEMS MANAGEMENT, 2019, 27 (02) :366-387
[29]   Micro-SOM: A Linear-Time Multivariate Microaggregation Algorithm Based on Self-Organizing Maps [J].
Solanas, Agusti ;
Gavalda, Arnau ;
Rallo, Robert .
ARTIFICIAL NEURAL NETWORKS - ICANN 2009, PT I, 2009, 5768 :525-535
[30]   Privacy preserving dynamic data release against synonymous linkage based on microaggregation [J].
Yan, Yan ;
Eyeleko, Anselme Herman ;
Mahmood, Adnan ;
Li, Jing ;
Dong, Zhuoyue ;
Xu, Fei .
SCIENTIFIC REPORTS, 2022, 12 (01)