MAGE: A semantics retaining K-anonymization method for mixed data

被引:11
作者
Han, Jianmin [1 ]
Yu, Juan [2 ]
Mo, Yuchang [1 ]
Lu, Jianfeng [1 ]
Liu, Huawen [1 ]
机构
[1] Zhejiang Normal Univ, Dept Comp Sci & Technol, Jinhua 321004, Peoples R China
[2] Fudan Univ, Dept Comp Sci & Technol, Shanghai 200433, Peoples R China
基金
中国国家自然科学基金;
关键词
K-anonymity; Generalization; Microaggregation; Privacy preservation; ALGORITHM; ANONYMITY; MICROAGGREGATION;
D O I
10.1016/j.knosys.2013.10.009
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
K-anonymity is a fine approach to protecting privacy in the release of microdata for data mining. Microaggregation and generalization are two typical methods to implement k-anonymity. But both of them have some defects on anonymizing mixed microdata. To address the problem, we propose a novel anonymization method, named MAGE, which can retain more semantics than generalization and microaggregation in dealing with mixed microdata. The idea of MAGE is to combine the mean vector of numerical data with the generalization values of categorical data as a clustering centroid and to use it as incarnation of the tuples in the corresponding cluster. We also propose an efficient TSCKA algorithm to anonymize mixed data. Experimental results show that MAGE can anonymize mixed microdata effectively and the TSCKA algorithm can achieve better trade-off between data quality and algorithm efficiency comparing with two well-known anonymization algorithms, Incognito and KACA. (C) 2013 Elsevier B.V. All rights reserved.
引用
收藏
页码:75 / 86
页数:12
相关论文
共 34 条
[1]  
[Anonymous], 2006, P 32 INT C VER LARG
[2]  
[Anonymous], 2005, P 2005 ACM SIGMOD IN
[3]  
[Anonymous], 2005, VLDB, DOI DOI 10.5555/1083592.1083696
[4]  
Byun JW, 2006, LECT NOTES COMPUT SC, V4165, P48
[5]  
Cen Ting-ting, 2008, 2008 International Conference on Computer Science and Software Engineering (CSSE 2008), P1053, DOI 10.1109/CSSE.2008.1005
[6]   TFRP: An efficient micro aggregation algorithm for statistical disclosure control [J].
Chang, Chin-Chen ;
Li, Yu-Chiang ;
Huang, Wen-Hung .
JOURNAL OF SYSTEMS AND SOFTWARE, 2007, 80 (11) :1866-1878
[7]   Ordinal, continuous and heterogeneous k-anonymity through microaggregation [J].
Domingo-Ferrer, J ;
Torra, V .
DATA MINING AND KNOWLEDGE DISCOVERY, 2005, 11 (02) :195-212
[8]   Practical data-oriented microaggregation for statistical disclosure control [J].
Domingo-Ferrer, J ;
Mateo-Sanz, JM .
IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2002, 14 (01) :189-201
[9]   A polynomial-time approximation to optimal multivariate micro aggregation [J].
Domingo-Ferrer, Josep ;
Sebe, Francesc ;
Solanas, Agusti .
COMPUTERS & MATHEMATICS WITH APPLICATIONS, 2008, 55 (04) :714-732
[10]   Privacy-Preserving Data Publishing: A Survey of Recent Developments [J].
Fung, Benjamin C. M. ;
Wang, Ke ;
Chen, Rui ;
Yu, Philip S. .
ACM COMPUTING SURVEYS, 2010, 42 (04)