Optimization algorithm for k-anonymization of datasets with low information loss

被引:0
作者
Keisuke Murakami
Takeaki Uno
机构
[1] Kansai University,
[2] National Institute of Informatics,undefined
来源
International Journal of Information Security | 2018年 / 17卷
关键词
Security and protection; Database models; -anonymity; Large-scale dataset; Graph problem; Optimization;
D O I
暂无
中图分类号
学科分类号
摘要
Anonymization is the modification of data to mask the correspondence between a person and sensitive information in the data. Several anonymization models such as k-anonymity have been intensively studied. Recently, a new model with less information loss than existing models was proposed; this is a type of non-homogeneous generalization. In this paper, we present an alternative anonymization algorithm that further reduces the information loss using optimization techniques. We also prove that a modified dataset is checked whether it satisfies the k-anonymity by a polynomial-time algorithm. Computational experiments were conducted and demonstrated the efficiency of our algorithm even on large datasets.
引用
收藏
页码:631 / 644
页数:13
相关论文
共 23 条
[1]  
Sacharidis D(2010)k-Anonymity in the presence of external databases IEEE Trans. Knowl. Data Eng. 22 392-403
[2]  
Mouratidis K(1986)Finding a needle in a haystack or identifying anonymous census record J. Off. Stat. 2 329-336
[3]  
Papadias D(2001)Protecting respondants identities in microdata release IEEE Trans. Knowl. Data Eng. 13 1010-1027
[4]  
Dalenius T(2007)l-Diversity: privacy beyond k-anonymity ACM Trans. Knowl. Discov. Data 1 1-52
[5]  
Samarati P(2014)Efficient maximum flow algorithms Commun. ACM 57 82-89
[6]  
Machanavajjhala A(2002)Achieving k-anonymity privacy protection using generalization and suppression Int. J. Uncertain. Fuzziness Knowl. Based Syst. 10 571-588
[7]  
Gehrke J(2012)Limiting disclosure of sensitive data in sequential releases of databases Inf. Sci. 191 98-127
[8]  
Kifer D(2015)Privacy by diversity in sequential releases of databases Inf. Sci. 298 344-372
[9]  
Goldberg AV(1997)An Efficient implementation of a scaling minimum-cost flow algorithm J. Algorithms 22 1-29
[10]  
Tarjan RE(1935)On representatives of subsets J. Lond. Math. Soc. 10 26-30