Combined fuzzy clustering and firefly algorithm for privacy preserving in social networks

被引:53
作者
Langari, Rohulla Kosari [1 ]
Sardar, Soheila [2 ]
Mousavi, Seyed Abdollah Amin [3 ]
Radfar, Reza [4 ]
机构
[1] Islamic Azad Univ, Fac Management, Dept Informat Technol Management, Tehran North Branch, Tehran, Iran
[2] Islamic Azad Univ, Fac Management, Dept Ind Management, Tehran North Branch, Tehran, Iran
[3] Islamic Azad Univ, Fac Management, Dept Informat Technol Management, Sci & Res Branch, Tehran, Iran
[4] Islamic Azad Univ, Fac Management, Dept Technol Management, Sci & Res Branch, Tehran, Iran
关键词
Firefly algorithm; Fuzzy clustering; K-anonymity; Privacy preserving; Social networks; K-ANONYMITY; MODEL;
D O I
10.1016/j.eswa.2019.112968
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In recent years, an explosive growth of social networks has been made publicly available for understanding the behavior of users and data mining purposes. The main challenge in sharing the social network databases is protecting public released data from individual identification. The most common privacy preserving technique is anonymizing data by removing or changing some information, while the anonymized data should retain as much information as possible of the original data. K-anonymity and its extensions (e.g., L-diversity and T-closeness) have widely been used for data anonymization. The main drawback of the existing anonymity techniques is the lack of protection against attribute/link disclosure and similarity attacks. Moreover, they suffer from high amount of information loss in the released database. In order to overcome these drawbacks, this paper proposes a combined anonymizing algorithm based On K-member Fuzzy Clustering and Firefly Algorithm (KFCFA) to protect the anonymized database against identity disclosure, attribute disclosure, link disclosure, and similarity attacks, and significantly minimize the information loss. In KFCFA, at first, a modified K-member version of fuzzy c-means is utilized to create balanced clusters with at least K members in each cluster. Then, firefly algorithm is performed for further optimizing the primary clusters and anonymizing the network graph and data. To achieve this purpose, a constrained multi-objective function is introduced to simultaneously minimize the clustering error rate and the generated information loss, while satisfying the defined anonymity constraints. The proposed methodology can be utilized for both network graph structures and micro data. Simulation results over four social network databases from Facebook, Google+, Twitter and YouTube demonstrate the efficiency of the proposed KFCFA algorithm to minimize the information loss of the published data and graph, while satisfying K-anonymity, L-diversity and T-closeness conditions. (C) 2019 Elsevier Ltd. All rights reserved.
引用
收藏
页数:15
相关论文
共 30 条
[1]  
[Anonymous], 2015, P 10 ACM S INF COMP
[2]  
[Anonymous], 0719 U MASS
[3]  
[Anonymous], 2008, P 2008 SIAM INT C DA
[4]   Identity obfuscation in graphs through the information theoretic lens [J].
Bonchi, Francesco ;
Gionis, Aristides ;
Tassa, Tamir .
INFORMATION SCIENCES, 2014, 275 :232-256
[5]  
Byun JW, 2007, LECT NOTES COMPUT SC, V4443, P188
[6]   A survey of graph-modification techniques for privacy-preserving on networks [J].
Casas-Roma, Jordi ;
Herrera-Joancomarti, Jordi ;
Torra, Vicenc .
ARTIFICIAL INTELLIGENCE REVIEW, 2017, 47 (03) :341-366
[7]  
Casas-Roma J, 2014, LECT NOTES ARTIF INT, V8825, P204, DOI 10.1007/978-3-319-12054-6_18
[8]  
Hartung S, 2014, LECT NOTES COMPUT SC, V8504, P376, DOI 10.1007/978-3-319-07959-2_32
[9]  
Honda K., 2012, PROC IEEE INT C FUZZ, P1
[10]   A dual privacy decision model for online social networks [J].
James, Tabitha L. ;
Warkentin, Merrill ;
Collignon, Stephane E. .
INFORMATION & MANAGEMENT, 2015, 52 (08) :893-908