Random Walk based Fake Account Detection in Online Social Networks

被引:84
作者
Jia, Jinyuan [1 ]
Wang, Binghui [1 ]
Gong, Neil Zhenqiang [1 ]
机构
[1] Iowa State Univ, ECE Dept, Ames, IA 50011 USA
来源
2017 47TH ANNUAL IEEE/IFIP INTERNATIONAL CONFERENCE ON DEPENDABLE SYSTEMS AND NETWORKS (DSN) | 2017年
关键词
D O I
10.1109/DSN.2017.55
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Online social networks are known to be vulnerable to the so-called Sybil attack, in which an attacker maintains massive fake accounts (also called Sybils) and uses them to perform various malicious activities. Therefore, Sybil detection is a fundamental security research problem in online social networks. Random walk based methods, which leverage the structure of an online social network to distribute reputation scores for users, have been demonstrated to be promising in certain real-world online social networks. In particular, random walk based methods have three desired features: they can have theoretically guaranteed performance for online social networks that have the fast-mixing property, they are accurate when the social network has strong homophily property, and they can be scalable to large-scale online social networks. However, existing random walk based methods suffer from several key limitations: 1) they can only leverage either labeled benign users or labeled Sybils, but not both, 2) they have limited detection accuracy for weak-homophily social networks, and 3) they are not robust to label noise in the training dataset. In this work, we propose a new random walk based Sybil detection method called SybilWalk. SybilWalk addresses the limitations of existing random walk based methods while maintaining their desired features. We perform both theoretical and empirical evaluations to compare SybilWalk with previous random walk based methods. Theoretically, for online social networks with the fast-mixing property, SybilWalk has a tighter asymptotical bound on the number of Sybils that are falsely accepted into the social network than all existing random walk based methods. Empirically, we compare SybilWalk with previous random walk based methods using both social networks with synthesized Sybils and a large-scale Twitter dataset with real Sybils. Our empirical results demonstrate that 1) SybilWalk is substantially more accurate than existing random walk based methods for weak-homophily social networks, 2) SybilWalk is substantially more robust to label noise than existing random walk based methods, and 3) SybilWalk is as scalable as the most efficient existing random walk based methods. In particular, on the Twitter dataset, SybilWalk achieves a false positive rate of 1.3% and a false negative rate of 17.3%.
引用
收藏
页码:273 / 284
页数:12
相关论文
共 40 条
[1]  
Alex Hai Wang, 2010, INT C SEC CRYPT SECR
[2]  
Alvisi Lorenzo, 2013, IEEE S SEC PRIV S P
[3]  
[Anonymous], 2016, Hacking Election
[4]  
[Anonymous], INT C MACH LEARN ICM
[5]  
[Anonymous], 2016, Hacking Financial Market
[6]  
[Anonymous], 2009, NDSS
[7]   Emergence of scaling in random networks [J].
Barabási, AL ;
Albert, R .
SCIENCE, 1999, 286 (5439) :509-512
[8]  
Benevenuto Fabricio., 2010, CEAS
[9]  
Boshmaf Yazan, 2014, NETW DISTR SYST SEC
[10]  
Chao Yang, 2012, WORLD WIDE WEB WWW