A Semi-Supervised Framework for Social Spammer Detection

被引:19
作者
Li, Zhaoxing [1 ]
Zhang, Xianchao [1 ]
Shen, Hua [1 ]
Liang, Wenxin [1 ]
He, Zengyou [1 ]
机构
[1] Dalian Univ Technol, Sch Software Technol, Dalian 116621, Peoples R China
来源
ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PART II | 2015年 / 9078卷
关键词
Semi-supervised Learning; Social Spam; Social Graph;
D O I
10.1007/978-3-319-18032-8_14
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Spammers create large number of compromised or fake accounts to disseminate harmful information in social networks like Twitter. Identifying social spammers has become a challenging problem. Most of existing algorithms for social spammer detection are based on supervised learning, which needs a large amount of labeled data for training. However, labeling sufficient training set costs too much resources, which makes supervised learning impractical for social spammer detection. In this paper, we propose a semi-supervised framework for social spammer detection(SSSD), which combines the supervised classification model with a ranking scheme on the social graph. First, we train an original classifier with a small number of labeled data. Second, we propose a ranking model to propagate trust and distrust on the social graph. Third, we select confident users that are judged by the classifier and ranking scores as new training data and retrain the classifier. We repeat the all steps above until the classifier cannot be refined any more. Experimental results show that our framework can effectively detect social spammers in the condition of lacking sufficient labeled data.
引用
收藏
页码:177 / 188
页数:12
相关论文
共 21 条
[1]  
Amleshwaram AA, 2013, INT CONF COMMUN SYST
[2]  
[Anonymous], 2004, P 30 INT C VERY LARG
[3]  
[Anonymous], 2006, WORKSH MOD TRUST WEB
[4]  
[Anonymous], 2006, AIRWEB
[5]  
[Anonymous], AAAI
[6]  
[Anonymous], 2014, 28 AAAI C ART INT
[7]  
Benevenuto F., 2010, P COLL EL MESS ANT S, P75
[8]   Detecting Spammers and Content Promoters in Online Video Social Networks [J].
Benevenuto, Fabricio ;
Rodrigues, Tiago ;
Almeida, Virgilio ;
Almeida, Jussara ;
Goncalves, Marcos .
PROCEEDINGS 32ND ANNUAL INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, 2009, :620-627
[9]  
Gao H., 2010, P 17 ACM C COMP COMM, P35
[10]  
Ghosh Saptarshi, 2012, P 21 INT C WORLD WID, P61, DOI DOI 10.1145/2187836.2187846