ACTSSD: social spammer detection based on active learning and co-training

被引:4
作者
Chen, Ailin [1 ]
Yang, Pin [1 ]
Cheng, Pengsen [1 ]
机构
[1] Sichuan Univ, Sch Cyber Sci & Engn, Chengdu, Peoples R China
关键词
Social spammer detection; Co-training; Active learning; Social network; FRAMEWORK;
D O I
10.1007/s11227-021-03966-3
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
The rumors, advertisements and malicious links are spread in social networks by social spammers, which affect users' normal access to social networks and cause security problems. Most methods aim to detect social spammers by various features, such as content features, behavior features and relationship graph features, which rely on a large-scale labeled data. However, labeled data are lacking for training in real world, and manual annotating is time-consuming and labor-intensive. To solve this problem, we propose a novel method which combines active learning algorithm with co-training algorithm to make full use of unlabeled data. In co-training, user features are divided into two views without overlap. Classifiers are trained iteratively with labeled instances and the most confident unlabeled instances with pseudo-labels. In active learning, the most representative and uncertain instances are selected and annotated with real labels to extend labeled dataset. Experimental results on the Twitter and Apontador datasets show that our method can effectively detect social spammers in the case of limited labeled data.
引用
收藏
页码:2744 / 2771
页数:28
相关论文
共 29 条
[1]   Malicious accounts: Dark of the social networks [J].
Adewole, Kayode Sakariyah ;
Anuar, Nor Badrul ;
Kamsin, Amirrudin ;
Varathan, Kasturi Dewi ;
Razak, Syed Abdul .
JOURNAL OF NETWORK AND COMPUTER APPLICATIONS, 2017, 79 :41-67
[2]  
Amleshwaram AA, 2013, INT CONF COMMUN SYST
[3]   Detecting Spammers and Content Promoters in Online Video Social Networks [J].
Benevenuto, Fabricio ;
Rodrigues, Tiago ;
Almeida, Virgilio ;
Almeida, Jussara ;
Goncalves, Marcos .
PROCEEDINGS 32ND ANNUAL INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, 2009, :620-627
[4]  
Benevenuto Fabricio., 2010, CEAS
[5]   A new direction in social network analysis: Online social network analysis problems and applications [J].
Can, Umit ;
Alatas, Bilal .
PHYSICA A-STATISTICAL MECHANICS AND ITS APPLICATIONS, 2019, 535
[6]   Semi-supervised clue fusion for spammer detection in Sina Weibo [J].
Chen, Hao ;
Liu, Jun ;
Lv, Yanzhang ;
Li, Max Haifei ;
Liu, Mengyue ;
Zheng, Qinghua .
INFORMATION FUSION, 2018, 44 :22-32
[7]  
[陈侃 Chen Kan], 2015, [通信学报, Journal on Communications], V36, P120
[8]  
[程晓涛 Cheng Xiaotao], 2015, [自动化学报, Acta Automatica Sinica], V41, P1533
[9]   Pollution, bad-mouthing, and local marketing: The underground of location-based social networks [J].
Costa, Helen ;
Merschmann, Luiz H. C. ;
Barth, Fabricio ;
Benevenuto, Fabricio .
INFORMATION SCIENCES, 2014, 279 :123-137
[10]   The Rise of Social Bots [J].
Ferrara, Emilio ;
Varol, Onur ;
Davis, Clayton ;
Menczer, Filippo ;
Flammini, Alessandro .
COMMUNICATIONS OF THE ACM, 2016, 59 (07) :96-104