Semi-Supervised Collaborative Learning for Social Spammer and Spam Message Detection in Microblogging

被引:7
作者
Wu, Fangzhao [1 ]
Wu, Chuhan [2 ]
Liu, Junxin [2 ]
机构
[1] Microsoft Res Asia, Beijing 100080, Peoples R China
[2] Tsinghua Univ, Elect Engn, Beijing, Peoples R China
来源
CIKM'18: PROCEEDINGS OF THE 27TH ACM INTERNATIONAL CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT | 2018年
基金
中国国家自然科学基金;
关键词
spam detection; spammer detection; social media;
D O I
10.1145/3269206.3269324
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
It is important to detect social spammers and spam messages in microblogging platforms. Existing methods usually handle the detection of social spammers and spam messages as two separate tasks using supervised learning techniques. However, labeled samples are usually scarce and manual annotation is expensive. In this paper, we propose a semi-supervised collaborative learning approach to jointly detect social spammers and spam messages in microblogging platforms. In our approach, the social spammer classifier and spam message classifier are collaboratively trained by exploiting the inherent relatedness between these tasks. In addition, unlabeled samples are incorporated into model training with the help of social contexts of users and messages. Experiments on real-world dataset show our approach can effectively improve the performance of both social spammer detection and spam message detection.
引用
收藏
页码:1791 / 1794
页数:4
相关论文
共 17 条
[1]  
Benevenuto Fabricio., 2010, CEAS
[2]  
Bennett K. P., 1998, ADV NEURAL INFORM PR, P368
[3]  
Bilge Leyla, 2009, Proceedings of the 18th international conference on World wide web, WWW '09, P551, DOI DOI 10.1145/1526709.1526784
[4]  
Chapelle O., 2009, SEMISUPERVISED LEARN, V20, P542
[5]  
Chen F., 2009, Proceeding of the 18th ACM conference on information and knowledge management, P1807, DOI [10.1145/1645953.1646235, DOI 10.1145/1645953.1646235]
[6]   SMOOTHING PROXIMAL GRADIENT METHOD FOR GENERAL STRUCTURED SPARSE REGRESSION [J].
Chen, Xi ;
Lin, Qihang ;
Kim, Seyoung ;
Carbonell, Jaime G. ;
Xing, Eric P. .
ANNALS OF APPLIED STATISTICS, 2012, 6 (02) :719-752
[7]   Robust Spammer Detection in Microblogs: Leveraging User Carefulness [J].
Fu, Hao ;
Xie, Xing ;
Rui, Yong ;
Gong, Neil Zhenqiang ;
Sun, Guangzhong ;
Chen, Enhong .
ACM TRANSACTIONS ON INTELLIGENT SYSTEMS AND TECHNOLOGY, 2017, 8 (06)
[8]   @spam: The Underground on 140 Characters or Less [J].
Grier, Chris ;
Thomas, Kurt ;
Paxson, Vern ;
Zhang, Michael .
PROCEEDINGS OF THE 17TH ACM CONFERENCE ON COMPUTER AND COMMUNICATIONS SECURITY (CCS'10), 2010, :27-37
[9]  
Hu X, 2014, AAAI CONF ARTIF INTE, P59
[10]   Leveraging Knowledge across Media for Spammer Detection in Microblogging [J].
Hu, Xia ;
Tang, Jiliang ;
Liu, Huan .
SIGIR'14: PROCEEDINGS OF THE 37TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, 2014, :547-556