Approach for Spammer Detection in Weibo Based on Multi-View Fusion

被引:0
|
作者
Yang X. [1 ]
Liang X. [1 ]
机构
[1] School of Cyber Security and Computer, Hebei University, Baoding
关键词
Linear weighting function; Multi-view fusion; Spammer detection; Weibo;
D O I
10.12141/j.issn.1000-565X.200366
中图分类号
学科分类号
摘要
In order to detect spammers more effectively in Weibo, an approach based on multi-view fusion was proposed. First, a user representation strategy for integrating multi-view information was designed to characterize users from 3 views, namely, user behavior, social relationship and text content. In view of the deficiencies that the exis-ting approaches do not fully consider the user's fans and user's environment in social networks, new features such as fan ratio, fan average bidirectional connection rate, community-based bidirectional connection rate, community-based cluster coefficient, etc. were introduced. Then, a multi-view fusion decision model based on a linear weighting function was constructed. A linear weighting fusion was carried out based on the classification results from each view. The optimal fusion coefficient was obtained by minimizing the approximate error, and then the final classification result was obtained. The test result on the real data from Weibo show that this approach can not only effectively detect spammers, with significant improvement in precision and F1-sorce, but also exhibits greater stability especially when processing unbalanced data. It also analyzes the impact of different views on the final detection effect, and the results show that the user's social relationship view has the most significant effect. © 2020, Editorial Department, Journal of South China University of Technology. All right reserved.
引用
收藏
页码:125 / 134
页数:9
相关论文
共 24 条
  • [1] WU X, CHEN Q G, HU Y, Et al., Multi-view multi-label learning with view-specific information extraction, Proceedings of the 28th International Joint Conference on Artificial Intelligence, pp. 3884-3890, (2019)
  • [2] ZHANG Yu-xiang, SUN Yu, YANG Jia-hai, Et al., Feature importance analysis for spammer detection in Sina Weibo, Journal on Communications, 37, 8, pp. 24-33, (2016)
  • [3] LEE K, CAVERLEE J, WEBB S., Uncovering social spammers: social honeypots + machine learning, Proceedings of the 33rd International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 435-442, (2010)
  • [4] STRINGHINI G, KRUEGEL C, VIGNA G., Detecting spammers on social networks, Proceedings of the 26th Annual Computer Security Applications Confe-rence, pp. 1-9, (2010)
  • [5] CRESCI S, PIETRO R D, PETROCCHI M, Et al., Social fingerprinting: detection of spambot groups through DNA-inspired behavioral modeling, IEEE Transactions on Dependable and Secure Computing, 15, 4, pp. 561-576, (2018)
  • [6] AMLESHWARAM A, REDDY N, YADAV S, Et al., CATS: characterizing automation of Twitter spammers, Proceedings of 2013 Fifth International Confe-rence on Communication Systems and Networks(COMSNETS), pp. 1-10, (2013)
  • [7] TAN E, GUO L, CHEN S, Et al., Spammer behavior analysis and detection in user generated content on social networks, Proceedings of 2012 IEEE 32nd International Conference on Distributed Computing Systems, pp. 305-314, (2012)
  • [8] HU X, TANG J, LIU H, Et al., Social spammer detection with sentiment information, Proceedings of the Twenty-Eighth AAAI Conference on Artificial Intelligence, pp. 59-65, (2014)
  • [9] THOMAS K, GRIER C, PAXSON V, Et al., Suspended accounts in retrospect: an analysis of Twitter spam, Proceedings of the 2011 ACM SIGCOMM Conference on Internet Measurement Conference, pp. 248-258, (2011)
  • [10] BINDU P V, MISHRA R, THILAGAM P S., Disco-vering spammer communities in Twitter, Journal of Intelligent Information Systems, 51, 3, pp. 503-527, (2018)