Generating behavior features for cold-start spam review detection with adversarial learning

Cited by: 32
Authors
Tang, Xiaoya [1]
Qian, Tieyun [1]
You, Zhenni [1]
Affiliations
[1] Wuhan Univ, Sch Comp Sci, Wuhan, Hubei, Peoples R China
Keywords
Spam review detection; Cold-start problem; Generative adversarial network
DOI
10.1016/j.ins.2020.03.063
Chinese Library Classification number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Owing to its wide range of applications, spam detection has long been an active research topic in both academia and industry. Existing studies show that behavior features are effective in distinguishing spam reviews from legitimate ones. However, such features usually take a long time to collect and are therefore hard to apply to cold-start spam review detection tasks. Recent advances leverage neural networks to encode various types of textual, behavior, and attribute information for this task. However, the inherent problem, i.e., the lack of effective behavior features for new users who have posted just one review, remains unsolved. In this paper, we exploit the generative adversarial network (GAN) to address this problem. The key idea is to generate synthetic behavior features (SBFs) for new users from their easily accessible features (EAFs). Specifically, we first select six well-recognized real behavior features (RBFs) that exist for regular users. We then train a GAN framework comprising a generator, which generates SBFs from EAFs including text, rating, and attribute features, and a discriminator, which distinguishes RBFs from SBFs. We design a new implementation of the generator and discriminator for effective training. The trained GAN is finally applied to new users to generate synthetic behavior features. We conduct extensive experiments on two Yelp datasets. Experimental results demonstrate that our proposed framework significantly outperforms the state-of-the-art methods. (C) 2020 Elsevier Inc. All rights reserved.
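
The generator/discriminator setup described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract does not specify network architectures, loss functions, or feature dimensions, so the linear-plus-sigmoid layers, the standard (non-saturating) GAN losses, the EAF dimension of 8, and all function names below are assumptions chosen only to show the data flow. Only the count of six behavior features comes from the abstract. No training step is performed; the code computes the two losses for a single forward pass.

```python
import math
import random

random.seed(0)

EAF_DIM = 8  # assumed size of an easily accessible feature (EAF) vector
SBF_DIM = 6  # the abstract mentions six behavior features


def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))


def init_matrix(rows, cols):
    """Small random weights; a stand-in for a real network's parameters."""
    return [[random.uniform(-0.1, 0.1) for _ in range(cols)] for _ in range(rows)]


def generator(eaf, weights):
    """Map an EAF vector to a synthetic behavior feature (SBF) vector in (0, 1)."""
    return [sigmoid(sum(w * x for w, x in zip(row, eaf))) for row in weights]


def discriminator(features, weights):
    """Estimated probability that a behavior vector is real (an RBF)."""
    return sigmoid(sum(w * f for w, f in zip(weights, features)))


def gan_losses(eaf, rbf, g_weights, d_weights):
    """Standard GAN losses for one regular user (assumed, not from the paper)."""
    sbf = generator(eaf, g_weights)
    p_real = discriminator(rbf, d_weights)
    p_fake = discriminator(sbf, d_weights)
    # The discriminator wants p_real near 1 and p_fake near 0.
    d_loss = -(math.log(p_real) + math.log(1.0 - p_fake))
    # The generator wants the discriminator to score its SBFs as real.
    g_loss = -math.log(p_fake)
    return sbf, d_loss, g_loss


g_weights = init_matrix(SBF_DIM, EAF_DIM)
d_weights = [random.uniform(-0.1, 0.1) for _ in range(SBF_DIM)]
eaf = [random.random() for _ in range(EAF_DIM)]  # placeholder EAFs of a regular user
rbf = [random.random() for _ in range(SBF_DIM)]  # placeholder RBFs of the same user

sbf, d_loss, g_loss = gan_losses(eaf, rbf, g_weights, d_weights)
print(len(sbf), d_loss, g_loss)
```

In a setup of this kind, training would alternate gradient updates on the two losses using regular users, for whom both EAFs and RBFs are available; at inference time only `generator` would be applied to a new user's EAFs to obtain SBFs.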
Pages: 274-288
Number of pages: 15