Generating behavior features for cold-start spam review detection with adversarial learning

被引:32
作者
Tang, Xiaoya [1 ]
Qian, Tieyun [1 ]
You, Zhenni [1 ]
机构
[1] Wuhan Univ, Sch Comp Sci, Wuhan, Hubei, Peoples R China
关键词
Spam review detection; Cold-start problem; Generative adversarial network;
D O I
10.1016/j.ins.2020.03.063
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Due to the wide applications, spam detection has long been a hot research topic in both academia and industry. Existing studies show that behavior features are effective in distinguishing the spam and legitimate reviews. However, it usually takes a long time to collect such features and thus is hard to apply them to cold-start spam review detection tasks. Recent advances leveraged the neural network to encode the various types of textual, behavior, and attribute information for this task. However, the inherent problem, i.e., lack of effective behavior features for new users who post just one review, is still unsolved. In this paper, we exploit the generative adversarial network (GAN) for addressing this problem. The key idea is to generate synthetic behavior features (SBFs) for new users from their easily accessible features (EAFs). Specifically, we first select six well recognized real behavior features (RBFs) existing for regular users. We then train a GAN framework including a generator to generate SBFs from their EAFs including text, rating, and attribute features, and a discriminator to discriminate RBFs and SBFs. We design a new implementation of generator and discriminator for effective training. The trained GAN is finally applied to new users for generating synthetic behavior features. We conduct extensive experiments on two Yelp datasets. Experimental results demonstrate that our proposed framework significantly outperforms the state-of-the-art methods. (C) 2020 Elsevier Inc. All rights reserved.
引用
收藏
页码:274 / 288
页数:15
相关论文
共 48 条
[31]  
Liu Y, 2015, PROCEEDINGS OF THE TWENTY-FOURTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE (IJCAI), P1617
[32]   From action to activity: Sensor-based activity recognition [J].
Liu, Ye ;
Nie, Liqiang ;
Liu, Li ;
Rosenblum, David S. .
NEUROCOMPUTING, 2016, 181 :108-115
[33]  
Mikolov T, 2013, NIPS, V26
[34]   ON HIDDEN NODES FOR NEURAL NETS [J].
MIRCHANDANI, G ;
CAO, W .
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS, 1989, 36 (05) :661-664
[35]   Data Analytics in Ubiquitous Sensor-based Health Information Systems [J].
Mukherjee, Arijit ;
Pal, Arpan ;
Misra, Prateep .
2012 6TH INTERNATIONAL CONFERENCE ON NEXT GENERATION MOBILE APPLICATIONS, SERVICES AND TECHNOLOGIES (NGMAST), 2012, :193-198
[36]  
Mukherjee A, 2013, 19TH ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING (KDD'13), P632
[37]   Deep learning algorithms for human activity recognition using mobile and wearable sensor networks: State of the art and research challenges [J].
Nweke, Henry Friday ;
Teh, Ying Wah ;
Al-Garadi, Mohammed Ali ;
Alo, Uzoma Rita .
EXPERT SYSTEMS WITH APPLICATIONS, 2018, 105 :233-261
[38]  
Ratliff LJ, 2013, ANN ALLERTON CONF, P917, DOI 10.1109/Allerton.2013.6736623
[39]   Collective Opinion Spam Detection: Bridging Review Networks and Metadata [J].
Rayana, Shebuti ;
Akoglu, Leman .
KDD'15: PROCEEDINGS OF THE 21ST ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2015, :985-994
[40]  
Ren Y., 2016, Proceedings of COLING 2016, the 26th International Conference on Computational Linguistics: Technical Papers, P140