Spam review detection with Metapath-aggregated graph convolution network

被引:1
作者
Jayashree, P. [1 ]
Laila, K. [1 ]
Amuthan, Aara [1 ]
机构
[1] Anna Univ, Dept Comp Technol, MIT Campus, Chennai, Tamil Nadu, India
关键词
Spam review detection; feature sets derivation; machine learning; Metapath; graph convolution network; DECEPTIVE OPINION SPAM; FRAMEWORK;
D O I
10.3233/JIFS-223136
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The large flux of online products in today's world makes business reviews a valuable source for consumers for making sound decisions before making online purchases. Reviews are useful for readers in learning more about the product and gauge its quality. Fake reviews and reviewers form the bulk of the review corpus, making review spamming an open research challenge. These spam reviews require detection to nullify their contribution to product recommendations. In the past, researchers and communities have taken spam detection problems as a matter of serious concern. Yet, for all that, there is space for the performance of exploration on large-scale complex datasets. The work contributes towards robust feature selection with derived features that provide more details on malicious reviews and spammers. Ensemble and other standard machine learning techniques are trained and evaluated over optimal feature sets. In addition, the Metapath-based Graph Convolution Network (M-GCN) framework is proposed, which is an implicit knowledge extraction method to automatically capture the complex semantic meaning of reviews from the heterogeneous network. It makes analysis of triplet (users, reviews, and products) relationships in e-commerce sites through examination of Top-n feature sets in a mutually reinforcing manner. The proposed model is demonstrated on Yelp and Amazon benchmark datasets for evaluation of efficacy and it is shown outperforming state-of-the-art techniques with and without graph-utilization, providing an accuracy of 96% in the prediction task.
引用
收藏
页码:3005 / 3023
页数:19
相关论文
共 48 条
[41]   NetSpam: A Network-Based Spam Detection Framework for Reviews in Online Social Media [J].
Shehnepoor, Saeedreza ;
Salehi, Mostafa ;
Farahbakhsh, Reza ;
Crespi, Noel .
IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY, 2017, 12 (07) :1585-1595
[42]   A Survey of Heterogeneous Information Network Analysis [J].
Shi, Chuan ;
Li, Yitong ;
Zhang, Jiawei ;
Sun, Yizhou ;
Yu, Philip S. .
IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2017, 29 (01) :17-37
[43]   Generating behavior features for cold-start spam review detection with adversarial learning [J].
Tang, Xiaoya ;
Qian, Tieyun ;
You, Zhenni .
INFORMATION SCIENCES, 2020, 526 :274-288
[44]  
unctad.org, About us
[45]   FdGars: Fraudster Detection via Graph Convolutional Networks in Online App Review System [J].
Wang, Jianyu ;
Wen, Rui ;
Wu, Chunming ;
Huang, Yu ;
Xiong, Jian .
COMPANION OF THE WORLD WIDE WEB CONFERENCE (WWW 2019 ), 2019, :310-316
[46]  
Wang Nanxin, 2020, COMPLEXITY, V2020
[47]   Interpretable and Efficient Heterogeneous Graph Convolutional Network [J].
Yang, Yaming ;
Guan, Ziyu ;
Li, Jianxin ;
Zhao, Wei ;
Cui, Jiangtao ;
Wang, Quan .
IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2023, 35 (02) :1637-1650
[48]   Research of Deceptive Review Detection Based on Target Product Identification and Metapath Feature Weight Calculation [J].
Yuan, Ling ;
Li, Dan ;
Wei, Shikang ;
Wang, Mingli .
COMPLEXITY, 2018,