Spam review detection with Metapath-aggregated graph convolution network

Cited by: 1
Authors
Jayashree, P. [1]
Laila, K. [1]
Amuthan, Aara [1]
Affiliations
[1] Anna Univ, Dept Comp Technol, MIT Campus, Chennai, Tamil Nadu, India
Keywords
Spam review detection; feature sets derivation; machine learning; Metapath; graph convolution network; deceptive opinion spam; framework
DOI
10.3233/JIFS-223136
Chinese Library Classification
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
The sheer volume of products sold online makes business reviews a valuable resource for consumers who want to make sound decisions before purchasing. Reviews help readers learn more about a product and gauge its quality. Fake reviews and fake reviewers form the bulk of the review corpus, which makes review spamming an open research challenge. Spam reviews must be detected so that their contribution to product recommendations can be nullified. Researchers and online communities have long treated spam detection as a serious concern, yet there is still room to explore performance on large-scale, complex datasets. This work contributes robust feature selection with derived features that give more detail about malicious reviews and spammers. Ensemble and other standard machine learning techniques are trained and evaluated on the optimal feature sets. In addition, a Metapath-based Graph Convolution Network (M-GCN) framework is proposed: an implicit knowledge extraction method that automatically captures the complex semantic meaning of reviews from the heterogeneous network. It analyzes the triplet relationships among users, reviews, and products on e-commerce sites by examining Top-n feature sets in a mutually reinforcing manner. The proposed model is evaluated on the Yelp and Amazon benchmark datasets, where it outperforms state-of-the-art techniques both with and without graph utilization and reaches an accuracy of 96% in the prediction task.
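The abstract does not spell out how M-GCN combines metapaths, so the following is only an illustrative sketch of the general idea rather than the authors' method. It assumes two metapaths (review-user-review and review-product-review), a single graph-convolution step per metapath with symmetric normalization, and mean aggregation across metapaths. The toy data, the function names, and the choice of mean aggregation are all hypothetical.

```python
import math
import random

# Hypothetical toy data: the author and the target product of each of 4 reviews.
review_user = ["u1", "u1", "u2", "u2"]
review_product = ["p1", "p2", "p2", "p2"]

def metapath_adjacency(shared_node):
    """Review-X-Review adjacency: 1.0 where two reviews share the same X node.
    The diagonal is 1.0 as well, which acts as the usual GCN self-loop."""
    n = len(shared_node)
    return [[1.0 if shared_node[i] == shared_node[j] else 0.0 for j in range(n)]
            for i in range(n)]

def normalize(adj):
    """Symmetric normalization D^{-1/2} A D^{-1/2}."""
    inv_sqrt = [1.0 / math.sqrt(sum(row)) for row in adj]
    n = len(adj)
    return [[adj[i][j] * inv_sqrt[i] * inv_sqrt[j] for j in range(n)]
            for i in range(n)]

def matmul(a, b):
    """Plain matrix product of two lists of lists."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)] for row in a]

def metapath_gcn_layer(features, adjacencies, weights):
    """Propagate features along each metapath, average the results, apply ReLU.
    Mean aggregation is an assumption made for this sketch."""
    per_path = [matmul(matmul(normalize(a), features), w)
                for a, w in zip(adjacencies, weights)]
    rows, cols = len(features), len(weights[0][0])
    return [[max(0.0, sum(p[i][j] for p in per_path) / len(per_path))
             for j in range(cols)] for i in range(rows)]

random.seed(0)
# Random stand-ins for per-review feature vectors and learned weight matrices.
features = [[random.gauss(0, 1) for _ in range(3)] for _ in range(4)]
adjacencies = [metapath_adjacency(review_user), metapath_adjacency(review_product)]
weights = [[[random.gauss(0, 1) for _ in range(2)] for _ in range(3)]
           for _ in adjacencies]
hidden = metapath_gcn_layer(features, adjacencies, weights)
```

Here `hidden` holds one two-dimensional embedding per review. In a full model such embeddings would feed a classifier that labels each review as spam or genuine, and the weights would be learned rather than sampled at random.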
Pages: 3005-3023
Number of pages: 19