Node embedding approach for accurate detection of fake reviews: a graph-based machine learning approach with explainable AI

Cited by: 0

Authors:

Zaki, Nazar ^{[1]}

Krishnan, Anusuya ^{[1]}

Turaev, Sherzod ^{[1]}

Rustamov, Zahiriddin ^{[1]}

Rustamov, Jaloliddin ^{[1]}

Almusalami, Aisha ^{[1]}

Ayyad, Farah ^{[2]}

Regasa, Tsion ^{[1]}

Iriho, Brice Boris ^{[1]}

Affiliations:

[1] United Arab Emirates Univ, Coll Informat Technol, Dept Comp Sci & Software Engn, Al Ain 15551, U Arab Emirates

[2] United Arab Emirates Univ, Coll Informat Technol, Dept Informat Syst & Secur, Al Ain 15551, U Arab Emirates

Source:

INTERNATIONAL JOURNAL OF DATA SCIENCE AND ANALYTICS | 2024 / Vol. 18 / Issue 03

Keywords:

Fake review detection; Fake opinion detection; Node embedding; Machine learning; Explainable AI;

DOI:

10.1007/s41060-024-00565-2

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

In recent years, online reviews have become increasingly important in promoting various products and services. Unfortunately, writing deceptive reviews has also become a common practice to promote one's own business or tarnish the reputation of competitors. As a result, identifying fake reviews has become an intense and ongoing area of research. This paper proposes a node embedding approach to detect online fake reviews. The approach involves extracting features from the input data to create a distance matrix, which is then used to construct a graph. From the graph, we extract graph nodes and use the Node2Vec biased random walk algorithm to create a model. We retrieve node embeddings from the Node2Vec model using Word2Vec and use different classifiers to classify the nodes. We trained and evaluated the machine learning models on the Deceptive Opinion Spam Corpus and YelpChi datasets and achieved superior results compared to prior work for detecting fake reviews, with SVM using the Hamming distance achieving 98.44% accuracy, 98.44% precision, 98.44% recall, and 98.44% F1-score. Furthermore, we explored different methods for explaining our proposed methods using explainable AI, demonstrating the interpretability of our approach. Our proposed node embedding approach shows promising results for detecting fake reviews and offers a transparent and interpretable solution for the problem.
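
The first stages of the pipeline described in the abstract (features, distance matrix, graph, biased random walks) can be sketched roughly as follows. This is only an illustrative sketch, not the authors' implementation: the toy binary feature vectors, the distance threshold used to create edges, the values of p and q, and all function names are assumptions made for this example. The abstract does not state how edges are derived from the distance matrix.

```python
import random
from itertools import combinations


def hamming_distance(a, b):
    """Number of positions at which two equal-length feature vectors differ."""
    if len(a) != len(b):
        raise ValueError("feature vectors must have equal length")
    return sum(x != y for x, y in zip(a, b))


def distance_matrix(features):
    """Symmetric matrix of pairwise Hamming distances."""
    n = len(features)
    dist = [[0] * n for _ in range(n)]
    for i, j in combinations(range(n), 2):
        dist[i][j] = dist[j][i] = hamming_distance(features[i], features[j])
    return dist


def build_graph(dist, threshold):
    """Assumed rule: link two reviews when their distance is <= threshold."""
    n = len(dist)
    graph = {i: set() for i in range(n)}
    for i, j in combinations(range(n), 2):
        if dist[i][j] <= threshold:
            graph[i].add(j)
            graph[j].add(i)
    return graph


def biased_walk(graph, start, length, p, q, rng):
    """One Node2Vec-style second-order random walk on an unweighted graph."""
    walk = [start]
    while len(walk) < length:
        cur = walk[-1]
        neighbors = sorted(graph[cur])
        if not neighbors:
            break  # isolated node: the walk cannot continue
        if len(walk) == 1:
            walk.append(rng.choice(neighbors))
            continue
        prev = walk[-2]
        weights = []
        for nxt in neighbors:
            if nxt == prev:
                weights.append(1.0 / p)  # step back to the previous node
            elif nxt in graph[prev]:
                weights.append(1.0)      # stay close to the previous node
            else:
                weights.append(1.0 / q)  # move further away
        walk.append(rng.choices(neighbors, weights=weights, k=1)[0])
    return walk


def generate_walks(graph, num_walks, length, p=1.0, q=1.0, seed=0):
    """Start num_walks walks from every node, in shuffled order."""
    rng = random.Random(seed)
    nodes = sorted(graph)
    walks = []
    for _ in range(num_walks):
        rng.shuffle(nodes)
        for node in nodes:
            walks.append(biased_walk(graph, node, length, p, q, rng))
    return walks


# Toy binary feature vectors, made up for illustration (not from the datasets).
features = [
    [1, 0, 1, 1, 0],
    [1, 0, 1, 0, 0],
    [0, 1, 0, 0, 1],
    [0, 1, 0, 1, 1],
    [1, 1, 1, 1, 0],
]
dist = distance_matrix(features)
graph = build_graph(dist, threshold=2)
walks = generate_walks(graph, num_walks=3, length=6, p=1.0, q=0.5)
```

In a fuller version, the generated walks would be treated as sentences and passed to a skip-gram model (for example gensim's Word2Vec) to obtain one vector per node, and those vectors would then be given to a classifier such as an SVM. Those steps are left out here to keep the sketch free of third-party dependencies.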


Pages: 295-315

Page count: 21

