Node embedding approach for accurate detection of fake reviews: a graph-based machine learning approach with explainable AI

被引:0
作者
Zaki, Nazar [1 ]
Krishnan, Anusuya [1 ]
Turaev, Sherzod [1 ]
Rustamov, Zahiriddin [1 ]
Rustamov, Jaloliddin [1 ]
Almusalami, Aisha [1 ]
Ayyad, Farah [2 ]
Regasa, Tsion [1 ]
Iriho, Brice Boris [1 ]
机构
[1] United Arab Emirates Univ, Coll Informat Technol, Dept Comp Sci & Software Engn, Al Ain 15551, U Arab Emirates
[2] United Arab Emirates Univ, Coll Informat Technol, Dept Informat Syst & Secur, Al Ain 15551, U Arab Emirates
关键词
Fake review detection; Fake opinion detection; Node embedding; Machine learning; Explainable AI;
D O I
10.1007/s41060-024-00565-2
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In recent years, online reviews have become increasingly important in promoting various products and services. Unfortunately, writing deceptive reviews has also become a common practice to promote one's own business or tarnish the reputation of competitors. As a result, identifying fake reviews has become an intense and ongoing area of research. This paper proposes a node embedding approach to detect online fake reviews. The approach involves extracting features from the input data to create a distance matrix, which is then used to construct a Graph. From the graph, we extract graph nodes and use the Node2Vec biased random walk algorithm to create a model. We retrieve node embeddings from the Node2Vec model using Word2Vec and use different classifiers to classify the nodes. We trained and evaluated the machine learning models on the Deceptive Opinion Spam Corpus and YelpChi datasets and achieved superior results compared to prior work for detecting fake reviews, with SVM using the Hamming distance achieving 98.44% accuracy, 98.44% precision, 98.44% recall, and 98.44% F1-score. Furthermore, we explored different methods for explaining our proposed methods using explainable AI, demonstrating the interpretability of our approach. Our proposed node embedding approach shows promising results for detecting fake reviews and offers a transparent and interpretable solution for the problem.
引用
收藏
页码:295 / 315
页数:21
相关论文
共 60 条
  • [51] Opinion Spam Detection Based on Heterogeneous Information Network
    Sun, Yingcheng
    Loparo, Kenneth
    [J]. 2019 IEEE 31ST INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE (ICTAI 2019), 2019, : 1156 - 1163
  • [52] A non-convex semi-supervised approach to opinion spam detection by ramp-one class SVM
    Tian, Yingjie
    Mirzabagheri, Mahboubeh
    Tirandazi, Peyman
    Bamakan, Seyed Mojtaba Hosseini
    [J]. INFORMATION PROCESSING & MANAGEMENT, 2020, 57 (06)
  • [53] Vajjala S, 2020, PRACTICAL NATURAL LA
  • [54] van der Maaten L, 2008, J MACH LEARN RES, V9, P2579
  • [55] Fake Review Detection Based on Multiple Feature Fusion and Rolling Collaborative Training
    Wang, Jingdong
    Kan, Haitao
    Meng, Fanqi
    Mu, Qizi
    Shi, Genhua
    Xiao, Xixi
    [J]. IEEE ACCESS, 2020, 8 : 182625 - 182639
  • [56] Xu Q., 2012, P COLING 2012, P1341
  • [57] Toward a Semantic Granularity Model for Domain-Specific Information Retrieval
    Yan, Xin
    Lau, Raymond Y. K.
    Song, Dawei
    Li, Xue
    Ma, Jian
    [J]. ACM TRANSACTIONS ON INFORMATION SYSTEMS, 2011, 29 (03)
  • [58] Graph Learning for Fake Review Detection
    Yu, Shuo
    Ren, Jing
    Li, Shihao
    Naseriparsa, Mehdi
    Xia, Feng
    [J]. FRONTIERS IN ARTIFICIAL INTELLIGENCE, 2022, 5
  • [59] Identifying Protein Complexes in Protein-Protein Interaction Data Using Graph Convolutional Network
    Zaki, Nazar
    Singh, Harsh
    Mohamed, Elfadil A.
    [J]. IEEE ACCESS, 2021, 9 : 123717 - 123726
  • [60] Conotoxin protein classification using free scores of words and support vector machines
    Zaki, Nazar
    Wolfsheimer, Stefan
    Nuel, Gregory
    Khuri, Sawsan
    [J]. BMC BIOINFORMATICS, 2011, 12