Understanding Large-Scale Network Effects in Detecting Review Spammers

Cited by: 4
Authors
Rout, Jitendra Kumar [1]
Sahoo, Kshira Sagar [2]
Dalmia, Anmol
Bakshi, Sambit [3]
Bilal, Muhammad [4]
Song, Houbing [5]
Affiliations
[1] Natl Inst Technol Raipur, Dept Comp Sci & Engn, Raipur 492010, India
[2] Umea Univ, Dept Comp Sci, SE-90187 Umea, Sweden
[3] Natl Inst Technol Rourkela, Dept Comp Sci & Engn, Rourkela 769008, Odisha, India
[4] Hankuk Univ Foreign Studies, Dept Comp Engn, Yongin 17035, South Korea
[5] Univ Maryland Baltimore Cty UMBC, Dept Informat Syst, Baltimore, MD 21250 USA
Keywords
Feature extraction; Behavioral sciences; Analytical models; Writing; Unsolicited e-mail; Sentiment analysis; Scalability; Online review spam; opinion spam; review graphs; spam detection; unlabeled review; FRAMEWORK
DOI
10.1109/TCSS.2023.3243139
Chinese Library Classification
TP3 [Computing technology, computer technology]
Discipline classification code
0812
Abstract
Opinion spam detection is a challenge for online review systems and social forum operators. Opinion spamming costs businesses and people money because it deceives customers, as well as automated opinion mining and sentiment analysis systems, by bestowing undeserved positive opinions on target firms and/or fake negative opinions on others. One popular detection approach is to model a review system as a network of users, products, and reviews, for example using review graph models. In this article, we study the effects of network scale on network-based review spammer detection models, specifically the trust model and the SpammerRank model. We evaluate both network models on two large publicly available review datasets: the Amazon dataset (containing 6 million reviews by more than 2 million reviewers) and the UCSD dataset (containing over 82 million reviews by 21 million reviewers). We observe that the SpammerRank model offers better scaling time for applications requiring reviewer indicators, whereas for the trust model the distributions flatten out, indicating variance of reviews with respect to spamming. Detailed observations on the scaling effects of these models are reported in the results section.
Pages: 4994-5004
Page count: 11