ScoreGAN: A Fraud Review Detector Based on Regulated GAN With Data Augmentation

Cited by: 11

Authors:

Shehnepoor, Saeedreza [1]

Togneri, Roberto [1]

Liu, Wei [2]

Bennamoun, Mohammed [2]

Affiliations:

[1] Univ Western Australia, Sch Elect Elect & Comp Engn, Perth, WA 6009, Australia

[2] Univ Western Australia, Dept Comp Sci & Software Engn, Perth, WA 6009, Australia

Source:

IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY | 2022 / Vol. 17

Keywords:

Feature extraction; Generative adversarial networks; Metadata; Australia; Deep learning; Training; Generators; Fraud reviews detection; deep learning; generative adversarial networks; joint representation; information gain maximization;

DOI:

10.1109/TIFS.2021.3139771

Chinese Library Classification (CLC) number:

TP301 [Theory, methods];

Discipline classification code:

081202;

Abstract:

The promising performance of Deep Neural Networks (DNNs) in text classification has led researchers to apply them to fraud review detection. However, the lack of trusted labeled data has limited the performance of current solutions in detecting fraud reviews. The Generative Adversarial Network (GAN), as a semi-supervised method, has been demonstrated to be effective for data augmentation. State-of-the-art solutions use GANs to overcome the data scarcity problem, but they fail to incorporate behavioral clues in fraud generation. Additionally, they overlook possible bot-generated reviews in the dataset. Finally, they suffer from a common limitation in the generalization and stability of the GAN, which slows down training. In this work, we propose ScoreGAN for fraud review detection, which makes use of both review text and review rating scores in the generation and detection process. Scores are incorporated into the loss function through Information Gain Maximization (IGM) for three reasons. First, it allows the generator to produce score-correlated reviews based on the scores it is given. Second, the generated reviews are used to train the discriminator, allowing it to correctly label possible bot-generated reviews through joint representations learned from the concatenation of the Global Vectors for Word Representation (GloVe) extracted from the text and the score. Third, it improves the stability and generalization of the GAN. Results show that the proposed framework outperformed the existing state-of-the-art FakeGAN framework in terms of AP by 7% and 5% on the Yelp and TripAdvisor datasets, respectively.
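The concatenation step mentioned in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the toy embedding table, the averaging of word vectors, the 1-5 rating scale, and the names `TOY_EMBEDDINGS`, `text_vector`, and `joint_representation` are all assumptions made for illustration. The sketch only shows how a text vector and a rating score might be joined into one input vector; in the paper, the joint representation is then learned from this kind of concatenated input.

```python
from typing import Dict, List

# Hypothetical 3-dimensional vectors standing in for pretrained GloVe embeddings.
TOY_EMBEDDINGS: Dict[str, List[float]] = {
    "great": [0.9, 0.1, 0.0],
    "food": [0.2, 0.8, 0.1],
    "terrible": [-0.9, 0.1, 0.0],
}
EMBEDDING_DIM = 3
MAX_SCORE = 5.0  # assumption: ratings are on a 1-5 star scale


def text_vector(tokens: List[str]) -> List[float]:
    """Average the vectors of known tokens; unknown tokens are skipped."""
    known = [TOY_EMBEDDINGS[t] for t in tokens if t in TOY_EMBEDDINGS]
    if not known:
        return [0.0] * EMBEDDING_DIM
    return [sum(dim) / len(known) for dim in zip(*known)]


def joint_representation(review: str, score: float) -> List[float]:
    """Concatenate the text vector with the rating score scaled to [0, 1]."""
    tokens = review.lower().split()
    return text_vector(tokens) + [score / MAX_SCORE]


features = joint_representation("Great food", 5)
print(len(features))  # embedding dimensions plus one score dimension
```

Averaging is used here only because it is the simplest way to obtain a fixed-length text vector; a sequence model over the per-word vectors would be an equally plausible reading of the abstract.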


Pages: 280-291

Page count: 12

50 entries in total

[21] Enhancing OCT patch-based segmentation with improved GAN data augmentation and semi-supervised learning
Kugelman J.
Alonso-Caneiro D.
Read S.A.
Vincent S.J.
Collins M.J.
Neural Computing and Applications, 2024, 36 (29) : 18087 - 18105
[22] Tea Disease Recognition Based on Image Segmentation and Data Augmentation
Li, Ji
Liao, Chenyi
IEEE ACCESS, 2025, 13 : 19664 - 19677
[23] Log-Spectral Matching GAN: PPG-Based Atrial Fibrillation Detection can be Enhanced by GAN-Based Data Augmentation With Integration of Spectral Loss
Ding, Cheng
Xiao, Ran
Do, Duc H.
Lee, David Scott
Lee, Randall J.
Kalantarian, Shadi
Hu, Xiao
IEEE JOURNAL OF BIOMEDICAL AND HEALTH INFORMATICS, 2023, 27 (03) : 1331 - 1341
[24] Tackling the class imbalanced dermoscopic image classification using data augmentation and GAN
Alsaidi, Mostapha
Jan, Muhammad Tanveer
Altaher, Ahmed
Zhuang, Hanqi
Zhu, Xingquan
MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 83 (16) : 49121 - 49147
[25] Tackling the class imbalanced dermoscopic image classification using data augmentation and GAN
Mostapha Alsaidi
Muhammad Tanveer Jan
Ahmed Altaher
Hanqi Zhuang
Xingquan Zhu
Multimedia Tools and Applications, 2024, 83 : 49121 - 49147
[26] GAN based augmentation using a hybrid loss function for dermoscopy images
Goceri, Evgin
ARTIFICIAL INTELLIGENCE REVIEW, 2024, 57 (09)
[27] Towards Post-disaster Damage Assessment using Deep Transfer Learning and GAN-based Data Augmentation
Banerjee, Sourasekhar
Patel, Yashwant Singh
Kumar, Pushkar
Bhuyan, Monowar
PROCEEDINGS OF THE 24TH INTERNATIONAL CONFERENCE ON DISTRIBUTED COMPUTING AND NETWORKING, ICDCN 2023, 2023, : 372 - 377
[28] Data augmentation for medical imaging: A systematic literature review
Garcea, Fabio
Serra, Alessio
Lamberti, Fabrizio
Morra, Lia
COMPUTERS IN BIOLOGY AND MEDICINE, 2023, 152
[29] EnvGAN: a GAN-based augmentation to improve environmental sound classification
Madhu, Aswathy
Suresh, K.
ARTIFICIAL INTELLIGENCE REVIEW, 2022, 55 (08) : 6301 - 6320
[30] A Novel Approach for Intelligent Fault Diagnosis in Bearing With Imbalanced Data Based on Cycle-Consistent GAN
Liao, Wenjie
Wu, Like
Xu, Shihui
Fujimura, Shigeru
IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2024, 73
