ScoreGAN: A Fraud Review Detector Based on Regulated GAN With Data Augmentation

被引:11
|
作者
Shehnepoor, Saeedreza [1 ]
Togneri, Roberto [1 ]
Liu, Wei [2 ]
Bennamoun, Mohammed [2 ]
机构
[1] Univ Western Australia, Sch Elect Elect & Comp Engn, Perth, WA 6009, Australia
[2] Univ Western Australia, Dept Comp Sci & Software Engn, Perth, WA 6009, Australia
关键词
Feature extraction; Generative adversarial networks; Metadata; Australia; Deep learning; Training; Generators; Fraud reviews detection; deep learning; generative adversarial networks; joint representation; information gain maximization;
D O I
10.1109/TIFS.2021.3139771
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
The promising performance of Deep Neural Networks (DNNs) in text classification has attracted researchers to use them for fraud review detection. However, the lack of trusted labeled data has limited the performance of the current solutions in detecting fraud reviews. The Generative Adversarial Network (GAN) as a semi-supervised method has been demonstrated to be effective for data augmentation purposes. The state-of-the-art solutions utilize GANs to overcome the data scarcity problem. However, they fail to incorporate the behavioral clues in fraud generation. Additionally, state-of-the-art approaches overlook the possible bot-generated reviews in the dataset. Finally, they also suffer from a common limitation in the generalization and stability of the GAN, slowing down the training procedure. In this work, we propose ScoreGAN for fraud review detection that makes use of both review text and review rating scores in the generation and detection process. Scores are incorporated through Information Gain Maximization (IGM) into the loss function for three reasons. One is to generate score-correlated reviews based on the scores given to the generator. Second, the generated reviews are employed to train the discriminator, allowing the discriminator to correctly label the possible bot-generated reviews through joint representations learned from the concatenation of GLobal Vector for Word representation (GLoVe) extracted from the text and the score. Finally, it can be used to improve the stability and generalization of the GAN. Results show that the proposed framework outperformed the existing state-of-the-art FakeGAN framework, in terms of AP by 7%, and 5% on the Yelp and TripAdvisor datasets, respectively.
引用
收藏
页码:280 / 291
页数:12
相关论文
共 50 条
  • [21] Enhancing OCT patch-based segmentation with improved GAN data augmentation and semi-supervised learning
    Kugelman J.
    Alonso-Caneiro D.
    Read S.A.
    Vincent S.J.
    Collins M.J.
    Neural Computing and Applications, 2024, 36 (29) : 18087 - 18105
  • [22] Tea Disease Recognition Based on Image Segmentation and Data Augmentation
    Li, Ji
    Liao, Chenyi
    IEEE ACCESS, 2025, 13 : 19664 - 19677
  • [23] Log-Spectral Matching GAN: PPG-Based Atrial Fibrillation Detection can be Enhanced by GAN-Based Data Augmentation With Integration of Spectral Loss
    Ding, Cheng
    Xiao, Ran
    Do, Duc H.
    Lee, David Scott
    Lee, Randall J.
    Kalantarian, Shadi
    Hu, Xiao
    IEEE JOURNAL OF BIOMEDICAL AND HEALTH INFORMATICS, 2023, 27 (03) : 1331 - 1341
  • [24] Tackling the class imbalanced dermoscopic image classification using data augmentation and GAN
    Alsaidi, Mostapha
    Jan, Muhammad Tanveer
    Altaher, Ahmed
    Zhuang, Hanqi
    Zhu, Xingquan
    MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 83 (16) : 49121 - 49147
  • [25] Tackling the class imbalanced dermoscopic image classification using data augmentation and GAN
    Mostapha Alsaidi
    Muhammad Tanveer Jan
    Ahmed Altaher
    Hanqi Zhuang
    Xingquan Zhu
    Multimedia Tools and Applications, 2024, 83 : 49121 - 49147
  • [27] Towards Post-disaster Damage Assessment using Deep Transfer Learning and GAN-based Data Augmentation
    Banerjee, Sourasekhar
    Patel, Yashwant Singh
    Kumar, Pushkar
    Bhuyan, Monowar
    PROCEEDINGS OF THE 24TH INTERNATIONAL CONFERENCE ON DISTRIBUTED COMPUTING AND NETWORKING, ICDCN 2023, 2023, : 372 - 377
  • [28] Data augmentation for medical imaging: A systematic literature review
    Garcea, Fabio
    Serra, Alessio
    Lamberti, Fabrizio
    Morra, Lia
    COMPUTERS IN BIOLOGY AND MEDICINE, 2023, 152
  • [29] EnvGAN: a GAN-based augmentation to improve environmental sound classification
    Madhu, Aswathy
    Suresh, K.
    ARTIFICIAL INTELLIGENCE REVIEW, 2022, 55 (08) : 6301 - 6320
  • [30] A Novel Approach for Intelligent Fault Diagnosis in Bearing With Imbalanced Data Based on Cycle-Consistent GAN
    Liao, Wenjie
    Wu, Like
    Xu, Shihui
    Fujimura, Shigeru
    IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2024, 73