ScoreGAN: A Fraud Review Detector Based on Regulated GAN With Data Augmentation

Cited by: 11
Authors
Shehnepoor, Saeedreza [1]
Togneri, Roberto [1]
Liu, Wei [2]
Bennamoun, Mohammed [2]
Affiliations
[1] Univ Western Australia, Sch Elect Elect & Comp Engn, Perth, WA 6009, Australia
[2] Univ Western Australia, Dept Comp Sci & Software Engn, Perth, WA 6009, Australia
Keywords
Feature extraction; Generative adversarial networks; Metadata; Australia; Deep learning; Training; Generators; Fraud reviews detection; Joint representation; Information gain maximization
DOI
10.1109/TIFS.2021.3139771
Chinese Library Classification (CLC) number
TP301 [Theory, Methods]
Discipline classification code
081202
Abstract
The promising performance of Deep Neural Networks (DNNs) in text classification has attracted researchers to use them for fraud review detection. However, the lack of trusted labeled data has limited the performance of current solutions in detecting fraud reviews. The Generative Adversarial Network (GAN), as a semi-supervised method, has been demonstrated to be effective for data augmentation. State-of-the-art solutions utilize GANs to overcome the data scarcity problem. However, they fail to incorporate behavioral clues in fraud generation. Additionally, state-of-the-art approaches overlook possible bot-generated reviews in the dataset. Finally, they suffer from a common limitation in the generalization and stability of the GAN, which slows down training. In this work, we propose ScoreGAN for fraud review detection, which makes use of both review text and review rating scores in the generation and detection process. Scores are incorporated through Information Gain Maximization (IGM) into the loss function for three reasons. First, this generates score-correlated reviews based on the scores given to the generator. Second, the generated reviews are employed to train the discriminator, allowing it to correctly label possible bot-generated reviews through joint representations learned from the concatenation of Global Vectors for Word Representation (GloVe) extracted from the text and the score. Finally, it can be used to improve the stability and generalization of the GAN. Results show that the proposed framework outperformed the existing state-of-the-art FakeGAN framework in terms of AP by 7% and 5% on the Yelp and TripAdvisor datasets, respectively.
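The joint representation mentioned in the abstract (text embeddings concatenated with the rating score) can be illustrated with a minimal sketch. This is not the authors' implementation: the toy embedding table, the averaging of word vectors, the one-hot score encoding, the 1 to 5 rating range, and all function names below are assumptions made for illustration. The score term is likewise an assumed InfoGAN-style surrogate (negative log-likelihood of the conditioning score under an auxiliary predictor), not the paper's exact IGM loss.

```python
import math

# Hypothetical toy table standing in for pretrained GloVe vectors (assumption).
TOY_EMBEDDINGS = {
    "great": [0.9, 0.1, 0.0],
    "food": [0.2, 0.7, 0.1],
    "terrible": [-0.8, 0.1, 0.3],
}
EMBEDDING_DIM = 3
NUM_SCORES = 5  # assumed 1-5 star rating scale


def text_vector(tokens):
    """Average the embeddings of known tokens; return zeros if none are known."""
    known = [TOY_EMBEDDINGS[t] for t in tokens if t in TOY_EMBEDDINGS]
    if not known:
        return [0.0] * EMBEDDING_DIM
    return [sum(column) / len(known) for column in zip(*known)]


def one_hot_score(score):
    """Encode an integer rating in [1, NUM_SCORES] as a one-hot vector."""
    if not 1 <= score <= NUM_SCORES:
        raise ValueError(f"score must be in 1..{NUM_SCORES}, got {score}")
    vector = [0.0] * NUM_SCORES
    vector[score - 1] = 1.0
    return vector


def joint_representation(tokens, score):
    """Concatenate the text vector with the one-hot score vector."""
    return text_vector(tokens) + one_hot_score(score)


def score_information_term(predicted_probs, score):
    """Assumed surrogate for a score-aware loss term: the negative
    log-probability that an auxiliary predictor assigns to the score the
    review was conditioned on. Lower values mean the review text is more
    predictive of its score."""
    return -math.log(predicted_probs[score - 1])
```

For example, `joint_representation(["great", "food"], 4)` yields an 8-dimensional vector: three averaged embedding components followed by a one-hot encoding of the 4-star rating. A discriminator would consume such a vector in place of the text embedding alone.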
Pages: 280-291
Page count: 12