ScoreGAN: A Fraud Review Detector Based on Regulated GAN With Data Augmentation

被引:11
|
作者
Shehnepoor, Saeedreza [1 ]
Togneri, Roberto [1 ]
Liu, Wei [2 ]
Bennamoun, Mohammed [2 ]
机构
[1] Univ Western Australia, Sch Elect Elect & Comp Engn, Perth, WA 6009, Australia
[2] Univ Western Australia, Dept Comp Sci & Software Engn, Perth, WA 6009, Australia
关键词
Feature extraction; Generative adversarial networks; Metadata; Australia; Deep learning; Training; Generators; Fraud reviews detection; deep learning; generative adversarial networks; joint representation; information gain maximization;
D O I
10.1109/TIFS.2021.3139771
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
The promising performance of Deep Neural Networks (DNNs) in text classification has attracted researchers to use them for fraud review detection. However, the lack of trusted labeled data has limited the performance of the current solutions in detecting fraud reviews. The Generative Adversarial Network (GAN) as a semi-supervised method has been demonstrated to be effective for data augmentation purposes. The state-of-the-art solutions utilize GANs to overcome the data scarcity problem. However, they fail to incorporate the behavioral clues in fraud generation. Additionally, state-of-the-art approaches overlook the possible bot-generated reviews in the dataset. Finally, they also suffer from a common limitation in the generalization and stability of the GAN, slowing down the training procedure. In this work, we propose ScoreGAN for fraud review detection that makes use of both review text and review rating scores in the generation and detection process. Scores are incorporated through Information Gain Maximization (IGM) into the loss function for three reasons. One is to generate score-correlated reviews based on the scores given to the generator. Second, the generated reviews are employed to train the discriminator, allowing the discriminator to correctly label the possible bot-generated reviews through joint representations learned from the concatenation of GLobal Vector for Word representation (GLoVe) extracted from the text and the score. Finally, it can be used to improve the stability and generalization of the GAN. Results show that the proposed framework outperformed the existing state-of-the-art FakeGAN framework, in terms of AP by 7%, and 5% on the Yelp and TripAdvisor datasets, respectively.
引用
收藏
页码:280 / 291
页数:12
相关论文
共 50 条
  • [41] A GAN-Based Data Augmentation Method for Imbalanced Multi-Class Skin Lesion Classification
    Su, Qichen
    Hamed, Haza Nuzly Abdull
    Isa, Mohd Adham
    Hao, Xue
    Dai, Xin
    IEEE ACCESS, 2024, 12 : 16498 - 16513
  • [42] GAN-Based Data Augmentation for AI-Enabled ATP in Free Space Optical Communication
    Liu, Yuchen
    Liu, Yejun
    Song, Song
    Chen, Kun
    Guo, Lei
    IEEE COMMUNICATIONS LETTERS, 2024, 28 (05) : 1067 - 1071
  • [43] Data Augmentation for Insider Threat Detection with GAN
    Yuan, Fangfang
    Shang, Yanmin
    Liu, Yanbing
    Cao, Yanan
    Tan, Jianlong
    2020 IEEE 32ND INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE (ICTAI), 2020, : 632 - 638
  • [44] A NOVEL GAN-BASED DATA AUGMENTATION ALGORITHM FOR SEMICONDUCTOR DEFECT INSPECTION
    Liu, Yang
    Guan, Yuanjun
    Han, Tianyan
    Ma, Can
    Wang, Jiayi
    Wang, Tao
    Yi, Qianchuan
    Hu, Lilei
    CONFERENCE OF SCIENCE & TECHNOLOGY FOR INTEGRATED CIRCUITS, 2024 CSTIC, 2024,
  • [45] A Survey on GAN-Based Data Augmentation for Hand Pose Estimation Problem
    Farahanipad, Farnaz
    Rezaei, Mohammad
    Nasr, Mohammad Sadegh
    Kamangar, Farhad
    Athitsos, Vassilis
    TECHNOLOGIES, 2022, 10 (02)
  • [46] A review of medical image data augmentation techniques for deep learning applications
    Chlap, Phillip
    Min, Hang
    Vandenberg, Nym
    Dowling, Jason
    Holloway, Lois
    Haworth, Annette
    JOURNAL OF MEDICAL IMAGING AND RADIATION ONCOLOGY, 2021, 65 (05) : 545 - 563
  • [47] Data augmentation using fast converging CIELAB-GAN for efficient deep learning dataset generation
    Fadaeddini, Amin
    Majidi, Babak
    Souri, Alireza
    Eshghi, Mohammad
    INTERNATIONAL JOURNAL OF COMPUTATIONAL SCIENCE AND ENGINEERING, 2023, 26 (04) : 459 - 469
  • [48] GAN-Based Data Augmentation for Prediction Improvement Using Gene Expression Data in Cancer
    Moreno-Barea, Francisco J.
    Jerez, Jose M.
    Franco, Leonardo
    COMPUTATIONAL SCIENCE - ICCS 2022, PT III, 2022, 13352 : 28 - 42
  • [49] A transfer learning-based GAN for data augmentation in automatic modulation recognition
    Gao, Hai
    Ke, Jing
    Lu, Xiaochun
    Cheng, Fang
    Chen, Xiaofei
    ENGINEERING RESEARCH EXPRESS, 2024, 6 (04):
  • [50] GAN-Based Temporal Association Rule Mining on Multivariate Time Series Data
    He, Guoliang
    Dai, Lifang
    Yu, Zhiwen
    Chen, C. L. Philip
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2024, 36 (10) : 5168 - 5180