Semi-SGD: Semi-supervised Learning based Spammer Group Detection in Product Reviews

被引:5
|
作者
Zhang, Lu [1 ]
Yuan, Yang [2 ]
Wu, Zhiang [1 ]
Cao, Jie [1 ]
机构
[1] Nanjing Univ Finance & Econ, Jiangsu Prov Key Lab E Business, Nanjing, Jiangsu, Peoples R China
[2] Nuctech JiangSu Co Ltd, Changzhou, Jiangsu, Peoples R China
来源
2017 FIFTH INTERNATIONAL CONFERENCE ON ADVANCED CLOUD AND BIG DATA (CBD) | 2017年
基金
中国国家自然科学基金;
关键词
Spammer Group Detection; Semi-supervised Learning; Naive Bayes Classifier; EM Algorithm; Amazon.cn;
D O I
10.1109/CBD.2017.70
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The purchase decision of customers in e-commerce platforms is strongly influenced by product ratings and reviews. Driven by the profits, review spammers post fake reviews to promote their products or demote their competitors' products. Differ from individual spammers, the spammer groups manipulate reviews together and can be more damaging. Existing work for spammer group detection extract candidate groups from review data and identify the spammer groups using unsupervised spamicity ranking methods. However, the labeled and unlabeled data are existing simultaneously in practice and no method makes good use of both these data in spammer group detection. In this paper, we propose a semi-supervised learning based spammer group detection method (Semi-SGD), which trains a Naive Bayes classifier on a small set of labeled data as an initial classifier, and then incorporates unlabeled data with Expectation Maximization (EM) algorithm to improve the initial classifier iteratively. Experiments on Amazon.cn datasets show that our proposed Semi-SGD is efficient and effective.
引用
收藏
页码:368 / 373
页数:6
相关论文
共 50 条
  • [41] Behavior modeling and abnormality detection based on semi-supervised learning method
    National Laboratory of Pattern Recognition, Institute of Automation, Chinese Academy of Sciences, Beijing 100080, China
    Ruan Jian Xue Bao, 2007, 3 (527-537): : 527 - 537
  • [42] Semi-Supervised Learning-Based Method for Unknown Anomaly Detection
    Cheng, Yudong
    Zhou, Fang
    Jisuanji Yanjiu yu Fazhan/Computer Research and Development, 2024, 61 (07): : 1670 - 1680
  • [43] Semi-Supervised Novelty Detection
    Blanchard, Gilles
    Lee, Gyemin
    Scott, Clayton
    JOURNAL OF MACHINE LEARNING RESEARCH, 2010, 11 : 2973 - 3009
  • [44] Consistency-based semi-supervised learning for oriented object detection
    Fu, Ronghao
    Chen, Chengcheng
    Yan, Shuang
    Wang, Xianchang
    Chen, Huiling
    KNOWLEDGE-BASED SYSTEMS, 2024, 304
  • [45] Semi-supervised learning methods for weed detection in turf
    Liu, Teng
    Zhai, Danlan
    He, Feiyu
    Yu, Jialin
    PEST MANAGEMENT SCIENCE, 2024, 80 (06) : 2552 - 2562
  • [46] LEARNING DISCRIMINATIVE FEATURES FOR SEMI-SUPERVISED ANOMALY DETECTION
    Feng, Zhe
    Tang, Jie
    Dou, Yishun
    Wu, Gangshan
    2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 2935 - 2939
  • [47] Semi-supervised vanishing point detection with contrastive learning
    Wang, Yukun
    Gu, Shuo
    Liu, Yinbo
    Kong, Hui
    PATTERN RECOGNITION, 2024, 153
  • [48] Semi-supervised Object Detection via VC Learning
    Chen, Changrui
    Debattista, Kurt
    Han, Jungong
    COMPUTER VISION, ECCV 2022, PT XXXI, 2022, 13691 : 169 - 185
  • [49] Semi-supervised learning for text-line detection
    Liu, Zongyi
    Zhou, Hanning
    Yang, Ning
    PATTERN RECOGNITION LETTERS, 2010, 31 (11) : 1260 - 1273
  • [50] Flow-based anomaly detection using semi-supervised learning
    Jadidi, Zahra
    Muthukkumarasamy, Vallipuram
    Sithirasenan, Elankayer
    Singh, Kalvinder
    2015 9TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING AND COMMUNICATION SYSTEMS (ICSPCS), 2015,