Semi-SGD: Semi-supervised Learning based Spammer Group Detection in Product Reviews

被引:5
|
作者
Zhang, Lu [1 ]
Yuan, Yang [2 ]
Wu, Zhiang [1 ]
Cao, Jie [1 ]
机构
[1] Nanjing Univ Finance & Econ, Jiangsu Prov Key Lab E Business, Nanjing, Jiangsu, Peoples R China
[2] Nuctech JiangSu Co Ltd, Changzhou, Jiangsu, Peoples R China
来源
2017 FIFTH INTERNATIONAL CONFERENCE ON ADVANCED CLOUD AND BIG DATA (CBD) | 2017年
基金
中国国家自然科学基金;
关键词
Spammer Group Detection; Semi-supervised Learning; Naive Bayes Classifier; EM Algorithm; Amazon.cn;
D O I
10.1109/CBD.2017.70
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The purchase decision of customers in e-commerce platforms is strongly influenced by product ratings and reviews. Driven by the profits, review spammers post fake reviews to promote their products or demote their competitors' products. Differ from individual spammers, the spammer groups manipulate reviews together and can be more damaging. Existing work for spammer group detection extract candidate groups from review data and identify the spammer groups using unsupervised spamicity ranking methods. However, the labeled and unlabeled data are existing simultaneously in practice and no method makes good use of both these data in spammer group detection. In this paper, we propose a semi-supervised learning based spammer group detection method (Semi-SGD), which trains a Naive Bayes classifier on a small set of labeled data as an initial classifier, and then incorporates unlabeled data with Expectation Maximization (EM) algorithm to improve the initial classifier iteratively. Experiments on Amazon.cn datasets show that our proposed Semi-SGD is efficient and effective.
引用
收藏
页码:368 / 373
页数:6
相关论文
共 50 条
  • [31] Introduction to semi-supervised learning
    Goldberg, Xiaojin
    Synthesis Lectures on Artificial Intelligence and Machine Learning, 2009, 6 : 1 - 116
  • [32] Graph-based semi-supervised learning
    Changshui Zhang
    Fei Wang
    Artificial Life and Robotics, 2009, 14 (4) : 445 - 448
  • [33] Graph-based semi-supervised learning
    Subramanya, Amarnag
    Talukdar, Partha Pratim
    Synthesis Lectures on Artificial Intelligence and Machine Learning, 2014, 29 : 1 - 126
  • [34] Human Semi-Supervised Learning
    Gibson, Bryan R.
    Rogers, Timothy T.
    Zhu, Xiaojin
    TOPICS IN COGNITIVE SCIENCE, 2013, 5 (01) : 132 - 172
  • [35] Ensemble-Based Semi-Supervised Learning for Milling Chatter Detection
    Liu, Weichao
    Wang, Pengyu
    You, Youpeng
    MACHINES, 2022, 10 (11)
  • [36] Reliable Semi-supervised Learning
    Shao, Junming
    Huang, Chen
    Yang, Qinli
    Luo, Guangchun
    2016 IEEE 16TH INTERNATIONAL CONFERENCE ON DATA MINING (ICDM), 2016, : 1197 - 1202
  • [37] Extreme Learning Machine based Novelty Detection for Incremental Semi-Supervised Learning
    Al-Behadili, Husam
    Grumpe, Arne
    Dopp, Christian
    Woehler, Christian
    2015 THIRD INTERNATIONAL CONFERENCE ON IMAGE INFORMATION PROCESSING (ICIIP), 2015, : 230 - 235
  • [38] Remote Sensing Aircraft Image Detection Based on Semi-Supervised Learning
    Du Zexing
    Yin Jinyong
    Yang Jian
    LASER & OPTOELECTRONICS PROGRESS, 2020, 57 (06)
  • [39] Fuzziness based semi-supervised learning approach for intrusion detection system
    Ashfaq, Rana Aamir Raza
    Wang, Xi-Zhao
    Huang, Joshua Zhexue
    Abbas, Haider
    He, Yu-Lin
    INFORMATION SCIENCES, 2017, 378 : 484 - 497
  • [40] FLOW BASED BOTNET DETECTION THROUGH SEMI-SUPERVISED ACTIVE LEARNING
    Qiu, Zhicong
    Miller, David J.
    Kesidis, George
    2017 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2017, : 2387 - 2391