Detecting Collusive Spamming Activities in Community Question Answering

被引:11
|
作者
Liu, Yuli [1 ]
Liu, Yiqun [1 ]
Zhou, Ke [2 ]
Zhang, Min [1 ]
Ma, Shaoping [1 ]
机构
[1] Tsinghua Univ, Dept Comp Sci & Technol, Beijing, Peoples R China
[2] Univ Nottingham, Sch Comp Sci, Nottingham, England
来源
PROCEEDINGS OF THE 26TH INTERNATIONAL CONFERENCE ON WORLD WIDE WEB (WWW'17) | 2017年
关键词
Community Question Answering; Crowdsourcing Manipulation; Spam Detection; Factor Graph;
D O I
10.1145/3038912.3052594
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Community Question Answering (CQA) portals provide rich sources of information on a variety of topics. However, the authenticity and quality of questions and answers (Q&As) has proven hard to control. In a troubling direction, the widespread growth of crowdsourcing websites has created a large-scale, potentially difficult-to-detect workforce to manipulate malicious contents in CQA. The crowd workers who join the same crowdsourcing task about promotion campaigns in CQA collusively manipulate deceptive Q&As for promoting a target (product or service). The collusive spamming group can fully control the sentiment of the target. How to utilize the structure and the attributes for detecting manipulated Q&As? How to detect the collusive group and leverage the group information for the detection task? To shed light on these research questions, we propose a unified framework to tackle the challenge of detecting collusive spamming activities of CQA. First, we interpret the questions and answers in CQA as two independent networks. Second, we detect collusive question groups and answer groups from these two networks respectively by measuring the similarity of the contents posted within a short duration. Third, using attributes (individual-level and group-level) and correlations (user-based and content-based), we proposed a combined factor graph model to detect deceptive Q&As simultaneously by combining two independent factor graphs. With a large-scale practical data set, we find that the proposed framework can detect deceptive contents at early stage, and outperforms a number of competitive baselines.
引用
收藏
页码:1073 / 1082
页数:10
相关论文
共 50 条
  • [31] Gender Identification From Community Question Answering Avatars
    Peralta, Billy
    Figueroa, Alejandro
    Nicolis, Orietta
    Trewhela, Alvaro
    IEEE ACCESS, 2021, 9 : 156701 - 156716
  • [32] Interactive Topic Tagging in Community Question Answering Platforms
    Rad, Radin Hamidi
    Cucerzan, Silviu
    Chandrasekaran, Nirupama
    Gamon, Michael
    ADVANCES IN INFORMATION RETRIEVAL, ECIR 2024, PT III, 2024, 14610 : 195 - 209
  • [33] Named entity disambiguation for questions in community question answering
    Wang, Fang
    Wu, Wei
    Li, Zhoujun
    Zhou, Ming
    KNOWLEDGE-BASED SYSTEMS, 2017, 126 : 68 - 77
  • [34] A Hybrid Model for Experts Finding in Community Question Answering
    Li, Hai
    Jin, Songchang
    Li, Shudong
    2015 INTERNATIONAL CONFERENCE ON CYBER-ENABLED DISTRIBUTED COMPUTING AND KNOWLEDGE DISCOVERY, 2015, : 176 - 184
  • [35] Early Detection of Topical Expertise in Community Question Answering
    van Dijk, David
    Tsagkias, Manos
    de Rijke, Maarten
    SIGIR 2015: PROCEEDINGS OF THE 38TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, 2015, : 995 - 998
  • [36] A Comprehensive Survey and Classification of Approaches for Community Question Answering
    Srba, Ivan
    Bielikova, Maria
    ACM TRANSACTIONS ON THE WEB, 2016, 10 (03)
  • [37] Vote Calibration in Community Question-Answering Systems
    Chen, Bee-Chung
    Dasgupta, Anirban
    Wang, Xuanhui
    Yang, Jie
    SIGIR 2012: PROCEEDINGS OF THE 35TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, 2012, : 781 - 790
  • [38] User authority ranking models for community question answering
    Raoa, Yanghui
    Xie, Haoran
    Liu, Xuebo
    Li, Qing
    Wang, Fu Lee
    Wong, Tak-Lam
    JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2016, 31 (05) : 2533 - 2542
  • [39] A Weighted Question Retrieval Model using Descriptive Information in Community Question Answering
    Hong, Beomseok
    Kim, Yanggon
    2016 RESEARCH IN ADAPTIVE AND CONVERGENT SYSTEMS, 2016, : 35 - 39
  • [40] A Semantic Graph based Topic Model for Question Retrieval in Community Question Answering
    Chen, Long
    Jose, Joemon M.
    Yu, Haitao
    Yuan, Fajie
    Zhang, Dell
    PROCEEDINGS OF THE NINTH ACM INTERNATIONAL CONFERENCE ON WEB SEARCH AND DATA MINING (WSDM'16), 2016, : 287 - 296