Detecting Collusive Spamming Activities in Community Question Answering

Cited by: 11
Authors
Liu, Yuli [1 ]
Liu, Yiqun [1 ]
Zhou, Ke [2 ]
Zhang, Min [1 ]
Ma, Shaoping [1 ]
Affiliations
[1] Tsinghua Univ, Dept Comp Sci & Technol, Beijing, Peoples R China
[2] Univ Nottingham, Sch Comp Sci, Nottingham, England
Source
PROCEEDINGS OF THE 26TH INTERNATIONAL CONFERENCE ON WORLD WIDE WEB (WWW'17) | 2017
Keywords
Community Question Answering; Crowdsourcing Manipulation; Spam Detection; Factor Graph;
DOI
10.1145/3038912.3052594
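Chinese Library Classification (CLC) number
TP [Automation Technology, Computer Technology];
Discipline classification code
0812 ;
Abstract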
Community Question Answering (CQA) portals provide rich sources of information on a variety of topics. However, the authenticity and quality of questions and answers (Q&As) have proven hard to control. More troublingly, the widespread growth of crowdsourcing websites has created a large-scale, potentially difficult-to-detect workforce that can inject manipulated, malicious content into CQA. Crowd workers who join the same crowdsourcing task for a promotion campaign in CQA collusively post deceptive Q&As to promote a target (a product or service). Such a collusive spamming group can fully control the sentiment toward the target. How can the structure and the attributes be utilized to detect manipulated Q&As? How can the collusive group be detected, and how can the group information be leveraged for the detection task? To shed light on these research questions, we propose a unified framework to tackle the challenge of detecting collusive spamming activities in CQA. First, we interpret the questions and answers in CQA as two independent networks. Second, we detect collusive question groups and answer groups from these two networks respectively by measuring the similarity of the contents posted within a short duration. Third, using attributes (individual-level and group-level) and correlations (user-based and content-based), we propose a combined factor graph model that detects deceptive Q&As simultaneously by combining two independent factor graphs. On a large-scale real-world data set, we find that the proposed framework can detect deceptive content at an early stage and outperforms a number of competitive baselines.
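The second step of the abstract (grouping contents that are similar and posted within a short duration) can be illustrated with a minimal Python sketch. This is not the authors' implementation: the abstract does not specify the similarity measure, the thresholds, or the grouping procedure, so the Jaccard token overlap, the `sim_threshold` and `window_seconds` parameters, the connected-component grouping, and the function names `jaccard` and `collusive_groups` are all assumptions made for illustration only.

```python
from itertools import combinations


def jaccard(a, b):
    """Jaccard similarity between two token sets (assumed measure)."""
    if not a and not b:
        return 0.0
    return len(a & b) / len(a | b)


def collusive_groups(posts, sim_threshold=0.5, window_seconds=3600):
    """Group posts that are textually similar and close together in time.

    posts: list of (post_id, unix_timestamp, text) tuples.
    Returns a sorted list of sorted post-id lists, one per group of size >= 2.
    Both thresholds are hypothetical values, not taken from the paper.
    """
    tokens = {pid: set(text.lower().split()) for pid, _, text in posts}
    times = {pid: ts for pid, ts, _ in posts}
    parent = {pid: pid for pid in tokens}  # union-find forest

    def find(x):
        # Follow parent links to the root, compressing the path on the way.
        while parent[x] != x:
            parent[x] = parent[parent[x]]
            x = parent[x]
        return x

    # Link every pair that is both similar enough and close enough in time.
    for a, b in combinations(tokens, 2):
        close_in_time = abs(times[a] - times[b]) <= window_seconds
        if close_in_time and jaccard(tokens[a], tokens[b]) >= sim_threshold:
            parent[find(a)] = find(b)

    # Collect connected components and keep only real groups.
    groups = {}
    for pid in tokens:
        groups.setdefault(find(pid), []).append(pid)
    return sorted(sorted(g) for g in groups.values() if len(g) > 1)


# Made-up example data: a1 and a2 are near-duplicates posted 10 minutes apart;
# a4 repeats a1 but more than a day later, so it falls outside the window.
posts = [
    ("a1", 0, "brand x cream cleared my skin fast"),
    ("a2", 600, "brand x cream cleared my skin so fast"),
    ("a3", 900, "try drinking more water and sleeping early"),
    ("a4", 90000, "brand x cream cleared my skin fast"),
]
print(collusive_groups(posts))  # [['a1', 'a2']]
```

In this sketch the same routine would be run once over questions and once over answers, mirroring the two independent networks described in the abstract; the resulting groups could then supply the group-level attributes mentioned in the third step.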
Pages: 1073 - 1082
Number of pages: 10
Related papers
50 in total
  • [21] Learning Distributed Representations of Data in Community Question Answering for Question Retrieval
    Zhang, Kai
    Wu, Wei
    Wang, Fang
    Zhou, Ming
    Li, Zhoujun
    PROCEEDINGS OF THE NINTH ACM INTERNATIONAL CONFERENCE ON WEB SEARCH AND DATA MINING (WSDM'16), 2016, : 533 - 542
  • [22] The Research of Multi-label Question Classification in Community Question Answering
    Shu, Peng
    Su, Lei
    Yuan, Liwei
    PROCEEDINGS OF THE 28TH CHINESE CONTROL AND DECISION CONFERENCE (2016 CCDC), 2016, : 5504 - 5507
  • [23] Finding Active Experts for Question Routing in Community Question Answering Services
    Kundu, Dipankar
    Pal, Rajat Kumar
    Mandal, Deba Prasad
    PATTERN RECOGNITION AND MACHINE INTELLIGENCE, PREMI 2019, PT II, 2019, 11942 : 320 - 327
  • [24] Distilling Essence of a Question: A Hierarchical Architecture for Question Quality in Community Question Answering Sites
    Ho, Mun Kit
    Tatinati, Sivanagaraja
    Khong, Andy W. H.
    2020 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2020,
  • [25] Answer Generating Methods for Community Question and Answering Portals
    Tao, Haoxiong
    Hao, Yu
    Zhu, Xiaoyan
    NATURAL LANGUAGE PROCESSING AND CHINESE COMPUTING, 2012, 333 : 249 - 259
  • [26] Multimodal representative answer extraction in community question answering
    Li, Ming
    Ma, Yating
    Li, Ying
    Bai, Yixue
    JOURNAL OF KING SAUD UNIVERSITY-COMPUTER AND INFORMATION SCIENCES, 2023, 35 (09)
  • [27] Early Detection of Promotion Campaigns in Community Question Answering
    Li, Xin
    Liu, Yiqun
    Zhang, Min
    Ma, Shaoping
    SOCIAL MEDIA PROCESSING, SMP 2016, 2016, 669 : 172 - 185
  • [28] User Embedding for Expert Finding in Community Question Answering
    Ghasemi, Negin
    Fatourechi, Ramin
    Momtazi, Saeedeh
    ACM TRANSACTIONS ON KNOWLEDGE DISCOVERY FROM DATA, 2021, 15 (04)
  • [29] The Social World of Content Abusers in Community Question Answering
    Kayes, Imrul
    Kourtellis, Nicolas
    Quercia, Daniele
    Iamnitchi, Adriana
    Bonchi, Francesco
    PROCEEDINGS OF THE 24TH INTERNATIONAL CONFERENCE ON WORLD WIDE WEB (WWW 2015), 2015, : 570 - 580
  • [30] CQAVis: Visual Text Analytics for Community Question Answering
    Hoque, Enamul
    Joty, Shafiq
    Marquez, Lluis
    Carenini, Giuseppe
    IUI'17: PROCEEDINGS OF THE 22ND INTERNATIONAL CONFERENCE ON INTELLIGENT USER INTERFACES, 2017, : 161 - 172