Detecting Collusive Spamming Activities in Community Question Answering

被引:11
|
作者
Liu, Yuli [1 ]
Liu, Yiqun [1 ]
Zhou, Ke [2 ]
Zhang, Min [1 ]
Ma, Shaoping [1 ]
机构
[1] Tsinghua Univ, Dept Comp Sci & Technol, Beijing, Peoples R China
[2] Univ Nottingham, Sch Comp Sci, Nottingham, England
来源
PROCEEDINGS OF THE 26TH INTERNATIONAL CONFERENCE ON WORLD WIDE WEB (WWW'17) | 2017年
关键词
Community Question Answering; Crowdsourcing Manipulation; Spam Detection; Factor Graph;
D O I
10.1145/3038912.3052594
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Community Question Answering (CQA) portals provide rich sources of information on a variety of topics. However, the authenticity and quality of questions and answers (Q&As) has proven hard to control. In a troubling direction, the widespread growth of crowdsourcing websites has created a large-scale, potentially difficult-to-detect workforce to manipulate malicious contents in CQA. The crowd workers who join the same crowdsourcing task about promotion campaigns in CQA collusively manipulate deceptive Q&As for promoting a target (product or service). The collusive spamming group can fully control the sentiment of the target. How to utilize the structure and the attributes for detecting manipulated Q&As? How to detect the collusive group and leverage the group information for the detection task? To shed light on these research questions, we propose a unified framework to tackle the challenge of detecting collusive spamming activities of CQA. First, we interpret the questions and answers in CQA as two independent networks. Second, we detect collusive question groups and answer groups from these two networks respectively by measuring the similarity of the contents posted within a short duration. Third, using attributes (individual-level and group-level) and correlations (user-based and content-based), we proposed a combined factor graph model to detect deceptive Q&As simultaneously by combining two independent factor graphs. With a large-scale practical data set, we find that the proposed framework can detect deceptive contents at early stage, and outperforms a number of competitive baselines.
引用
收藏
页码:1073 / 1082
页数:10
相关论文
共 50 条
  • [41] Method of Predicting User Professionalism Based on Question Granularity in Community Question Answering
    Zhu M.
    Tian W.
    Peng D.
    Su Y.
    Niu H.
    Gongcheng Kexue Yu Jishu/Advanced Engineering Sciences, 2019, 51 (01): : 173 - 180
  • [42] HCA: Hierarchical Compare Aggregate model for question retrieval in community question answering
    Zahedi, Mohammad Sadegh
    Rahgozar, Maseud
    Zoroofi, Reza Aghaeizadeh
    INFORMATION PROCESSING & MANAGEMENT, 2020, 57 (06)
  • [43] Where to Post: Routing Questions to Right Community in Community Question Answering Systems
    Kumar, Nagendra
    Srikanth, G.
    Mudda, Karthik Yadav
    Trishal, Gayam
    Konjengbam, Anand
    Singh, Manish
    PROCEEDINGS OF THE 6TH ACM IKDD CODS AND 24TH COMAD, 2019, : 136 - 142
  • [44] Convolutional neural networks for expert recommendation in community question answering
    Jian WANG
    Jiqing SUN
    Hongfei LIN
    Hualei DONG
    Shaowu ZHANG
    ScienceChina(InformationSciences), 2017, 60 (11) : 19 - 27
  • [45] Convolutional neural networks for expert recommendation in community question answering
    Wang, Jian
    Sun, Jiqing
    Lin, Hongfei
    Dong, Hualei
    Zhang, Shaowu
    SCIENCE CHINA-INFORMATION SCIENCES, 2017, 60 (11)
  • [46] Expert recommendation in community question answering: a review and future direction
    Yang Z.
    Liu Q.
    Sun B.
    Zhao X.
    International Journal of Crowd Science, 2019, 3 (03) : 348 - 372
  • [47] Exploring Question Selection Bias to Identify Experts and Potential Experts in Community Question Answering
    Pal, Aditya
    Harper, F. Maxwell
    Konstan, Joseph A.
    ACM TRANSACTIONS ON INFORMATION SYSTEMS, 2012, 30 (02)
  • [48] Question Quality Analysis and Prediction in Community Question Answering Services with Coupled Mutual Reinforcement
    Liu, Jinwei
    Shen, Haiying
    Yu, Lei
    IEEE TRANSACTIONS ON SERVICES COMPUTING, 2017, 10 (02) : 286 - 301
  • [49] CQARank: Jointly Model Topics and Expertise in Community Question Answering
    Yang, Liu
    Qiu, Minghui
    Gottipati, Swapna
    Zhu, Feida
    Jiang, Jing
    Sun, Huiping
    Chen, Zhong
    PROCEEDINGS OF THE 22ND ACM INTERNATIONAL CONFERENCE ON INFORMATION & KNOWLEDGE MANAGEMENT (CIKM'13), 2013, : 99 - 108
  • [50] Modeling Information-Seeker Satisfaction in Community Question Answering
    Agichtein, Eugene
    Liu, Yandong
    Bian, Jiang
    ACM TRANSACTIONS ON KNOWLEDGE DISCOVERY FROM DATA, 2009, 3 (02)