A Method for Predicting Wikipedia Editors' Editing Interest Based on a Factor Graph Model

被引:3
作者
Zhang, Haisu [1 ]
Zhang, Sheng [1 ]
Wu, Zhaolin [1 ]
Huang, Liwei [2 ]
Ma, Yutao [3 ,4 ]
机构
[1] Acad Natl Def Informat, Wuhan, Peoples R China
[2] Beijing Inst Remote Sensing, Beijing, Peoples R China
[3] Wuhan Univ, Sch Comp, Wuhan, Peoples R China
[4] WISET Automat Co Ltd, Wuhan, Peoples R China
基金
中国国家自然科学基金;
关键词
Factor Graph; Interest Prediction; Probabilistic Graphical Model; Social Network Mining; Wikipedia;
D O I
10.4018/IJWSR.2016070101
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Recruiting or recommending appropriate potential Wikipedia editors to edit a specific Wikipedia entry (or article) can play an important role in improving the quality and credibility of Wikipedia. According to empirical observations based on a small-scale dataset collected from Wikipedia, this paper proposes an Interest Prediction Factor Graph (IPFG) model, which is characterized by editor's social properties, hyperlinks between Wikipedia entries, the categories of an entry and other important features, to predict an editor's editing interest in types of Wikipedia entries. Furthermore, the paper suggests a parameter learning algorithm based on the gradient descent algorithm and the Loopy Sum-Product algorithm for factor graphs. An experiment on a Wikipedia dataset (with different frequencies of data collection) shows that the average prediction accuracy (F1 score) of the IPFG model for data collected quarterly could be up to 0.875, which is approximately 0.49 higher than that of a collaborative filtering approach. In addition, the paper analyzes how incomplete social properties and editing bursts affect the prediction accuracy of the IPFG model. The authors' results can provide insight into effective Wikipedia article tossing and can improve the quality of special entries that belong to specific categories by means of collective collaboration.
引用
收藏
页码:1 / 25
页数:25
相关论文
共 21 条
  • [1] [Anonymous], 2007, Proceedings of the International Symposium on Wikis, DOI [10.1145/1296951.1296968, DOI 10.1145/1296951.1296968]
  • [2] [Anonymous], 2012, WSDM
  • [3] The origin of bursts and heavy tails in human dynamics
    Barabási, AL
    [J]. NATURE, 2005, 435 (7039) : 207 - 211
  • [4] Internet encyclopaedias go head to head
    Giles, J
    [J]. NATURE, 2005, 438 (7070) : 900 - 901
  • [5] Haisu Zhang, 2011, 2011 IEEE International Conference on Granular Computing, P790, DOI 10.1109/GRC.2011.6122699
  • [6] Kittur A, 2008, CSCW: 2008 ACM CONFERENCE ON COMPUTER SUPPORTED COOPERATIVE WORK, CONFERENCE PROCEEDINGS, P37
  • [7] Koller D, 2009, Probabilistic graphical models: principles and techniques
  • [8] Factor graphs and the sum-product algorithm
    Kschischang, FR
    Frey, BJ
    Loeliger, HA
    [J]. IEEE TRANSACTIONS ON INFORMATION THEORY, 2001, 47 (02) : 498 - 519
  • [9] Lazarsfeld P.F., 1954, Freedom and Control in Modern Society, V18, P18
  • [10] Birds of a feather: Homophily in social networks
    McPherson, M
    Smith-Lovin, L
    Cook, JM
    [J]. ANNUAL REVIEW OF SOCIOLOGY, 2001, 27 : 415 - 444