On Quality Assesement in Wikipedia Articles Based on Markov Random Fields

被引:2
作者
Kleminski, Rajmund [1 ]
Kajdanowicz, Tomasz [1 ]
Bartusiak, Roman [1 ]
Kazienko, Przemyslaw [1 ]
机构
[1] Wroclaw Univ Sci & Technol, Fac Comp Sci & Management, Wroclaw, Poland
来源
INTELLIGENT INFORMATION AND DATABASE SYSTEMS, ACIIDS 2017, PT I | 2017年 / 10191卷
关键词
Wikipedia; Quality prediction; Iterative classification; CLASSIFICATION; NETWORK;
D O I
10.1007/978-3-319-54472-4_73
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This article investigates the possibility of accurate quality prediction of resources generated by communities based on the crowd-generated content. We use data from Wikipedia, the prime example of community-run site, as our object of study. We define the quality as a distribution of user-assigned grades across a predefined range of possible scores and present a measure of distribution similarity to quantify the accuracy of a prediction. The proposed method of quality prediction is based on Markov Random Field and its Loopy Belief Propagation implementation. Based on our results, we highlight key problems in the approach as presented, as well as trade-offs caused by relying solely on network structure and characteristics, excluding metadata. The overall results of content quality prediction are promising in homophilic networks.
引用
收藏
页码:782 / 791
页数:10
相关论文
共 13 条
  • [1] [Anonymous], 2002, P 18 C UNCERTAINTY A
  • [2] Dalip DH, 2009, ACM-IEEE J CONF DIG, P295
  • [3] DeLaCalzada G., 2010, P 4 WORKSH INF CRED, P11, DOI [10.1145/1772938.1772943, DOI 10.1145/1772938.1772943]
  • [4] Hu M., 2007, P 16 ACM C C INF KNO, P243, DOI [10.1145/1321440.1321476, DOI 10.1145/1321440.1321476]
  • [5] Label-dependent node classification in the network
    Kazienko, Przemyslaw
    Kajdanowicz, Tomasz
    [J]. NEUROCOMPUTING, 2012, 75 (01) : 199 - 209
  • [6] Leskovec Jure, 2014, 1 INT WORKSH GRAPH D
  • [7] Liu J., 2011, ACM Transactions on Management Information Systems, V2, P1, DOI [10.1145/1985347.1985352, DOI 10.1145/1985347.1985352]
  • [8] Malewicz G., 2010, P 2010 ACM SIGMOD IN, P135, DOI [DOI 10.1145/1807167.1807184, 10.1145/1807167.1807184]
  • [9] McPherson M., ANN REV SOCIOL, V27, P415
  • [10] Collective Classification in Network Data
    Sen, Prithviraj
    Namata, Galileo
    Bilgic, Mustafa
    Getoor, Lise
    Gallagher, Brian
    Eliassi-Rad, Tina
    [J]. AI MAGAZINE, 2008, 29 (03) : 93 - 106