Quantifying the trustworthiness of social media content

被引:0
作者
Sai T. Moturu
Huan Liu
机构
[1] Arizona State University,School of Computing, Informatics and Decision Systems Engineering
[2] Massachusetts Institute of Technology,Media Lab
来源
Distributed and Parallel Databases | 2011年 / 29卷
关键词
Trust evaluation; Trustworthiness; Social media; Content; Quality;
D O I
暂无
中图分类号
学科分类号
摘要
The growing popularity of social media in recent years has resulted in the creation of an enormous amount of user-generated content. A significant portion of this information is useful and has proven to be a great source of knowledge. However, since much of this information has been contributed by strangers with little or no apparent reputation to speak of, there is no easy way to detect whether the content is trustworthy. Search engines are the gateways to knowledge but search relevance cannot guarantee that the content in the search results is trustworthy. A casual observer might not be able to differentiate between trustworthy and untrustworthy content. This work is focused on the problem of quantifying the value of such shared content with respect to its trustworthiness. In particular, the focus is on shared health content as the negative impact of acting on untrustworthy content is high in this domain. Health content from two social media applications, Wikipedia and Daily Strength, is used for this study. Sociological notions of trust are used to motivate the search for a solution. A two-step unsupervised, feature-driven approach is proposed for this purpose: a feature identification step in which relevant information categories are specified and suitable features are identified, and a quantification step for which various unsupervised scoring models are proposed. Results indicate that this approach is effective and can be adapted to disparate social media applications with ease.
引用
收藏
页码:239 / 260
页数:21
相关论文
共 14 条
[1]  
Childs S.(2005)Judging the quality of Internet-based health information Perform. Meas. Metr. 6 80-96
[2]  
Dondio P.(2007)Computational trust in web content quality: a comparative evaluation on the Wikipedia project Informatica 31 151-160
[3]  
Barrett S.(2002)Cumulated gain-based evaluation of IR techniques ACM Trans. Inf. Sys. 20 422-446
[4]  
Järvelin K.(2005)Health on the Internet: implications for health promotion Health Educ. Res. 21 78-86
[5]  
Kekäläinen J.(2002)Discretization: an enabling technique Data Min. Knowl. Discov. 6 393-423
[6]  
Korp P.(2008)Computing information retrieval performance measures efficiently in the presence of tied scores Lect. Notes Comput. Sci. 4956 414-421
[7]  
Liu H.(2006)Perception of hazards: the role of social trust and knowledge Risk Anal. 20 713-720
[8]  
Hussain F.(undefined)undefined undefined undefined undefined-undefined
[9]  
Tan C.(undefined)undefined undefined undefined undefined-undefined
[10]  
Dash M.(undefined)undefined undefined undefined undefined-undefined