How valuable is medical social media data? Content analysis of the medical web

被引:109
作者
Denecke, Kerstin [1 ]
Nejdl, Wolfgang [1 ]
机构
[1] Leibniz Univ Hannover, Res Ctr L3S, D-30167 Hannover, Germany
关键词
Search engines - Health - Portals - Data mining - Social networking (online) - Classification (of information);
D O I
10.1016/j.ins.2009.01.025
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
it is still an open question where to search for complying a specific information need due to the large amount and diversity of information available. In this paper, a content analysis of health-related information provided in the Web is performed to get an overview on the medical content available. In particular, the content of medical Question & Answer Portals, medical weblogs, medical reviews and Wikis is compared. For this purpose, medical concepts are extracted from the text material with existing extraction technology. Based on these concepts, the content of the different knowledge resources is compared. Since medical weblogs describe experiences as well as information, it is of large interest to be able to distinguish between informative and affective posts. For this reason, a method to classify blogs based on their information content is presented, which exploits high-level features describing the medical and affective content of blog posts. The results show that there are substantial differences in the content of various health-related Web resources. Weblogs and answer portals mainly deal with diseases and medications. The Wiki and the encyclopedia provide more information on anatomy and procedures. While patients and nurses describe personal aspects of their life, doctors aim to present health-related information in their blog posts. The knowledge on content differences and information content can be exploited by search engines to improve ranking. search and to direct users to appropriate knowledge sources. (c) 2009 Elsevier Inc. All rights reserved.
引用
收藏
页码:1870 / 1880
页数:11
相关论文
共 28 条
[1]  
[Anonymous], 2004, P 13 INT C WORLD WID
[2]  
[Anonymous], Data Mining Practical Machine Learning Tools and Techniques with Java
[3]  
[Anonymous], 2007, ACL '07: Proceedings of the 45th Annual Meeting of the Association of Computational Linguistics
[4]  
[Anonymous], 2008, P 2008 INT C WEB SEA, DOI [DOI 10.1145/1341531.1341557, 10.1145/1341531.1341557]
[5]  
Aronson AR, 2001, J AM MED INFORM ASSN, P17
[6]  
BATH PA, 2001, J INF SCI, V34, P501
[7]  
Chaumartin F., 2007, P 4 INT WORKSH SEM E, P422, DOI DOI 10.3115/1621474.1621568
[8]   Semantic Structuring of and Information Extraction from Medical Documents Using the UMLS [J].
Denecke, K. .
METHODS OF INFORMATION IN MEDICINE, 2008, 47 (05) :425-434
[9]  
ESULI A., 2006, P 5 INT C LANGUAGE R
[10]  
Eysenbach G, 2003, AMIA Annu Symp Proc, P225