Use of Sentiment Analysis for Capturing Patient Experience From Free-Text Comments Posted Online

被引:192
作者
Greaves, Felix [1 ,2 ]
Ramirez-Cano, Daniel [2 ]
Millett, Christopher [1 ]
Darzi, Ara [2 ]
Donaldson, Liam [2 ]
机构
[1] Univ London Imperial Coll Sci Technol & Med, Dept Primary Care & Publ Hlth, London W6 8RF, England
[2] Univ London Imperial Coll Sci Technol & Med, Ctr Hlth Policy, London W6 8RF, England
关键词
Internet; patient experience; quality; machine learning; SOCIAL MEDIA; HEALTH-CARE; QUALITY; INTERNET; RATINGS; ASSOCIATIONS;
D O I
10.2196/jmir.2721
中图分类号
R19 [保健组织与事业(卫生事业管理)];
学科分类号
摘要
Background: There are large amounts of unstructured, free-text information about quality of health care available on the Internet in blogs, social networks, and on physician rating websites that are not captured in a systematic way. New analytical techniques, such as sentiment analysis, may allow us to understand and use this information more effectively to improve the quality of health care. Objective: We attempted to use machine learning to understand patients' unstructured comments about their care. We used sentiment analysis techniques to categorize online free-text comments by patients as either positive or negative descriptions of their health care. We tried to automatically predict whether a patient would recommend a hospital, whether the hospital was clean, and whether they were treated with dignity from their free-text description, compared to the patient's own quantitative rating of their care. Methods: We applied machine learning techniques to all 6412 online comments about hospitals on the English National Health Service website in 2010 using Weka data-mining software. We also compared the results obtained from sentiment analysis with the paper-based national inpatient survey results at the hospital level using Spearman rank correlation for all 161 acute adult hospital trusts in England. Results: There was 81%, 84%, and 89% agreement between quantitative ratings of care and those derived from free-text comments using sentiment analysis for cleanliness, being treated with dignity, and overall recommendation of hospital respectively (kappa scores:.40-.74, P<.001 for all). We observed mild to moderate associations between our machine learning predictions and responses to the large patient survey for the three categories examined (Spearman rho 0.37-0.51, P<.001 for all). Conclusions: The prediction accuracy that we have achieved using this machine learning process suggests that we are able to predict, from free-text, a reasonably accurate assessment of patients' opinion about different performance aspects of a hospital and that these machine learning predictions are associated with results of more conventional surveys.
引用
收藏
页数:9
相关论文
共 35 条
[1]  
Acute Trusts: Adult Inpatients Survey, 2010, AC TRUSTS AD INP SUR
[2]  
Alemi Farrokh, 2012, Qual Manag Health Care, V21, P9, DOI 10.1097/QMH.0b013e3182417fc4
[3]  
[Anonymous], 2001, CROSS QUAL CHASM NEW
[4]  
[Anonymous], 2010, P INT C LANG RES EV
[5]  
[Anonymous], 2012, Synth. Lectures Human Lang. Technol., DOI [10.2200/S00416ED1V01Y201204HLT016, DOI 10.2200/S00416ED1V01Y201204HLT016]
[6]  
[Anonymous], 2012, DEM INT US
[7]  
[Anonymous], 1997, ICML
[8]  
[Anonymous], 2013, NHS CHOIC DAT HOSP C
[9]  
[Anonymous], 2011, SOCIAL LIFE HLTH INF
[10]  
[Anonymous], 2012, POW INF PUTT ALL US