Developing a natural language processing application for measuring the quality of colonoscopy procedures

被引:69
作者
Harkema, Henk [1 ]
Chapman, Wendy W. [2 ]
Saul, Melissa [1 ]
Dellon, Evan S. [3 ]
Schoen, Robert E. [4 ]
Mehrotra, Ateev [5 ]
机构
[1] Univ Pittsburgh, Dept Biomed Informat, Pittsburgh, PA 15260 USA
[2] Univ Calif San Diego, Div Biomed Informat, La Jolla, CA 92093 USA
[3] Univ N Carolina, Div Gastroenterol & Hepatol, Chapel Hill, NC USA
[4] Univ Pittsburgh, Div Gastroenterol Hepatol & Nutr, Pittsburgh, PA 15260 USA
[5] Univ Pittsburgh, Div Gen Internal Med, Pittsburgh, PA 15260 USA
基金
美国国家卫生研究院;
关键词
ELECTRONIC HEALTH RECORDS; SOCIETY-TASK-FORCE; COLORECTAL-CANCER; OF-CARE; PERFORMANCE-MEASUREMENT; MEDICAL-RECORDS; CLINICAL-DATA; SYSTEM; INFORMATION; EXTRACTION;
D O I
10.1136/amiajnl-2011-000431
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Objective The quality of colonoscopy procedures for colorectal cancer screening is often inadequate and varies widely among physicians. Routine measurement of quality is limited by the costs of manual review of free-text patient charts. Our goal was to develop a natural language processing (NLP) application to measure colonoscopy quality. Materials and methods Using a set of quality measures published by physician specialty societies, we implemented an NLP engine that extracts 21 variables for 19 quality measures from free-text colonoscopy and pathology reports. We evaluated the performance of the NLP engine on a test set of 453 colonoscopy reports and 226 pathology reports, considering accuracy in extracting the values of the target variables from text, and the reliability of the outcomes of the quality measures as computed from the NLP-extracted information. Results The average accuracy of the NLP engine over all variables was 0.89 (range: 0.62-1.0) and the average F measure over all variables was 0.74 (range: 0.49-0.89). The average agreement score, measured as Cohen's kappa, between the manually established and NLP-derived outcomes of the quality measures was 0.62 (range: 0.09-0.86). Discussion For nine of the 19 colonoscopy quality measures, the agreement score was 0.70 or above, which we consider a sufficient score for the NLP-derived outcomes of these measures to be practically useful for quality measurement. Conclusion The use of NLP for information extraction from free-text colonoscopy and pathology reports creates opportunities for large scale, routine quality measurement, which can support quality improvement in colonoscopy care.
引用
收藏
页码:I150 / I156
页数:7
相关论文
共 73 条
[1]   Natural language processing for the development of a clinical registry: a validation study in intraductal papillary mucinous neoplasms [J].
Al-Haddad, Mohammad A. ;
Friedlin, Jeff ;
Kesterson, Joe ;
Waters, Joshua A. ;
Aguilar-Saavedra, Juan R. ;
Schmidt, C. Max .
HPB, 2010, 12 (10) :688-695
[2]  
[Anonymous], 1995, Natural language understanding
[3]  
[Anonymous], 2009, COLONOSCOPY PRINCIPL
[4]  
[Anonymous], 9 ANN C INT S DIS SU
[5]  
[Anonymous], P 40 ANN M ASS COMP
[6]  
[Anonymous], PAT SAF ACH NEW STAN
[7]  
[Anonymous], METAMAP TRANSF MMTX
[8]  
[Anonymous], CROSS QUAL CHASM NEW
[9]  
[Anonymous], AM J MANAG CARE 1
[10]  
[Anonymous], APPLYING NA IN PRESS