Developing a natural language processing application for measuring the quality of colonoscopy procedures

被引:69
作者
Harkema, Henk [1 ]
Chapman, Wendy W. [2 ]
Saul, Melissa [1 ]
Dellon, Evan S. [3 ]
Schoen, Robert E. [4 ]
Mehrotra, Ateev [5 ]
机构
[1] Univ Pittsburgh, Dept Biomed Informat, Pittsburgh, PA 15260 USA
[2] Univ Calif San Diego, Div Biomed Informat, La Jolla, CA 92093 USA
[3] Univ N Carolina, Div Gastroenterol & Hepatol, Chapel Hill, NC USA
[4] Univ Pittsburgh, Div Gastroenterol Hepatol & Nutr, Pittsburgh, PA 15260 USA
[5] Univ Pittsburgh, Div Gen Internal Med, Pittsburgh, PA 15260 USA
基金
美国国家卫生研究院;
关键词
ELECTRONIC HEALTH RECORDS; SOCIETY-TASK-FORCE; COLORECTAL-CANCER; OF-CARE; PERFORMANCE-MEASUREMENT; MEDICAL-RECORDS; CLINICAL-DATA; SYSTEM; INFORMATION; EXTRACTION;
D O I
10.1136/amiajnl-2011-000431
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Objective The quality of colonoscopy procedures for colorectal cancer screening is often inadequate and varies widely among physicians. Routine measurement of quality is limited by the costs of manual review of free-text patient charts. Our goal was to develop a natural language processing (NLP) application to measure colonoscopy quality. Materials and methods Using a set of quality measures published by physician specialty societies, we implemented an NLP engine that extracts 21 variables for 19 quality measures from free-text colonoscopy and pathology reports. We evaluated the performance of the NLP engine on a test set of 453 colonoscopy reports and 226 pathology reports, considering accuracy in extracting the values of the target variables from text, and the reliability of the outcomes of the quality measures as computed from the NLP-extracted information. Results The average accuracy of the NLP engine over all variables was 0.89 (range: 0.62-1.0) and the average F measure over all variables was 0.74 (range: 0.49-0.89). The average agreement score, measured as Cohen's kappa, between the manually established and NLP-derived outcomes of the quality measures was 0.62 (range: 0.09-0.86). Discussion For nine of the 19 colonoscopy quality measures, the agreement score was 0.70 or above, which we consider a sufficient score for the NLP-derived outcomes of these measures to be practically useful for quality measurement. Conclusion The use of NLP for information extraction from free-text colonoscopy and pathology reports creates opportunities for large scale, routine quality measurement, which can support quality improvement in colonoscopy care.
引用
收藏
页码:I150 / I156
页数:7
相关论文
共 73 条
[21]  
Chapman W.W., 2010, CLIN MED RES, V8, P53
[22]   The urgent need to improve health care quality - Institute of medicine National Roundtable on Health Care Quality [J].
Chassin, MR ;
Galvin, RW .
JAMA-JOURNAL OF THE AMERICAN MEDICAL ASSOCIATION, 1998, 280 (11) :1000-1005
[23]   Systematic review: Impact of health information technology on quality, efficiency, and costs of medical care [J].
Chaudhry, Basit ;
Wang, Jerome ;
Wu, Shinyi ;
Maglione, Margaret ;
Mojica, Walter ;
Roth, Elizabeth ;
Morton, Sally C. ;
Shekelle, Paul G. .
ANNALS OF INTERNAL MEDICINE, 2006, 144 (10) :742-752
[24]   Endoscopist is comparable to age and gender as predictor of adenomas at colonoscopy [J].
Chen, SC ;
Rex, DK .
AMERICAN JOURNAL OF GASTROENTEROLOGY, 2005, 100 (09) :S393-S393
[25]   Automated evaluation of electronic discharge notes to assess quality of care for cardiovascular diseases using Medical Language Extraction and Encoding System (MedLEE) [J].
Chiang, Jung-Hsien ;
Lin, Jou-Wei ;
Yang, Chen-Wei .
JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, 2010, 17 (03) :245-252
[26]  
Christensen L.M., 2009, P WORKSHOP BIONLP, P19
[27]   caTIES: a grid based system for coding and retrieval of surgical pathology reports and tissue specimens in support of translational research [J].
Crowley, Rebecca S. ;
Castine, Melissa ;
Mitchell, Kevin ;
Chavan, Girish ;
McSherry, Tara ;
Feldman, Michael .
JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, 2010, 17 (03) :253-264
[28]   Evaluation of a generalizable approach to clinical information retrieval using the automated retrieval console (ARC) [J].
D'Avolio, Leonard W. ;
Nguyen, Thien M. ;
Farwell, Wildon R. ;
Chen, Yongming ;
Fitzmeyer, Felicia ;
Harris, Owen M. ;
Fiore, Louis D. .
JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, 2010, 17 (04) :375-382
[29]   Facilitating Clinical Outcomes Assessment through the automated identification of quality measures for prostate cancer surgery [J].
D'Avouo, Leona W. ;
Litwin, Mark S. ;
Roges, Selwyn O., Jr. ;
Bui, Alex A. T. .
JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, 2008, 15 (03) :341-348
[30]   Natural Language Processing Improves Identification of Colorectal Cancer Testing in the Electronic Medical Record [J].
Denny, Joshua C. ;
Choma, Neesha N. ;
Peterson, Josh F. ;
Miller, Randolph A. ;
Bastarache, Lisa ;
Li, Ming ;
Peterson, Neeraja B. .
MEDICAL DECISION MAKING, 2012, 32 (01) :188-197