Automated Extraction of BI-RADS Final Assessment Categories from Radiology Reports with Natural Language Processing

被引:0
作者
Dorothy A. Sippo
Graham I. Warden
Katherine P. Andriole
Ronilda Lacson
Ichiro Ikuta
Robyn L. Birdwell
Ramin Khorasani
机构
[1] Harvard Medical School,Department of Radiology, Brigham and Women’s Hospital
[2] Johns Hopkins University School of Medicine Green Spring Station,Russell H. Morgan Department of Radiology and Radiological Science
[3] United States Air Force,Medical Corp
[4] Department of Radiology,undefined
[5] Norwalk Hospital,undefined
[6] Yale School of Medicine,undefined
来源
Journal of Digital Imaging | 2013年 / 26卷
关键词
Breast Imaging Reporting and Data System (BI-RADS); Natural language processing; Imaging informatics; Breast;
D O I
暂无
中图分类号
学科分类号
摘要
The objective of this study is to evaluate a natural language processing (NLP) algorithm that determines American College of Radiology Breast Imaging Reporting and Data System (BI-RADS) final assessment categories from radiology reports. This HIPAA-compliant study was granted institutional review board approval with waiver of informed consent. This cross-sectional study involved 1,165 breast imaging reports in the electronic medical record (EMR) from a tertiary care academic breast imaging center from 2009. Reports included screening mammography, diagnostic mammography, breast ultrasound, combined diagnostic mammography and breast ultrasound, and breast magnetic resonance imaging studies. Over 220 reports were included from each study type. The recall (sensitivity) and precision (positive predictive value) of a NLP algorithm to collect BI-RADS final assessment categories stated in the report final text was evaluated against a manual human review standard reference. For all breast imaging reports, the NLP algorithm demonstrated a recall of 100.0 % (95 % confidence interval (CI), 99.7, 100.0 %) and a precision of 96.6 % (95 % CI, 95.4, 97.5 %) for correct identification of BI-RADS final assessment categories. The NLP algorithm demonstrated high recall and precision for extraction of BI-RADS final assessment categories from the free text of breast imaging reports. NLP may provide an accurate, scalable data extraction mechanism from reports within EMRs to create databases to track breast imaging performance measures and facilitate optimal breast cancer population management strategies.
引用
收藏
页码:989 / 994
页数:5
相关论文
共 62 条
[1]  
Ballard-Barbash R(1997)Breast Cancer Surveillance Consortium: a national mammography screening and outcomes database AJR Am J Roentgenol 169 1001-1008
[2]  
Taplin SH(2007)Auditing your breast imaging practice: an evidence-based approach Semin Roentgenol 42 211-217
[3]  
Yankaskas BC(2002)Use of natural language processing to translate clinical information from a database of 889,921 chest radiographic reports Radiology 224 157-163
[4]  
Ernster VL(2005)Application of recently developed computer algorithm for automatic classification of unstructured radiology reports: validation study Radiology 234 323-329
[5]  
Rosenberg RD(2012)Repeat abdominal imaging examinations in a tertiary care hospital Am J Med 125 155-161
[6]  
Carney PA(2010)Discerning tumor status from unstructured MRI reports—completeness of information in existing reports and utility of automated natural language processing J Digit Imaging 23 119-132
[7]  
Barlow WE(2012)Automatically correlating clinical findings and body locations in radiology reports using MedLEE J Digit Imaging 25 240-249
[8]  
Geller BM(2012)Automatic classification of mammography reports by BI-RADS breast tissue composition class J Am Med Inform Assoc 19 913-916
[9]  
Kerlikowske K(2009)Rule-based information extraction from patients' clinical data J Biomed Inform 42 923-936
[10]  
Edwards BK(2006)Systematic review: impact of health information technology on quality, efficiency, and costs of medical care Ann Intern Med 144 742-752