Post-Structuring Radiology Reports of Breast Cancer Patients for Clinical Quality Assurance

被引:11
作者
Pathak, Shreyasi [1 ]
van Rossen, Jorit [2 ]
Vijlbrief, Onno [2 ]
Geerdink, Jeroen [2 ]
Seifert, Christin [1 ]
van Keulen, Maurice [1 ]
机构
[1] Univ Twente, Data Management & Biometr Grp, NL-7522 NB Enschede, Netherlands
[2] Hosp Grp Twente ZGT, NL-7555 DL Hengelo, Netherlands
关键词
Radiology; Breast cancer; Quality assurance; Standards; Task analysis; Machine learning; Natural language processing; automatic structuring; post-structuring; radiology reports; conditional random field; INFORMATION EXTRACTION; FREE-TEXT;
D O I
10.1109/TCBB.2019.2914678
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Hospitals often set protocols based on well defined standards to maintain the quality of patient reports. To ensure that the clinicians conform to the protocols, quality assurance of these reports is needed. Patient reports are currently written in free-text format, which complicates the task of quality assurance. In this paper, we present a machine learning based natural language processing system for automatic quality assurance of radiology reports on breast cancer. This is achieved in three steps: we i) identify the top-level structure (headings) of the report, ii) classify the report content into the top-level headings, and iii) convert the free-text detailed findings in the report to a semi-structured format (post-structuring). Top level structure and content of report were predicted with an F1 score of 0.97 and 0.94, respectively, using Support Vector Machine (SVM) classifiers. For automatic structuring, our proposed hierarchical Conditional Random Field (CRF) outperformed the baseline CRF with an F1 score of 0.78 versus 0.71. The determined structure of the report is represented in semi-structured XML format of the free-text report, which helps to easily visualize the conformance of the findings to the protocols. This format also allows easy extraction of specific information for other purposes such as search, evaluation, and research.
引用
收藏
页码:1883 / 1894
页数:12
相关论文
共 22 条
[1]   Quality Initiatives Key Performance Indicators for Measuring and Improving Radiology Department Performance [J].
Abujudeh, Hani H. ;
Kaewlai, Rathachai ;
Asfaw, Benjamin A. ;
Thrall, James H. .
RADIOGRAPHICS, 2010, 30 (03) :571-580
[2]  
[Anonymous], 2013, Breast Imaging Reporting and Data System
[3]   An enhanced CRFs-based system for information extraction from radiology reports [J].
Esuli, Andrea ;
Marcheggiani, Diego ;
Sebastiani, Fabrizio .
JOURNAL OF BIOMEDICAL INFORMATICS, 2013, 46 (03) :425-435
[4]   Information extraction from multi-institutional radiology reports [J].
Hassanpour, Saeed ;
Langlotz, Curtis P. .
ARTIFICIAL INTELLIGENCE IN MEDICINE, 2016, 66 :29-39
[5]   Quality of breast imaging reports falls short of recommended standards [J].
Houssami, Nehmat ;
Boyages, John ;
Stuart, Kirsty ;
Brennan, Meagan .
BREAST, 2007, 16 (03) :271-279
[6]  
Johnson Annette J, 2004, J Am Coll Radiol, V1, P497, DOI 10.1016/j.jacr.2004.02.019
[7]   Cohort Study of Structured Reporting Compared with Conventional Dictation [J].
Johnson, Annette J. ;
Chen, Michael Y. M. ;
Swan, J. Shannon ;
Applegate, Kimberly E. ;
Littenberg, Benjamin .
RADIOLOGY, 2009, 253 (01) :74-80
[8]   Toward Best Practices in Radiology Reporting [J].
Kahn, Charles E., Jr. ;
Langlotz, Curtis P. ;
Burnside, Elizabeth S. ;
Carrino, John A. ;
Channin, David S. ;
Hovsepian, David M. ;
Rubin, Daniel L. .
RADIOLOGY, 2009, 252 (03) :852-856
[9]  
Lafferty J., 2001, CONDITIONAL RANDOM F
[10]   RadLex: A new method for indexing online educational materials [J].
Langlotz, Curtis P. .
RADIOGRAPHICS, 2006, 26 (06) :1595-1597