Toward Complete Structured Information Extraction from Radiology Reports Using Machine Learning

被引:0
作者
Jackson M. Steinkamp
Charles Chambers
Darco Lalevic
Hanna M. Zafar
Tessa S. Cook
机构
[1] Hospital of the University of Pennsylvania,Department of Radiology
[2] Boston University School of Medicine,undefined
来源
Journal of Digital Imaging | 2019年 / 32卷
关键词
Machine learning; Radiology reports; Natural language processing; Structured reporting;
D O I
暂无
中图分类号
学科分类号
摘要
Unstructured and semi-structured radiology reports represent an underutilized trove of information for machine learning (ML)-based clinical informatics applications, including abnormality tracking systems, research cohort identification, point-of-care summarization, semi-automated report writing, and as a source of weak data labels for training image processing systems. Clinical ML systems must be interpretable to ensure user trust. To create interpretable models applicable to all of these tasks, we can build general-purpose systems which extract all relevant human-level assertions or “facts” documented in reports; identifying these facts is an information extraction (IE) task. Previous IE work in radiology has focused on a limited set of information, and extracts isolated entities (i.e., single words such as “lesion” or “cyst”) rather than complete facts, which require the linking of multiple entities and modifiers. Here, we develop a prototype system to extract all useful information in abdominopelvic radiology reports (findings, recommendations, clinical history, procedures, imaging indications and limitations, etc.), in the form of complete, contextualized facts. We construct an information schema to capture the bulk of information in reports, develop real-time ML models to extract this information, and demonstrate the feasibility and performance of the system.
引用
收藏
页码:554 / 564
页数:10
相关论文
共 7 条
  • [1] Rubin DL(2017)Common Data Elements in Radiology Radiology 283 837-844
  • [2] Kahn CE(2018)ESR paper on structured reporting in radiology Insights Imaging 9 1-7
  • [3] Hassanpour S(2016)Information extraction from multi-institutional radiology reports Artif Intell Med 66 29-39
  • [4] Langlotz CP(2013)An enhanced CRFs-based system for information extraction from radiology reports J. Biomed. Inform. 46 425-435
  • [5] Esuli A(undefined)undefined undefined undefined undefined-undefined
  • [6] Marcheggiani D(undefined)undefined undefined undefined undefined-undefined
  • [7] Sebastiani F(undefined)undefined undefined undefined undefined-undefined