Agile Natural Language Processing Model for Pathology Knowledge Extraction and Integration with Clinical Enterprise Data Warehouse

Cited by: 0
Authors
Baghal, Ahmad [1]
Al-Shukri, Shaymaa [1]
Kumari, Annu [1]
Affiliations
[1] Univ Arkansas Med Sci, Biomed Informat, Coll Med, Little Rock, AR 72205 USA
Source
2019 SIXTH INTERNATIONAL CONFERENCE ON SOCIAL NETWORKS ANALYSIS, MANAGEMENT AND SECURITY (SNAMS) | 2019
Keywords
NLP; free-text; pathology; cancer; EMR; data-warehouse;
DOI
10.1109/snams.2019.8931828
Chinese Library Classification (CLC) number
TP [Automation Technology, Computer Technology];
Discipline classification code
0812;
Abstract
Electronic Medical Record (EMR) systems store patients' medical information in either structured form or unstructured, free-text form, such as clinical reports. Pathology notes are a type of clinical report that may store cancer-related information such as diagnoses and descriptions of tissue samples. Data in clinical documents can provide up to 20% additional knowledge beyond the structured data stored in discrete fields. Extracting information from documents can be time-consuming and non-trivial. We evaluated several open-source natural language processing (NLP) tools for extracting terms of interest from pathology documents and integrating them with data already stored in the institutional enterprise data warehouse (EDW). The evaluated NLP tools provide various features, but none suits our immediate need of extracting key pathology terms. This paper discusses our in-house framework for identifying and extracting pathology data points from pathology documents, curating them, and loading them into the EDW. The performance of the proposed model was evaluated, and the extracted terms were validated against data stored in the institutional electronic medical record system.
Pages: 419-422
Page count: 4
Related papers
13 in total
  • [1] Baghal Ahmad, 2019, Stud Health Technol Inform, V257, P31
  • [2] Bird S, 2007, Natural Language Processing in Python
  • [3] Friedman C, 2000, J AM MED INFORM ASSN, P270
  • [4] Kavuluru Ramakanth, 2013, AMIA Jt Summits Transl Sci Proc, V2013, P112
  • [5] Meystre S M, 2008, Yearb Med Inform, P128
  • [6] Ou Y, 2014, ELECTRON J HEALTH IN, V8
  • [7] Qiu J., 2018, IEEE J BIOMED HEALTH, V22, P173
  • [8] Mayo clinical Text Analysis and Knowledge Extraction System (cTAKES): architecture, component evaluation and applications
    Savova, Guergana K.
    Masanz, James J.
    Ogren, Philip V.
    Zheng, Jiaping
    Sohn, Sunghwan
    Kipper-Schuler, Karin C.
    Chute, Christopher G.
    [J]. JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, 2010, 17 (05) : 507 - 513
  • [9] Si Yuqi, 2018, AMIA Annu Symp Proc, V2018, P1524
  • [10] CLAMP - a toolkit for efficiently building customized clinical natural language processing pipelines
    Soysal, Ergin
    Wang, Jingqi
    Jiang, Min
    Wu, Yonghui
    Pakhomov, Serguei
    Liu, Hongfang
    Xu, Hua
    [J]. JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, 2018, 25 (03) : 331 - 336