Lexical patterns, features and knowledge resources for coreference resolution in clinical notes

被引:4
作者
Gooch, Phil [1 ]
Roudsari, Abdul [2 ]
机构
[1] City Univ London, Ctr Hlth Informat, Sch Informat, London EC1V 0HB, England
[2] Univ Victoria, Sch Hlth Informat Sci, Victoria, BC V8W 2Y2, Canada
基金
英国工程与自然科学研究理事会; 美国国家卫生研究院;
关键词
Natural language processing; Coreference resolution; Knowledge engineering; Clinical records; Algorithms;
D O I
10.1016/j.jbi.2012.02.012
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Generation of entity coreference chains provides a means to extract linked narrative events from clinical notes, but despite being a well-researched topic in natural language processing, general-purpose coreference tools perform poorly on clinical texts. This paper presents a knowledge-centric and pattern-based approach to resolving coreference across a wide variety of clinical records from two corpora (Ontology Development and Information Extraction (ODIE) and i2b2/VA), and describes a method for generating coreference chains using progressively pruned linked lists that reduces the search space and facilitates evaluation by a number of metrics. Independent evaluation results give an F-measure for each corpus of 79.2% and 87.5%, respectively. A baseline of blind coreference of mentions of the same class gives F-measures of 65.3% and 51.9% respectively. For the ODIE corpus, recall is significantly improved over the baseline (p < 0.05) but overall there was no statistically significant improvement in F-measure (p > 0.05). For the i2b2/VA corpus, recall, precision, and F-measure are significantly improved over the baseline (p < 0.05). Overall, our approach offers performance at least as good as human annotators and greatly increased performance over general-purpose tools. The system uses a number of open-source components that are available to download. (C) 2012 Elsevier Inc. All rights reserved.
引用
收藏
页码:901 / 912
页数:12
相关论文
共 28 条
  • [21] Adapting existing natural language processing resources for cardiovascular risk factors identification in clinical notes
    Khalifa, Abdulrahman
    Meystre, Stephane
    JOURNAL OF BIOMEDICAL INFORMATICS, 2015, 58 : S128 - S132
  • [22] Harnessing Multi-modality and Expert Knowledge for Adverse Events Prediction in Clinical Notes
    Postiglione, Marco
    Esposito, Giovanni
    Izzo, Raffaele
    La Gatta, Valerio
    Moscato, Vincenzo
    Piccolo, Raffaele
    IMAGE ANALYSIS AND PROCESSING - ICIAP 2023 WORKSHOPS, PT II, 2024, 14366 : 119 - 130
  • [23] ClinicalRadioBERT: Knowledge-Infused Few Shot Learning for Clinical Notes Named Entity Recognition
    Rezayi, Saed
    Dai, Haixing
    Liu, Zhengliang
    Wu, Zihao
    Hebbar, Akarsh
    Burns, Andrew H.
    Zhao, Lin
    Zhu, Dajiang
    Li, Quanzheng
    Liu, Wei
    Li, Sheng
    Liu, Tianming
    Li, Xiang
    MACHINE LEARNING IN MEDICAL IMAGING, MLMI 2022, 2022, 13583 : 269 - 278
  • [24] Bi-LSTM-CRF Network for Clinical Event Extraction With Medical Knowledge Features
    Zhang, Shunli
    Li, Yancui
    Li, Shiyong
    Yan, Fang
    IEEE ACCESS, 2022, 10 : 110100 - 110109
  • [25] EILEEN: A Multi-Modal Framework for Extracting Alcohol Consumption Patterns From Bilingual Clinical Notes
    Kim, Han Kyul
    Park, Yujin
    Kim, Yoon Ji
    Yi, Seungag
    Park, Yeju
    So, Sujin
    Lee, Hyeon-Ji
    Bae, Ye Seul
    IEEE ACCESS, 2025, 13 : 25741 - 25751
  • [26] Integrating data-driven and knowledge-driven approaches to analyze clinical notes with structured data for sarcopenia detection
    Luo, Xiao
    Ding, Haoran
    Warden, Stuart J.
    Moorthi, Ranjani N.
    Imel, Erik A.
    HEALTH INFORMATICS JOURNAL, 2024, 30 (04)
  • [27] EHR2Vec: Representation Learning of Medical Concepts From Temporal Patterns of Clinical Notes Based on Self-Attention Mechanism
    Wang, Li
    Wang, Qinghua
    Bai, Heming
    Liu, Cong
    Liu, Wei
    Zhang, Yuanpeng
    Jiang, Lei
    Xu, Huji
    Wang, Kai
    Zhou, Yunyun
    FRONTIERS IN GENETICS, 2020, 11
  • [28] Extracting Clinical Features From Dictated Ambulatory Consult Notes Using a Commercially Available Natural Language Processing Tool: Pilot, Retrospective, Cross-Sectional Validation Study
    Petch, Jeremy
    Batt, Jane
    Murray, Joshua
    Mamdani, Muhammad
    JMIR MEDICAL INFORMATICS, 2019, 7 (04) : 69 - 79