Lexical patterns, features and knowledge resources for coreference resolution in clinical notes

Cited by: 4

Authors:

Gooch, Phil ^{[1]}

Roudsari, Abdul ^{[2]}

Affiliations:

[1] City Univ London, Ctr Hlth Informat, Sch Informat, London EC1V 0HB, England

[2] Univ Victoria, Sch Hlth Informat Sci, Victoria, BC V8W 2Y2, Canada

Source:

JOURNAL OF BIOMEDICAL INFORMATICS | 2012 / Vol. 45 / No. 5

Funding:

Engineering and Physical Sciences Research Council (UK); National Institutes of Health (US);

Keywords:

Natural language processing; Coreference resolution; Knowledge engineering; Clinical records; Algorithms;

DOI:

10.1016/j.jbi.2012.02.012

Chinese Library Classification:

TP39 [Computer applications];

Discipline classification code:

081203 ; 0835 ;

Abstract:

Generation of entity coreference chains provides a means to extract linked narrative events from clinical notes, but despite being a well-researched topic in natural language processing, general-purpose coreference tools perform poorly on clinical texts. This paper presents a knowledge-centric and pattern-based approach to resolving coreference across a wide variety of clinical records from two corpora (Ontology Development and Information Extraction (ODIE) and i2b2/VA), and describes a method for generating coreference chains using progressively pruned linked lists that reduces the search space and facilitates evaluation by a number of metrics. Independent evaluation results give an F-measure for each corpus of 79.2% and 87.5%, respectively. A baseline of blind coreference of mentions of the same class gives F-measures of 65.3% and 51.9% respectively. For the ODIE corpus, recall is significantly improved over the baseline (p < 0.05) but overall there was no statistically significant improvement in F-measure (p > 0.05). For the i2b2/VA corpus, recall, precision, and F-measure are significantly improved over the baseline (p < 0.05). Overall, our approach offers performance at least as good as human annotators and greatly increased performance over general-purpose tools. The system uses a number of open-source components that are available to download. (C) 2012 Elsevier Inc. All rights reserved.
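
The "blind coreference of mentions of the same class" baseline mentioned in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation. The mention identifiers, the class labels ("problem", "treatment"), the helper names, and the simple pairwise-link F-measure are all assumptions made for illustration; the abstract does not say which metrics were used beyond "a number of metrics".

```python
from collections import defaultdict
from itertools import combinations


def blind_chains(mentions):
    """Hypothetical baseline: put every mention of the same class in one chain."""
    chains = defaultdict(list)
    for mention_id, mention_class in mentions:
        chains[mention_class].append(mention_id)
    # A chain needs at least two mentions to express a coreference link.
    return [ids for ids in chains.values() if len(ids) > 1]


def links(chains):
    """Expand chains into the set of unordered mention pairs they imply."""
    return {frozenset(pair) for chain in chains for pair in combinations(chain, 2)}


def pairwise_prf(gold_chains, predicted_chains):
    """Pairwise-link precision, recall and F-measure (an assumed, simple metric)."""
    gold, pred = links(gold_chains), links(predicted_chains)
    true_positives = len(gold & pred)
    precision = true_positives / len(pred) if pred else 0.0
    recall = true_positives / len(gold) if gold else 0.0
    total = precision + recall
    f_measure = 2 * precision * recall / total if total else 0.0
    return precision, recall, f_measure


# Toy data, invented for this example: (mention id, mention class).
mentions = [
    ("m1", "problem"),
    ("m2", "problem"),
    ("m3", "treatment"),
    ("m4", "problem"),
    ("m5", "treatment"),
]
gold = [["m1", "m2"], ["m3", "m5"]]  # m4 is a different problem entity

predicted = blind_chains(mentions)
precision, recall, f_measure = pairwise_prf(gold, predicted)
print(predicted, precision, recall, round(f_measure, 3))
```

In this toy case the baseline wrongly links m4 to the other "problem" mentions, so it proposes links that are not in the gold chains and loses precision. A pattern-based or knowledge-based resolver would aim to prune such links.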


Pages: 901-912

Page count: 12

