Lexical patterns, features and knowledge resources for coreference resolution in clinical notes

Cited by: 4
Authors
Gooch, Phil [1]
Roudsari, Abdul [2]
Affiliations
[1] City Univ London, Ctr Hlth Informat, Sch Informat, London EC1V 0HB, England
[2] Univ Victoria, Sch Hlth Informat Sci, Victoria, BC V8W 2Y2, Canada
Funding
Engineering and Physical Sciences Research Council (UK); National Institutes of Health (USA)
Keywords
Natural language processing; Coreference resolution; Knowledge engineering; Clinical records; Algorithms
DOI
10.1016/j.jbi.2012.02.012
Chinese Library Classification number
TP39 [Computer applications]
Discipline classification code
081203; 0835
Abstract
Generation of entity coreference chains provides a means to extract linked narrative events from clinical notes, but despite being a well-researched topic in natural language processing, general-purpose coreference tools perform poorly on clinical texts. This paper presents a knowledge-centric and pattern-based approach to resolving coreference across a wide variety of clinical records from two corpora (Ontology Development and Information Extraction (ODIE) and i2b2/VA), and describes a method for generating coreference chains using progressively pruned linked lists that reduces the search space and facilitates evaluation by a number of metrics. Independent evaluation results give an F-measure for each corpus of 79.2% and 87.5%, respectively. A baseline of blind coreference of mentions of the same class gives F-measures of 65.3% and 51.9% respectively. For the ODIE corpus, recall is significantly improved over the baseline (p < 0.05) but overall there was no statistically significant improvement in F-measure (p > 0.05). For the i2b2/VA corpus, recall, precision, and F-measure are significantly improved over the baseline (p < 0.05). Overall, our approach offers performance at least as good as human annotators and greatly increased performance over general-purpose tools. The system uses a number of open-source components that are available to download. (C) 2012 Elsevier Inc. All rights reserved.
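As a rough illustration of the baseline mentioned in the abstract ("blind coreference of mentions of the same class"), the sketch below links every mention of a given class into a single chain and computes an F-measure as the harmonic mean of precision and recall. This is only one reading of the baseline, not the authors' implementation: the function names, the mention identifiers, and the class labels are hypothetical, and the abstract does not say which of its evaluation metrics produce the reported figures.

```python
from collections import defaultdict


def blind_same_class_chains(mentions):
    """Link all mentions that share a class into one chain.

    `mentions` is a list of (mention_id, mention_class) pairs in document
    order. Both the pair layout and the class labels are assumptions made
    for this sketch.
    """
    chains = defaultdict(list)
    for mention_id, mention_class in mentions:
        chains[mention_class].append(mention_id)
    # A chain needs at least two mentions; singletons are dropped.
    return [ids for ids in chains.values() if len(ids) > 1]


def f_measure(precision, recall):
    """Balanced F-measure: harmonic mean of precision and recall."""
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)


# Hypothetical mentions from a single note.
example = [("m1", "problem"), ("m2", "person"), ("m3", "problem"), ("m4", "test")]
print(blind_same_class_chains(example))  # → [['m1', 'm3']]
```

A baseline of this kind tends to over-link, because distinct entities of the same class end up in one chain, which is consistent with the lower baseline F-measures reported in the abstract.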
Pages: 901-912
Number of pages: 12