2018 n2c2 shared task on adverse drug events and medication extraction in electronic health records

被引:169
作者
Henry, Sam [1 ]
Buchan, Kevin [2 ]
Filannino, Michele [1 ,3 ]
Stubbs, Amber [4 ]
Uzuner, Ozlem [1 ,3 ,5 ]
机构
[1] George Mason Univ, Dept Informat Sci & Technol, 4400 Univ Dr, Fairfax, VA 22030 USA
[2] SUNY Albany, Dept Informat Sci, Albany, NY 12222 USA
[3] MIT, Comp Sci & Artificial Intelligence Lab, 77 Massachusetts Ave, Cambridge, MA 02139 USA
[4] Simmons Univ, Dept Math & Comp Sci, Boston, MA USA
[5] Harvard Med Sch, Dept Biomed Informat, Boston, MA 02115 USA
基金
美国国家卫生研究院;
关键词
OF-THE-ART; CLINICAL NARRATIVES; DE-IDENTIFICATION; HEART-DISEASE; RISK-FACTORS; INFORMATION;
D O I
10.1093/jamia/ocz166
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Objective: This article summarizes the preparation, organization, evaluation, and results of Track 2 of the 2018 National NLP Clinical Challenges shared task. Track 2 focused on extraction of adverse drug events (ADEs) from clinical records and evaluated 3 tasks: concept extraction, relation classification, and end-to-end systems. We perform an analysis of the results to identify the state of the art in these tasks, learn from it, and build on it. Materials and Methods: For all tasks, teams were given raw text of narrative discharge summaries, and in all the tasks, participants proposed deep learning-based methods with hand-designed features. In the concept extraction task, participants used sequence labelling models (bidirectional long short-term memory being the most popular), whereas in the relation classification task, they also experimented with instance-based classifiers (namely support vector machines and rules). Ensemble methods were also popular. Results: A total of 28 teams participated in task 1, with 21 teams in tasks 2 and 3. The best performing systems set a high performance bar with F1 scores of 0.9418 for concept extraction, 0.9630 for relation classification, and 0.8905 for end-to-end. However, the results were much lower for concepts and relations of Reasons and ADEs. These were often missed because local context is insufficient to identify them. Conclusions: This challenge shows that clinical concept extraction and relation classification systems have a high performance for many concept types, but significant improvement is still required for ADEs and Reasons. Incorporating the larger context or outside knowledge will likely improve the performance of future systems.
引用
收藏
页码:3 / 12
页数:10
相关论文
共 55 条
[21]   MIMIC-III, a freely accessible critical care database [J].
Johnson, Alistair E. W. ;
Pollard, Tom J. ;
Shen, Lu ;
Lehman, Li-wei H. ;
Feng, Mengling ;
Ghassemi, Mohammad ;
Moody, Benjamin ;
Szolovits, Peter ;
Celi, Leo Anthony ;
Mark, Roger G. .
SCIENTIFIC DATA, 2016, 3
[22]   An ensemble of neural models for nested adverse drug events and medication extraction with subwords [J].
Ju, Meizhi ;
Nguyen, Nhung T. H. ;
Miwa, Makoto ;
Ananiadou, Sophia .
JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, 2020, 27 (01) :22-30
[23]   Overview of the CLEF eHealth Evaluation Lab 2016 [J].
Kelly, Liadh ;
Goeuriot, Lorraine ;
Suominen, Hanna ;
Neveol, Aurelie ;
Palotti, Joao ;
Zuccon, Guido .
EXPERIMENTAL IR MEETS MULTILINGUALITY, MULTIMODALITY, AND INTERACTION, CLEF 2016, 2016, 9822 :255-266
[24]   Ensemble method-based extraction of medication and related information from clinical texts [J].
Kim, Youngjun ;
Meystre, Stephane M. .
JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, 2020, 27 (01) :31-38
[25]  
Lafferty JD, 2001, P 18 INT C MACH LEAR, P282, DOI 10.5555/645530.655813
[26]  
Lample G., 2016, P NAACL HLT, P260, DOI DOI 10.18653/V1/N16-1030
[27]   An end-to-end hybrid algorithm for automated medication discrepancy detection [J].
Li, Qi ;
Spooner, Stephen Andrew ;
Kaiser, Megan ;
Lingren, Nataline ;
Robbins, Jessica ;
Lingren, Todd ;
Tang, Huaxiu ;
Solti, Imre ;
Ni, Yizhao .
BMC MEDICAL INFORMATICS AND DECISION MAKING, 2015, 15
[28]  
Ling W, 2016, FINDING FUNCTION FOR
[29]   Automated detection of adverse events using natural language processing of discharge summaries [J].
Melton, CB ;
Hripcsak, G .
JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, 2005, 12 (04) :448-457
[30]  
Mikolov T., 2013, Advances in neural information processing systems, V26, P3111