Adverse drug event and medication extraction in electronic health records via a cascading architecture with different sequence labeling models and word embeddings

Cited by: 19
Authors
Dai, Hong-Jie [1, 2]
Su, Chu-Hsien [3 ]
Wu, Chi-Shin [3 ]
Affiliations
[1] Natl Kaohsiung Univ Sci & Technol, Coll Elect Engn & Comp Sci, Dept Elect Engn, Kaohsiung, Taiwan
[2] Kaohsiung Med Univ, Coll Med, Dept Postbaccalaureate Med, Kaohsiung, Taiwan
[3] Natl Taiwan Univ Hosp, Dept Psychiat, Taipei, Taiwan
Keywords
adverse drug event; information extraction; named entity recognition; word embedding; electronic health record
DOI
10.1093/jamia/ocz120
Chinese Library Classification number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Objective: An adverse drug event (ADE) is an injury resulting from a medical intervention related to a drug, including harm caused by the drug itself or by the way it is used. Extracting ADEs from clinical records can help physicians associate adverse events with the targeted drugs. Materials and Methods: We proposed a cascading architecture to recognize medical concepts including ADEs, drug names, and entities related to drugs. The architecture includes a preprocessing method and an ensemble of conditional random fields (CRFs) and neural network-based models to address, respectively, the challenges of surrogate strings and overlapping annotation boundaries observed in the employed ADE and medication extraction (ADME) corpus. The effectiveness of applying different pretrained and postprocessed word embeddings to the ADME task was also studied. Results: The empirical results showed that both CRFs and neural network-based models provide promising solutions for the ADME task. The neural network-based models outperformed CRFs particularly in concept types involving narrative descriptions. Our best run achieved an overall micro F-score of 0.919 on the employed corpus. Our results also suggested that general-domain Global Vectors for Word Representation (GloVe) embeddings provide a very strong baseline, which can be further improved by applying principal component analysis to generate more isotropic vectors. Conclusions: We have demonstrated that the proposed cascading architecture can handle the problem of overlapping annotations and further improve the overall recall and F-scores, because the architecture enables the developed models to exploit more context information and forms an ensemble that creates a stronger recognizer.
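The abstract reports an overall micro F-score. As a reminder of what that metric measures, here is a minimal Python sketch of micro-averaging: true positives, false positives, and false negatives are pooled across all concept types before precision and recall are computed. The function name, the concept type labels, and the counts are hypothetical and chosen only for illustration; they are not taken from the paper or from its evaluation script.

```python
from typing import Dict, Tuple


def micro_f_score(counts: Dict[str, Tuple[int, int, int]]) -> float:
    """Micro-averaged F1 over concept types.

    `counts` maps a concept type to a tuple of
    (true positives, false positives, false negatives).
    """
    # Pool the counts over every concept type before computing any ratio.
    tp = sum(c[0] for c in counts.values())
    fp = sum(c[1] for c in counts.values())
    fn = sum(c[2] for c in counts.values())
    if tp == 0:
        # No correct predictions: precision and recall are both zero.
        return 0.0
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    return 2 * precision * recall / (precision + recall)


# Hypothetical counts, invented for illustration only.
example_counts = {
    "Drug": (90, 10, 10),
    "ADE": (30, 20, 30),
}
score = micro_f_score(example_counts)
```

Because the counts are pooled, frequent concept types weigh more heavily in the micro average than rare ones, which is why a single overall figure can hide weaker performance on less common types.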
Pages: 47-55
Page count: 9