Dictionary construction and identification of possible adverse drug events in Danish clinical narrative text

被引:49
作者
Eriksson, Robert [1 ,2 ]
Jensen, Peter Bjodstrup [2 ]
Frankild, Sune [2 ]
Jensen, Lars Juhl [2 ]
Brunak, Soren [1 ,2 ]
机构
[1] Univ Copenhagen, Fac Hlth & Med Sci, NNF Ctr Prot Res, Dept Dis Syst Biol, Copenhagen, Denmark
[2] Tech Univ Denmark, Ctr Biol Sequence Anal, Dept Syst Biol, DK-2800 Lyngby, Denmark
关键词
Adverse Drug Event; Adverse Drug Reaction Reporting Systems; Electronic Health Records; Dictionary; Data Mining; EXTRACTION SYSTEM; INFORMATION;
D O I
10.1136/amiajnl-2013-001708
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Objective Drugs have tremendous potential to cure and relieve disease, but the risk of unintended effects is always present. Healthcare providers increasingly record data in electronic patient records (EPRs), in which we aim to identify possible adverse events (AEs) and, specifically, possible adverse drug events (ADEs). Materials and methods Based on the undesirable effects section from the summary of product characteristics (SPC) of 7446 drugs, we have built a Danish ADE dictionary. Starting from this dictionary we have developed a pipeline for identifying possible ADEs in unstructured clinical narrative text. We use a named entity recognition (NER) tagger to identify dictionary matches in the text and post-coordination rules to construct ADE compound terms. Finally, we apply post-processing rules and filters to handle, for example, negations and sentences about subjects other than the patient. Moreover, this method allows synonyms to be identified and anatomical location descriptions can be merged to allow appropriate grouping of effects in the same location. Results The method identified 1970731 (35477 unique) possible ADEs in a large corpus of 6011 psychiatric hospital patient records. Validation was performed through manual inspection of possible ADEs, resulting in precision of 89% and recall of 75%. Discussion The presented dictionary-building method could be used to construct other ADE dictionaries. The complication of compound words in Germanic languages was addressed. Additionally, the synonym and anatomical location collapse improve the method. Conclusions The developed dictionary and method can be used to identify possible ADEs in Danish clinical narratives.
引用
收藏
页码:947 / 953
页数:7
相关论文
共 28 条
  • [21] Mayo clinical Text Analysis and Knowledge Extraction System (cTAKES): architecture, component evaluation and applications
    Savova, Guergana K.
    Masanz, James J.
    Ogren, Philip V.
    Zheng, Jiaping
    Sohn, Sunghwan
    Kipper-Schuler, Karin C.
    Chute, Christopher G.
    [J]. JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, 2010, 17 (05) : 507 - 513
  • [22] Skeppstedt M, 2012, LREC 2012 - EIGHTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, P1250
  • [23] Drug side effect extraction from clinical narratives of psychiatry and psychology patients
    Sohn, Sunghwan
    Kocher, Jean-Pierre A.
    Chute, Christopher G.
    Savova, Guergana K.
    [J]. JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, 2011, 18 : I144 - I149
  • [24] Strom BL, 2012, Pharmacoepidemiology, V5th, DOI [10.1002/9781119959946.ch4, DOI 10.1002/9781119959946.CH4]
  • [25] Selecting information in electronic health records for knowledge acquisition
    Wang, Xiaoyan
    Chase, Herbert
    Markatou, Marianthi
    Hripcsak, George
    Friedman, Carol
    [J]. JOURNAL OF BIOMEDICAL INFORMATICS, 2010, 43 (04) : 595 - 601
  • [26] World Health Organization, 2002, The Importance of Pharmacovigilance
  • [27] MedEx: a medication information extraction system for clinical narratives
    Xu, Hua
    Stenner, Shane P.
    Doan, Son
    Johnson, Kevin B.
    Waitman, Lemuel R.
    Denny, Joshua C.
    [J]. JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, 2010, 17 (01) : 19 - 24
  • [28] Extracting principal diagnosis, co-morbidity and smoking status for asthma research: Evaluation of a natural language processing system
    Zeng Q.T.
    Goryachev S.
    Weiss S.
    Sordo M.
    Murphy S.N.
    Lazarus R.
    [J]. BMC Medical Informatics and Decision Making, 6 (1)