Drug side effect extraction from clinical narratives of psychiatry and psychology patients

被引:75
作者
Sohn, Sunghwan [1 ]
Kocher, Jean-Pierre A. [1 ]
Chute, Christopher G. [1 ]
Savova, Guergana K. [2 ,3 ]
机构
[1] Mayo Clin, Div Biomed Stat & Informat, Dept Hlth Sci Res, Rochester, MN 55905 USA
[2] Childrens Hosp, Boston, MA 02115 USA
[3] Harvard Univ, Sch Med, Boston, MA 02115 USA
关键词
EVENTS;
D O I
10.1136/amiajnl-2011-000351
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Objective To extract physician-asserted drug side effects from electronic medical record clinical narratives. Materials and methods Pattern matching rules were manually developed through examining keywords and expression patterns of side effects to discover an individual side effect and causative drug relationship. A combination of machine learning (C4.5) using side effect keyword features and pattern matching rules was used to extract sentences that contain side effect and causative drug pairs, enabling the system to discover most side effect occurrences. Our system was implemented as a module within the clinical Text Analysis and Knowledge Extraction System. Results The system was tested in the domain of psychiatry and psychology. The rule-based system extracting side effects and causative drugs produced an F score of 0.80 (0.55 excluding allergy section). The hybrid system identifying side effect sentences had an F score of 0.75 (0.56 excluding allergy section) but covered more side effect and causative drug pairs than individual side effect extraction. Discussion The rule-based system was able to identify most side effects expressed by clear indication words. More sophisticated semantic processing is required to handle complex side effect descriptions in the narrative. We demonstrated that our system can be trained to identify sentences with complex side effect descriptions that can be submitted to a human expert for further abstraction. Conclusion Our system was able to extract most physician-asserted drug side effects. It can be used in either an automated mode for side effect extraction or semi-automated mode to identify side effect sentences that can significantly simplify abstraction by a human expert.
引用
收藏
页码:I144 / I149
页数:6
相关论文
共 31 条
  • [1] Approximate is better than "exact" for interval estimation of binomial proportions
    Agresti, A
    Coull, BA
    [J]. AMERICAN STATISTICIAN, 1998, 52 (02) : 119 - 126
  • [2] [Anonymous], RXNORM
  • [3] [Anonymous], MeSH
  • [4] [Anonymous], P ICML 2003 WORKSH L
  • [5] [Anonymous], UMLS
  • [6] Detecting adverse events using information technology
    Bates, DW
    Evans, RS
    Murff, H
    Stetson, PD
    Pizziferri, L
    Hripcsak, G
    [J]. JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, 2003, 10 (02) : 115 - 128
  • [7] Chen Y., 2007, SIGKDD Explorations, V9, P22
  • [8] Preventable adverse drug events in hospitalized patients: A comparative study of intensive care and general care units
    Cullen, DJ
    Sweitzer, BJ
    Bates, DW
    Burdick, E
    Edmondson, A
    Leape, LL
    [J]. CRITICAL CARE MEDICINE, 1997, 25 (08) : 1289 - 1297
  • [9] Evaluation of a generalizable approach to clinical information retrieval using the automated retrieval console (ARC)
    D'Avolio, Leonard W.
    Nguyen, Thien M.
    Farwell, Wildon R.
    Chen, Yongming
    Fitzmeyer, Felicia
    Harris, Owen M.
    Fiore, Louis D.
    [J]. JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, 2010, 17 (04) : 375 - 382
  • [10] What can natural language processing do for clinical decision support?
    Demner-Fushman, Dina
    Chapman, Wendy W.
    McDonald, Clement J.
    [J]. JOURNAL OF BIOMEDICAL INFORMATICS, 2009, 42 (05) : 760 - 772