Design patterns for the development of electronic health record-driven phenotype extraction algorithms

被引:42
作者
Rasmussen, Luke V. [1 ]
Thompson, Will K. [1 ,8 ]
Pacheco, Jennifer A. [1 ]
Kho, Abel N. [1 ]
Carrell, David S. [2 ]
Pathak, Jyotishman [3 ]
Peissig, Peggy L. [4 ]
Tromp, Gerard [5 ]
Denny, Joshua C. [6 ,7 ]
Starren, Justin B. [1 ]
机构
[1] Northwestern Univ, Feinberg Sch Med, Chicago, IL 60611 USA
[2] Grp Hlth Res Inst, Seattle, WA USA
[3] Mayo Clin, Dept Hlth Sci Res, Rochester, MN USA
[4] Marshfield Clin Res Fdn, Marshfield, WI USA
[5] Geisinger Hlth Syst, Sigfried & Janet Weis Ctr Res, Danville, PA USA
[6] Vanderbilt Univ, Dept Biomed Informat, Nashville, TN 37235 USA
[7] Vanderbilt Univ, Dept Med, Nashville, TN 37235 USA
[8] NorthShore Univ HealthSystem, Ctr Biomed Res Informat, Evanston, IL USA
关键词
Electronic health record; Phenotype; Algorithms; Software design; Design patterns; MEDICAL-RECORDS; ADMINISTRATIVE DATA; EMERGE NETWORK; GENOME; CARE; INFRASTRUCTURE; SURVEILLANCE; MANAGEMENT; DISCOVERY; VARIANTS;
D O I
10.1016/j.jbi.2014.06.007
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Background: Design patterns, in the context of software development and ontologies, provide generalized approaches and guidance to solving commonly occurring problems, or addressing common situations typically informed by intuition, heuristics and experience. While the biomedical literature contains broad coverage of specific phenotype algorithm implementations, no work to date has attempted to generalize common approaches into design patterns, which may then be distributed to the informatics community to efficiently develop more accurate phenotype algorithms. Methods: Using phenotyping algorithms stored in the Phenotype KnowledgeBase (PheKB), we conducted an independent iterative review to identify recurrent elements within the algorithm definitions. We extracted and generalized recurrent elements in these algorithms into candidate patterns. The authors then assessed the candidate patterns for validity by group consensus, and annotated them with attributes. Results: A total of 24 electronic Medical Records and Genomics (eMERGE) phenotypes available in PheKB as of 1/25/2013 were downloaded and reviewed. From these, a total of 21 phenotyping patterns were identified, which are available as an online data supplement. Conclusions: Repeatable patterns within phenotyping algorithms exist, and when codified and cataloged may help to educate both experienced and novice algorithm developers. The dissemination and application of these patterns has the potential to decrease the time to develop algorithms, while improving portability and accuracy. (C) 2014 Elsevier Inc. All rights reserved.
引用
收藏
页码:280 / 286
页数:7
相关论文
共 42 条
  • [1] Alexander C., 1977, A Pattern Language Towns, Buildings, Construction
  • [2] Comparison of Electronic Laboratory Reports, Administrative Claims, and Electronic Health Record Data for Acute Viral Hepatitis Surveillance
    Allen-Dicker, Joshua
    Klompas, Michael
    [J]. JOURNAL OF PUBLIC HEALTH MANAGEMENT AND PRACTICE, 2012, 18 (03) : 209 - 214
  • [3] Carroll Robert J, 2011, AMIA Annu Symp Proc, V2011, P189
  • [4] Conway Mike, 2011, AMIA Annu Symp Proc, V2011, P274
  • [5] Genetic variants associated with the white blood cell count in 13,923 subjects in the eMERGE Network
    Crosslin, David R.
    McDavid, Andrew
    Weston, Noah
    Nelson, Sarah C.
    Zheng, Xiuwen
    Hart, Eugene
    de Andrade, Mariza
    Kullo, Iftikhar J.
    McCarty, Catherine A.
    Doheny, Kimberly F.
    Pugh, Elizabeth
    Kho, Abel
    Hayes, M. Geoffrey
    Pretel, Stephanie
    Saip, Alexander
    Ritchie, Marylyn D.
    Crawford, Dana C.
    Crane, Paul K.
    Newton, Katherine
    Li, Rongling
    Mirel, Daniel B.
    Crenshaw, Andrew
    Larson, Eric B.
    Carlson, Chris S.
    Jarvik, Gail P.
    [J]. HUMAN GENETICS, 2012, 131 (04) : 639 - 652
  • [6] Chapter 13: Mining Electronic Health Records in the Genomics Era
    Denny, Joshua C.
    [J]. PLOS COMPUTATIONAL BIOLOGY, 2012, 8 (12)
  • [7] Variants Near FOXE1 Are Associated with Hypothyroidism and Other Thyroid Conditions: Using Electronic Medical Records for Genome- and Phenome-wide Studies
    Denny, Joshua C.
    Crawford, Dana C.
    Ritchie, Marylyn D.
    Bielinski, Suzette J.
    Basford, Melissa A.
    Bradford, Yuki
    Chai, High Seng
    Bastarache, Lisa
    Zuvich, Rebecca
    Peissig, Peggy
    Carrell, David
    Ramirez, Andrea H.
    Pathak, Jyotishman
    Wilke, Russell A.
    Rasmussen, Luke
    Wang, Xiaoming
    Pacheco, Jennifer A.
    Kho, Abel N.
    Hayes, M. Geoffrey
    Weston, Noah
    Matsumoto, Martha
    Kopp, Peter A.
    Newton, Katherine M.
    Jarvik, Gail P.
    Li, Rongling
    Manolio, Teri A.
    Kullo, Iftikhar J.
    Chute, Christopher G.
    Chisholm, Rex L.
    Larson, Eric B.
    McCarty, Catherine A.
    Masys, Daniel R.
    Roden, Dan M.
    de Andrade, Mariza
    [J]. AMERICAN JOURNAL OF HUMAN GENETICS, 2011, 89 (04) : 529 - 542
  • [8] Identification of Genomic Predictors of Atrioventricular Conduction Using Electronic Medical Records as a Tool for Genome Science
    Denny, Joshua C.
    Ritchie, Marylyn D.
    Crawford, Dana C.
    Schildcrout, Jonathan S.
    Ramirez, Andrea H.
    Pulley, Jill M.
    Basford, Melissa A.
    Masys, Daniel R.
    Haines, Jonathan L.
    Roden, Dan M.
    [J]. CIRCULATION, 2010, 122 (20) : 2016 - 2021
  • [9] Implementing Automated Surveillance for Tracking Clostridium difficile Infection at Multiple Healthcare Facilities
    Dubberke, Erik R., Jr.
    Nyazee, Humaa A.
    Yokoe, Deborah S.
    Mayer, Jeanmarie
    Stevenson, Kurt B.
    Mangino, Julie E.
    Khan, Yosef M.
    Fraser, Victoria J.
    [J]. INFECTION CONTROL AND HOSPITAL EPIDEMIOLOGY, 2012, 33 (03) : 305 - 308
  • [10] Gamma E., 1995, DESIGN PATTERNS ELEM