PheKB: a catalog and workflow for creating electronic phenotype algorithms for transportability

被引:225
作者
Kirby, Jacqueline C. [1 ]
Speltz, Peter [1 ]
Rasmussen, Luke V. [4 ]
Basford, Melissa [1 ]
Gottesman, Omri [2 ]
Peissig, Peggy L. [3 ]
Pacheco, Jennifer A. [4 ]
Tromp, Gerard [5 ]
Pathak, Jyotishman [6 ]
Carrell, David S. [7 ]
Ellis, Stephen B. [2 ]
Lingren, Todd [8 ]
Thompson, Will K.
Savova, Guergana [9 ,10 ]
Haines, Jonathan [11 ]
Roden, Dan M. [1 ]
Harris, Paul A.
Denny, Joshua C. [1 ]
机构
[1] Vanderbilt Univ, Med Ctr, Nashville, TN 37235 USA
[2] Icahn Sch Med Mt Sinai, New York, NY USA
[3] Marshfield Clin Res Fdn, Marshfield, WI USA
[4] Northwestern Univ, Feinberg Sch Med, Chicago, IL USA
[5] Geisinger Hlth Syst, Danville, PA USA
[6] Mayo Clin, Rochester, MN USA
[7] Grp Hlth Res Inst, Seattle, WA USA
[8] Cincinnati Childrens Hosp Med Ctr, Cincinnati, OH USA
[9] Boston Childrens Hosp, Boston, MA USA
[10] Harvard Med Sch, Boston, MA USA
[11] Case Western Univ, Cleveland, OH USA
关键词
electronic health records; electronic phenotyping; natural language processing; genomic research; clinical research; HEALTH RECORDS; MEDICAL-RECORDS; EMERGE NETWORK; PHENOME-WIDE; VALIDATION; SYSTEMS; CARE; DATABASE; SURVEILLANCE; INFECTION;
D O I
10.1093/jamia/ocv202
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Objective Health care generated data have become an important source for clinical and genomic research. Often, investigators create and iteratively refine phenotype algorithms to achieve high positive predictive values (PPVs) or sensitivity, thereby identifying valid cases and controls. These algorithms achieve the greatest utility when validated and shared by multiple health care systems. Materials and Methods We report the current status and impact of the Phenotype KnowledgeBase (PheKB, http://phekb.org), an online environment supporting the workflow of building, sharing, and validating electronic phenotype algorithms. We analyze the most frequent components used in algorithms and their performance at authoring institutions and secondary implementation sites. Results As of June 2015, PheKB contained 30 finalized phenotype algorithms and 62 algorithms in development spanning a range of traits and diseases. Phenotypes have had over 3500 unique views in a 6-month period and have been reused by other institutions. International Classification of Disease codes were the most frequently used component, followed by medications and natural language processing. Among algorithms with published performance data, the median PPV was nearly identical when evaluated at the authoring institutions (n = 44; case 96.0%, control 100%) compared to implementation sites (n = 40; case 97.5%, control 100%). Discussion These results demonstrate that a broad range of algorithms to mine electronic health record data from different health systems can be developed with high PPV, and algorithms developed at one site are generally transportable to others. Conclusion By providing a central repository, PheKB enables improved development, transportability, and validity of algorithms for research-grade phenotypes using health care generated data.
引用
收藏
页码:1046 / 1052
页数:7
相关论文
共 48 条
[1]   A comparative effectiveness trial of postoperative management for lumbar spine surgery: changing behavior through physical therapy (CBPT) study protocol [J].
Archer, Kristin R. ;
Coronado, Rogelio A. ;
Haug, Christine M. ;
Vanston, Susan W. ;
Devin, Clinton J. ;
Fonnesbeck, Christopher J. ;
Aaronson, Oran S. ;
Cheng, Joseph S. ;
Skolasky, Richard L. ;
Riley, Lee H., III ;
Wegener, Stephen T. .
BMC MUSCULOSKELETAL DISORDERS, 2014, 15
[2]   Development and validation of a classification approach for extracting severity automatically from electronic health records [J].
Boland, Mary Regina ;
Tatonetti, Nicholas P. ;
Hripcsak, George .
JOURNAL OF BIOMEDICAL SEMANTICS, 2015, 6
[3]  
Borthwick KM, 2015, INT J BIOMED DATA MI, V4, P113
[4]   Portability of an algorithm to identify rheumatoid arthritis in electronic health records [J].
Carroll, Robert J. ;
Thompson, Will K. ;
Eyler, Anne E. ;
Mandelin, Arthur M. ;
Cai, Tianxi ;
Zink, Raquel M. ;
Pacheco, Jennifer A. ;
Boomershine, Chad S. ;
Lasko, Thomas A. ;
Xu, Hua ;
Karlson, Elizabeth W. ;
Perez, Raul G. ;
Gainer, Vivian S. ;
Murphy, Shawn N. ;
Ruderman, Eric M. ;
Pope, Richard M. ;
Plenge, Robert M. ;
Kho, Abel Ngo ;
Liao, Katherine P. ;
Denny, Joshua C. .
JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, 2012, 19 (E1) :E162-E169
[5]  
Chute Christopher G, 2011, AMIA Annu Symp Proc, V2011, P248
[6]  
Conway Mike, 2011, AMIA Annu Symp Proc, V2011, P274
[7]   Chapter 13: Mining Electronic Health Records in the Genomics Era [J].
Denny, Joshua C. .
PLOS COMPUTATIONAL BIOLOGY, 2012, 8 (12)
[8]   Variants Near FOXE1 Are Associated with Hypothyroidism and Other Thyroid Conditions: Using Electronic Medical Records for Genome- and Phenome-wide Studies [J].
Denny, Joshua C. ;
Crawford, Dana C. ;
Ritchie, Marylyn D. ;
Bielinski, Suzette J. ;
Basford, Melissa A. ;
Bradford, Yuki ;
Chai, High Seng ;
Bastarache, Lisa ;
Zuvich, Rebecca ;
Peissig, Peggy ;
Carrell, David ;
Ramirez, Andrea H. ;
Pathak, Jyotishman ;
Wilke, Russell A. ;
Rasmussen, Luke ;
Wang, Xiaoming ;
Pacheco, Jennifer A. ;
Kho, Abel N. ;
Hayes, M. Geoffrey ;
Weston, Noah ;
Matsumoto, Martha ;
Kopp, Peter A. ;
Newton, Katherine M. ;
Jarvik, Gail P. ;
Li, Rongling ;
Manolio, Teri A. ;
Kullo, Iftikhar J. ;
Chute, Christopher G. ;
Chisholm, Rex L. ;
Larson, Eric B. ;
McCarty, Catherine A. ;
Masys, Daniel R. ;
Roden, Dan M. ;
de Andrade, Mariza .
AMERICAN JOURNAL OF HUMAN GENETICS, 2011, 89 (04) :529-542
[9]   PheWAS: demonstrating the feasibility of a phenome-wide scan to discover gene-disease associations [J].
Denny, Joshua C. ;
Ritchie, Marylyn D. ;
Basford, Melissa A. ;
Pulley, Jill M. ;
Bastarache, Lisa ;
Brown-Gentry, Kristin ;
Wang, Deede ;
Masys, Dan R. ;
Roden, Dan M. ;
Crawford, Dana C. .
BIOINFORMATICS, 2010, 26 (09) :1205-1210
[10]   Implementing Automated Surveillance for Tracking Clostridium difficile Infection at Multiple Healthcare Facilities [J].
Dubberke, Erik R., Jr. ;
Nyazee, Humaa A. ;
Yokoe, Deborah S. ;
Mayer, Jeanmarie ;
Stevenson, Kurt B. ;
Mangino, Julie E. ;
Khan, Yosef M. ;
Fraser, Victoria J. .
INFECTION CONTROL AND HOSPITAL EPIDEMIOLOGY, 2012, 33 (03) :305-308