A Computable Phenotype Improves Cohort Ascertainment in a Pediatric Pulmonary Hypertension Registry

被引:25
作者
Geva, Alon [1 ,2 ,3 ]
Gronsbell, Jessica L. [4 ]
Cai, Tianxi [4 ]
Cai, Tianrun [5 ]
Murphy, Shawn N. [6 ,7 ,8 ]
Lyons, Jessica C. [8 ]
Heinz, Michelle M. [1 ]
Natter, Marc D. [1 ,9 ]
Patibandla, Nandan [10 ]
Bickel, Jonathan [9 ,10 ]
Mullen, Mary P. [9 ,11 ]
Mandl, Kenneth D. [1 ,8 ,9 ]
机构
[1] Boston Childrens Hosp, Computat Hlth Informat Program, 300 Longwood Ave,1 Autumn 535,Mail Stop BCH3187, Boston, MA 02115 USA
[2] Boston Childrens Hosp, Dept Anesthesiol Perioperat & Pain Med, Div Crit Care Med, Boston, MA USA
[3] Harvard Med Sch, Dept Anesthesia, Boston, MA USA
[4] Harvard TH Chan Sch Publ Hlth, Dept Biostat, Boston, MA USA
[5] Brigham & Womens Hosp, Div Rheumatol Immunol & Allergy, 75 Francis St, Boston, MA 02115 USA
[6] Partners Healthcare, Dept Res Informat Serv & Comp, Boston, MA 02114 USA
[7] Massachusetts Gen Hosp, Dept Neurol, Boston, MA 02114 USA
[8] Harvard Med Sch, Dept Biomed Informat, Boston, MA USA
[9] Harvard Med Sch, Dept Pediat, Boston, MA USA
[10] Boston Childrens Hosp, Informat Serv Dept, Boston, MA USA
[11] Boston Childrens Hosp, Dept Cardiol, Boston, MA USA
基金
美国国家卫生研究院;
关键词
ELECTRONIC HEALTH RECORDS; MANAGEMENT; DISCOVERY;
D O I
10.1016/j.jpeds.2017.05.037
中图分类号
R72 [儿科学];
学科分类号
100202 ;
摘要
Objectives To compare registry and electronic health record (EHR) data mining approaches for cohort ascertainment in patients with pediatric pulmonary hypertension (PH) in an effort to overcome some of the limitations of registry enrollment alone in identifying patients with particular disease phenotypes. Study design This study was a single-center retrospective analysis of EHR and registry data at Boston Children's Hospital. The local Informatics for Integrating Biology and the Bedside (i2b2) data warehouse was queried for billing codes, prescriptions, and narrative data related to pediatric PH. Computable phenotype algorithms were developed by fitting penalized logistic regression models to a physician-annotated training set. Algorithms were applied to a candidate patient cohort, and performance was evaluated using a separate set of 136 records and 179 registry patients. We compared clinical and demographic characteristics of patients identified by computable phenotype and the registry. Results The computable phenotype had an area under the receiver operating characteristics curve of 90% (95% CI, 85%-95%), a positive predictive value of 85% (95% CI, 77%-93%), and identified 413 patients (an additional 231%) with pediatric PH who were not enrolled in the registry. Patients identified by the computable phenotype were clinically distinct from registry patients, with a greater prevalence of diagnoses related to perinatal distress and left heart disease. Conclusions Mining of EHRs using computable phenotypes identified a large cohort of patients not recruited using a classic registry. Fusion of EHR and registry data can improve cohort ascertainment for the study of rare diseases.
引用
收藏
页码:224 / +
页数:13
相关论文
共 30 条
[1]   Estimation and Testing for Multiple Regulation of Multivariate Mixed Outcomes [J].
Agniel, Denis ;
Liao, Katherine P. ;
Cai, Tianxi .
BIOMETRICS, 2016, 72 (04) :1194-1205
[2]   Clinical features of paediatric pulmonary hypertension: a registry study [J].
Berger, Rolf M. F. ;
Beghetti, Maurice ;
Humpl, Tilman ;
Raskob, Gary E. ;
Ivy, D. Dunbar ;
Jing, Zhi-Cheng ;
Bonnet, Damien ;
Schulze-Neick, Ingram ;
Barst, Robyn J. .
LANCET, 2012, 379 (9815) :537-546
[3]   The Unified Medical Language System (UMLS): integrating biomedical terminology [J].
Bodenreider, O .
NUCLEIC ACIDS RESEARCH, 2004, 32 :D267-D270
[4]  
Colan SD., 2015, Pediatric and Congenital Cardiac Care: Volume 1: Outcomes Analysis, P163
[5]   Improvements on cross-validation: The .632+ bootstrap method [J].
Efron, B ;
Tibshirani, R .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 1997, 92 (438) :548-560
[7]   The Biobank Portal for Partners Personalized Medicine: A Query Tool for Working with Consented Biobank Samples, Genotypes, and Phenotypes Using i2b2 [J].
Gainer, Vivian S. ;
Cagan, Andrew ;
Castro, Victor M. ;
Duey, Stacey ;
Ghosh, Bhaswati ;
Goodson, Alyssa P. ;
Goryachev, Sergey ;
Metta, Reeta ;
Wang, Taowei David ;
Wattanasin, Nich ;
Murphy, Shawn N. .
JOURNAL OF PERSONALIZED MEDICINE, 2016, 6 (01)
[8]  
Gliklich R., 2010, Agency for Healthcare Research and Quality, V2nd
[9]   PheKB: a catalog and workflow for creating electronic phenotype algorithms for transportability [J].
Kirby, Jacqueline C. ;
Speltz, Peter ;
Rasmussen, Luke V. ;
Basford, Melissa ;
Gottesman, Omri ;
Peissig, Peggy L. ;
Pacheco, Jennifer A. ;
Tromp, Gerard ;
Pathak, Jyotishman ;
Carrell, David S. ;
Ellis, Stephen B. ;
Lingren, Todd ;
Thompson, Will K. ;
Savova, Guergana ;
Haines, Jonathan ;
Roden, Dan M. ;
Harris, Paul A. ;
Denny, Joshua C. .
JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, 2016, 23 (06) :1046-1052
[10]   Data interchange using i2b2 [J].
Klann, Jeffrey G. ;
Abend, Aaron ;
Raghavan, Vijay A. ;
Mandl, Kenneth D. ;
Murphy, Shawn N. .
JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, 2016, 23 (05) :909-915