An algorithm-based approach to ascertain patients with rare diseases in electronic health records using hypereosinophilic syndrome as an example

被引:0
作者
Requena, Gema [1 ,4 ]
Joksaite, Sandra [2 ,5 ]
Galwey, Nicholas [3 ]
Jakes, Rupert W. W. [1 ]
机构
[1] GSK, Value Evidence & Outcomes, Global Res & Dev, Epidemiol, Brentford, Middx, England
[2] GSK, Real World Analyt, Value Evidence & Outcomes, Global Res & Dev, Brentford, Middx, England
[3] GSK, Real World Analyt, Epidemiol, Stevenage, England
[4] GSK House, Epidemiol, Value Evidence & Outcomes, Global Res & Dev, 980 Great West Rd, Brentford TW8 9GS, Middx, England
[5] GSK House, Value Evidence & Outcomes, Global Res & Dev, Great West Rd, Brentford TW8 9GS, Middx, England
基金
芬兰科学院;
关键词
algorithm; diagnosis; electronic health record; hypereosinophilic syndrome; rare disease;
D O I
10.1002/pds.5655
中图分类号
R1 [预防医学、卫生学];
学科分类号
1004 ; 120402 ;
摘要
Purpose: Improved hypereosinophilic syndrome (HES) ascertainment in electronic health record (EHR) databases may improve disease understanding and management. An algorithm to ascertain and characterize this rare condition was therefore developed and validated. Methods: Using the UK clinical practice research datalink (CPRD)-Aurum database linked to the hospital episode statistics database (Admitted Patient Care data) from Jan 2012 to June 2019, this cross-sectional study ascertained patients with a specific HES code (index). Patients with HES were matched (age, sex and index date) 1:29 with a non-HES cohort. An algorithm was developed by identifying pre-defined variables differing between cohorts; model-fitting using Firth logistic regression and statistical determination of the top-five performing models; and internal validation using Leave-One-Out Cross Validation. Final model sensitivity and specificity were determined at an 80% probability threshold. Results: The HES and non-HES cohorts included 88 and 2552 patients, respectively; 270 models with four variables each (treatment used for HES, asthma code, white blood cell condition code, and blood eosinophil count [BEC] code) plus age and sex variables were tested. Of the top five models, the sensitivity model performed best (sensitivity, 69% [95% CI: 59%, 79%]; specificity, >99%). The strongest predictors of HES versus non-HES cases (odds >1000 times greater) were an ICD-10 code for white blood cell disorders and a BEC >= 1500 cells/mu L in the 24 months pre-index. Conclusions: Using a combination of medical codes, prescribed treatments data and laboratory results, the algorithm can help ascertain patients with HES from EHR databases; this approach may be useful for other rare diseases.
引用
收藏
页码:1261 / 1270
页数:10
相关论文
共 22 条
[1]   Mechanisms of eosinophilia in the pathogenesis of hypereosinophilic disorders [J].
Ackerman, Steven J. ;
Bochner, Bruce S. .
IMMUNOLOGY AND ALLERGY CLINICS OF NORTH AMERICA, 2007, 27 (03) :357-+
[2]  
[Anonymous], 2022 ICD 10 CM DIAGN
[3]   The use of SNOMED CT, 2013-2020: a literature review [J].
Chang, Eunsuk ;
Mostafa, Javed .
JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, 2021, 28 (09) :2017-2026
[4]   Incidence of myeloproliferative hypereosinophilic syndrome in the United States and an estimate of all hypereosinophilic syndrome incidence [J].
Crane, Martin M. ;
Chang, Cindy Ma ;
Kobayashi, Monica G. ;
Weller, Peter F. .
JOURNAL OF ALLERGY AND CLINICAL IMMUNOLOGY, 2010, 126 (01) :179-181
[5]   Hypereosinophilic Syndrome [J].
Curtis, Casey ;
Ogbogu, Princess .
CLINICAL REVIEWS IN ALLERGY & IMMUNOLOGY, 2016, 50 (02) :240-251
[6]   Necessary Condition Analysis (NCA): Logic and Methodology of "Necessary but Not Sufficient" Causality [J].
Dul, Jan .
ORGANIZATIONAL RESEARCH METHODS, 2016, 19 (01) :10-52
[7]   Translating and evaluating historic phenotyping algorithms using SNOMED CT [J].
Elkheder, Musaab ;
Gonzalez-Izquierdo, Arturo ;
Ul Arfeen, Muhammad Qummer ;
Kuan, Valerie ;
Lumbers, R. Thomas ;
Denaxas, Spiros ;
Shah, Anoop D. .
JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, 2023, 30 (02) :222-232
[8]   A generalized Hosmer-Lemeshow goodness-of-fit test for multinomial logistic regression models [J].
Fagerland, Morten W. ;
Hosmer, David W. .
STATA JOURNAL, 2012, 12 (03) :447-453
[9]  
globalgenes, 2013, RARE DIS IMPACT REPO
[10]   (A Critical Appraisal of) Classification of Hypereosinophilic Disorders [J].
Kahn, Jean Emmanuel ;
Groh, Matthieu ;
Lefevre, Guillaume .
FRONTIERS IN MEDICINE, 2017, 4