Identification of Hypertension in Electronic Health Records Through Computable Phenotype Development and Validation for Use in Public Health Surveillance: Retrospective Study

被引:0
作者
Valvi, Nimish [1 ,2 ]
Mcfarlane, Timothy [3 ]
Allen, Katie S. [1 ,4 ]
Gibson, P. Joseph [5 ]
Dixon, Brian Edward [1 ,4 ,6 ]
机构
[1] Regenstrief Inst Hlth Care, Ctr Biomed Informat, Indianapolis, IN USA
[2] Ball State Univ, Coll Hlth, Dept Nutr & Hlth Sci, Muncie, IN USA
[3] Indiana Family & Social Serv Adm, Indianapolis, IN USA
[4] Indiana Univ, Fairbanks Sch Publ Hlth, Dept Hlth Policy & Management, Indianapolis, IN USA
[5] CDC Fdn, Atlanta, GA USA
[6] Indiana Univ, Fairbanks Sch Publ Hlth, Dept Hlth Policy & Management, 1050 Wishard Blvd RG 5000, Indianapolis, IN 46202 USA
关键词
computable phenotypes; electronic health records; health information exchange; hypertension; population surveillance; public health informatics; UNITED-STATES; CARE; POPULATION; PREVALENCE; MORBIDITY; SERVICES; DISEASES;
D O I
10.2196/46413
中图分类号
R19 [保健组织与事业(卫生事业管理)];
学科分类号
摘要
Background: Electronic health record (EHR) systems are widely used in the United States to document care delivery and outcomes. Health information exchange (HIE) networks, which integrate EHR data from the various health care providers treating patients, are increasingly used to analyze population-level data. Existing methods for population health surveillance of essential hypertension by public health authorities may be complemented using EHR data from HIE networks to characterize disease burden at the community level. Objective: We aimed to derive and validate computable phenotypes (CPs) to estimate hypertension prevalence for population-based surveillance using an HIE network. Methods: Using existing data available from an HIE network, we developed 6 candidate CPs for essential (primary) hypertension in an adult population from a medium-sized Midwestern metropolitan area in the United States. A total of 2 independent clinician reviewers validated the phenotypes through a manual chart review of 150 randomly selected patient records. We assessed the precision of CPs by calculating sensitivity, specificity, positive predictive value (PPV), F1-score, and validity of chart reviews using prevalence-adjusted bias-adjusted kappa. We further used the most balanced CP to estimate the prevalence of hypertension in the population. Results: Among a cohort of 548,232 adults, 6 CPs produced PPVs ranging from 71% (95% CI 64.3%-76.9%) to 95.7% (95% CI 84.9%-98.9%). The F1-score ranged from 0.40 to 0.91. The prevalence-adjusted bias-adjusted kappa revealed a high percentage agreement of 0.88 for hypertension. Similarly, interrater agreement for individual phenotype determination demonstrated substantial agreement (range 0.70-0.88) for all 6 phenotypes examined. A phenotype based solely on diagnostic codes possessed reasonable performance (F1-score=0.63; PPV=95.1%) but was imbalanced with low sensitivity (47.6%). The most balanced phenotype (F1-score=0.91; PPV=83.5%) included diagnosis, blood pressure measurements, and medications and identified 210,764 (38.4%) individuals with hypertension during the study period (2014-2015). Conclusions: We identified several high-performing phenotypes to identify essential hypertension prevalence for local public health surveillance using EHR data. Given the increasing availability of EHR systems in the United States and other nations, leveraging EHR data has the potential to enhance surveillance of chronic disease in health systems and communities. Yet given variability in performance, public health authorities will need to decide whether to seek optimal balance or declare a preference for algorithms that lean toward sensitivity or specificity to estimate population prevalence of disease.
引用
收藏
页数:11
相关论文
共 48 条
  • [1] Adler-Milstein J, 2022, Health Information Exchange: Navigating and Managing a Network of Health Information Systems, V2nd
  • [2] Morbidity and drug consumption. Comparison of results between the National Health Survey and electronic medical records
    Aguilar-Palacio, Isabel
    Carrera-Lasfuentes, Patricia
    Poblador-Plou, Beatriz
    Prados-Torres, Alexandra
    Jose Rabanaque-Hernandez, M.
    [J]. GACETA SANITARIA, 2014, 28 (01) : 41 - 47
  • [3] Electronic health record phenotyping improves detection and screening of type 2 diabetes in the general United States population: A cross-sectional, unselected, retrospective study
    Anderson, Ariana E.
    Kerr, Wesley T.
    Thames, April
    Li, Tong
    Xiao, Jiayang
    Cohen, Mark S.
    [J]. JOURNAL OF BIOMEDICAL INFORMATICS, 2016, 60 : 162 - 168
  • [4] [Anonymous], 2022, Population Estimates (Provisional), prepared by Hedderson Demographic Services for Los Angeles County Internal Services Department
  • [5] [Anonymous], 2019, National Drug Code Directory
  • [6] [Anonymous], 2019, Multiple Cause of Death (1999-2010), as Compiled from Data Provided by the 57 Vital Statistics Jurisdictions through the Vital Statistics Cooperative Program
  • [7] [Anonymous], 2022, User's guide to the trusted exchange framework and common agreement-TEFCA
  • [8] Underdiagnosis of Hypertension Using Electronic Health Records
    Banerjee, Dipanjan
    Chung, Sukyung
    Wong, Eric C.
    Wang, Elsie J.
    Stafford, Randall S.
    Palaniappan, Latha P.
    [J]. AMERICAN JOURNAL OF HYPERTENSION, 2012, 25 (01) : 97 - 102
  • [9] Measuring morbidity: self-report or health care records?
    Barber, Julie
    Muller, Sara
    Whitehurst, Tracy
    Hay, Elaine
    [J]. FAMILY PRACTICE, 2010, 27 (01) : 25 - 30
  • [10] Berry DJ, 1997, CLIN ORTHOP RELAT R, P61