Research data warehouse: using electronic health records to conduct population-based observational studies

被引:11
作者
Chen, Wansu [1 ]
Xie, Fagen [1 ]
Mccarthy, Don P. [1 ]
Reynolds, Kristi L. [1 ,2 ]
Lee, Mingsum [3 ]
Coleman, Karen J. [1 ,2 ]
Getahun, Darios [1 ,2 ]
Koebnick, Corinna [1 ,2 ]
Jacobsen, Steve J. [1 ]
机构
[1] Kaiser Permanente Southern Calif, Dept Res & Evaluat, 100 S Los Robles,2nd Floor, Pasadena, CA 91101 USA
[2] Kaiser Permanente Bernard J Tyson Sch Med, Dept Hlth Syst Sci, Pasadena, CA USA
[3] Southern Calif Permanente Med Grp, Los Angeles Med Ctr, Dept Cardiol, Los Angeles, CA USA
关键词
electronic health record; research data warehouse; health care utilization; data quality; integration; standardization; chronic disease prevalence; DATA QUALITY ASSESSMENT; VACCINE SAFETY; INFRASTRUCTURE; SYSTEM; MODEL;
D O I
10.1093/jamiaopen/ooad039
中图分类号
R19 [保健组织与事业(卫生事业管理)];
学科分类号
摘要
Lay Summary Administrative data collected by healthcare organizations at the time of enrollment and during patient care are not always readily available for research. The same data type (eg, hospital admission) may come from multiple data sources in various formats and with inconsistent values, and the change of source data systems over time may leave the data fragmented. In this paper, we described the contents, development, maintenance methodology, and other aspects of a research data warehouse within a large integrated healthcare system, Kaiser Permanente Southern California. We also demonstrated the application of the data in the RDW and the volume of data that can be used for various population-based research projects. With a volume of 105 million person-years of health plan enrollment in 1981-2018 (30 million for Hispanic and 10 million for African American and 7 million for Asian patients), about 19 million clinic/emergency room visits, and more than 200k hospital admissions per year, the research data warehouse offers the opportunity to conduct high-quality population-based research studies. Background Electronic health records and many legacy systems contain rich longitudinal data that can be used for research; however, they typically are not readily available. Materials and methods At Kaiser Permanente Southern California (KPSC), a research data warehouse (RDW) has been developed and maintained since the late 1990s and widely extended in 2006, aggregating and standardizing data collected from internal and a few external sources. This article provides a high-level overview of the RDW and discusses challenges common to data warehouses or repositories for research use. To demonstrate the application of the data, we report the volume, patient characteristics, and age-adjusted prevalence of selected medical conditions and utilization rates of selected medical procedures. Results A total of 105 million person-years of health plan enrollment was recorded in the RDW between 1981 and 2018, with most healthcare utilization data available since early or middle 1990s. Among active enrollees on December 31, 2018, 15% were >= 65 years of age, 33.9% were non-Hispanic white, 43.3% Hispanic, 11.0% Asian, and 8.4% African American, and 34.4% of children (2-17 years old) and 72.1% of adults (>= 18 years old) were overweight or obese. The age-adjusted prevalence of asthma, atrial fibrillation, diabetes mellitus, hypercholesteremia, and hypertension increased between 2001 and 2018. Hospitalization and Emergency Department (ED) visit rates appeared lower, and office visit rates seemed higher at KPSC compared to the reported US averages. Discussion and conclusion Although the RDW is unique to KPSC, its methodologies and experience may provide useful insights for researchers of other healthcare systems worldwide in the era of big data analysis.
引用
收藏
页数:12
相关论文
共 58 条
  • [11] The Enterprise Data Trust at Mayo Clinic: a semantically integrated warehouse of biomedical data
    Chute, Christopher G.
    Beck, Scott A.
    Fisk, Thomas B.
    Mohr, David N.
    [J]. JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, 2010, 17 (02) : 131 - 135
  • [12] Electronic health records: new opportunities for clinical research
    Coorevits, P.
    Sundgren, M.
    Klein, G. O.
    Bahr, A.
    Claerhout, B.
    Daniel, C.
    Dugas, M.
    Dupont, D.
    Schmidt, A.
    Singleton, P.
    De Moor, G.
    Kalra, D.
    [J]. JOURNAL OF INTERNAL MEDICINE, 2013, 274 (06) : 547 - 560
  • [13] Secondary use of clinical data: The Vanderbilt approach
    Danciu, Ioana
    Cowan, James D.
    Basford, Melissa
    Wang, Xiaoming
    Saip, Alexander
    Osgood, Susan
    Shirey-Rice, Jana
    Kirby, Jacqueline
    Harris, Paul A.
    [J]. JOURNAL OF BIOMEDICAL INFORMATICS, 2014, 52 : 28 - 35
  • [14] Using electronic health records for clinical research: The case of the EHR4CR project
    De Moor, Georges
    Sundgren, Mats
    Kalra, Dipak
    Schmidt, Andreas
    Dugas, Martin
    Claerhout, Brecht
    Karakoyun, Toeresin
    Ohmann, Christian
    Lastic, Pierre-Yves
    Ammour, Nadir
    Kush, Rebecca
    Dupont, Danielle
    Cuggia, Marc
    Daniel, Christel
    Thienpont, Geert
    Coorevits, Pascal
    [J]. JOURNAL OF BIOMEDICAL INFORMATICS, 2015, 53 : 162 - 173
  • [15] Use of Electronic Medical Records for Health Outcomes Research A Literature Review
    Dean, Bonnie B.
    Lam, Jessica
    Natoli, Jaime L.
    Butler, Qiana
    Aguilar, Daniel
    Nordyke, Robert J.
    [J]. MEDICAL CARE RESEARCH AND REVIEW, 2009, 66 (06) : 611 - 638
  • [16] Race and Ethnicity Data Quality and Imputation Using US Census Data in an Integrated Health System: The Kaiser Permanente Southern California Experience
    Derose, Stephen F.
    Contreras, Richard
    Coleman, Karen J.
    Koebnick, Corinna
    Jacobsen, Steven J.
    [J]. MEDICAL CARE RESEARCH AND REVIEW, 2013, 70 (03) : 330 - 345
  • [17] Dolin RH, 2004, ST HEAL T, V107, P346
  • [18] Dziadkowiec Oliwier, 2016, EGEMS (Wash DC), V4, P1201, DOI 10.13063/2327-9214.1201
  • [19] Data Quality in Electronic Health Records Research: Quality Domains and Assessment Methods
    Feder, Shelli L.
    [J]. WESTERN JOURNAL OF NURSING RESEARCH, 2018, 40 (05) : 753 - 766
  • [20] Ferver K., 2009, Open Public Health Journal, V2, P11, DOI 10.2174/1874944500902010011