Transforming Estonian health data to the Observational Medical Outcomes Partnership (OMOP) Common Data Model: lessons learned

被引:9
|
作者
Oja, Marek [1 ,3 ]
Tamm, Sirli [1 ]
Mooses, Kerli [1 ]
Pajusalu, Maarja [1 ]
Talvik, Harry-Anton [1 ,2 ]
Ott, Anne [1 ]
Laht, Marianna [1 ]
Malk, Maria [1 ]
Loo, Marcus [1 ]
Holm, Johannes [1 ]
Haug, Markus [1 ]
Suvalov, Hendrik [1 ]
Saerg, Dage [1 ,2 ]
Vilo, Jaak [1 ,2 ]
Laur, Sven [1 ]
Kolde, Raivo [1 ]
Reisberg, Sulev [1 ,2 ]
机构
[1] Univ Tartu, Inst Comp Sci, Tartu 51009, Estonia
[2] STACC, Tartu 51009, Estonia
[3] Univ Tartu, Inst Comp Sci, Narva mnt 18, Tartu 51009, Estonia
基金
欧盟地平线“2020”;
关键词
OMOP; electronic health record; EHR; ETL; mapping; FEASIBILITY; RECORDS;
D O I
10.1093/jamiaopen/ooad100
中图分类号
R19 [保健组织与事业(卫生事业管理)];
学科分类号
摘要
Objective To describe the reusable transformation process of electronic health records (EHR), claims, and prescriptions data into Observational Medical Outcome Partnership (OMOP) Common Data Model (CDM), together with challenges faced and solutions implemented.Materials and Methods We used Estonian national health databases that store almost all residents' claims, prescriptions, and EHR records. To develop and demonstrate the transformation process of Estonian health data to OMOP CDM, we used a 10% random sample of the Estonian population (n = 150 824 patients) from 2012 to 2019 (MAITT dataset). For the sample, complete information from all 3 databases was converted to OMOP CDM version 5.3. The validation was performed using open-source tools.Results In total, we transformed over 100 million entries to standard concepts using standard OMOP vocabularies with the average mapping rate 95%. For conditions, observations, drugs, and measurements, the mapping rate was over 90%. In most cases, SNOMED Clinical Terms were used as the target vocabulary.Discussion During the transformation process, we encountered several challenges, which are described in detail with concrete examples and solutions.Conclusion For a representative 10% random sample, we successfully transferred complete records from 3 national health databases to OMOP CDM and created a reusable transformation process. Our work helps future researchers to transform linked databases into OMOP CDM more efficiently, ultimately leading to better real-world evidence. Health data can be found in various sources and formats, making it challenging for researchers. To address this issue, one possible approach is to transform the data into a standardized common data model (CDM). In this study, we describe the process of converting electronic health records (EHR), claims, and prescriptions data into the Observational Medical Outcome Partnership (OMOP) CDM, along with the challenges faced and solutions implemented. We used Estonian national health databases containing information on claims, prescriptions, and EHR records of 10% of Estonian residents (MAITT dataset). The study describes how data were mapped to standardized vocabulary and successfully converted to the OMOP CDM. We discuss the encountered difficulties and problems and propose solutions to help future researchers transform linked databases into OMOP CDM more efficiently, leading to better real-world evidence.
引用
收藏
页数:10
相关论文
共 47 条
  • [21] Transforming and evaluating the UK Biobank to the OMOP Common Data Model for COVID-19 research and beyond
    Papez, Vaclav
    Moinat, Maxim
    Voss, Erica A.
    Bazakou, Sofia
    Van Winzum, Anne
    Peviani, Alessia
    Payralbe, Stefan
    Kallfelz, Michael
    Asselbergs, Folkert W.
    Prieto-Alhambra, Daniel
    Dobson, Richard J. B.
    Denaxas, Spiros
    JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, 2022, 30 (01) : 103 - 111
  • [22] Automated Generation of Individual and Population Clinical Pathways with the OMOP Common Data Model
    Boudis, Fabio
    Clement, Guillaume
    Bruandet, Amelie
    Lamer, Antoine
    PUBLIC HEALTH AND INFORMATICS, PROCEEDINGS OF MIE 2021, 2021, 281 : 218 - 222
  • [23] Patient-Level Fall Risk Prediction Using the Observational Medical Outcomes Partnership's Common Data Model: Pilot Feasibility Study
    Jung, Hyesil
    Yoo, Sooyoung
    Kim, Seok
    Heo, Eunjeong
    Kim, Borham
    Lee, Ho-Young
    Hwang, Hee
    JMIR MEDICAL INFORMATICS, 2022, 10 (03)
  • [24] Standardizing Austrians Claims Data Using the OMOP Common Data Model: A Feasibility Study
    Haberson, Andrea
    Rinner, Christoph
    Gall, Walter
    ICT FOR HEALTH SCIENCE RESEARCH, 2019, 258 : 151 - 152
  • [25] Patient Cohort Identification on Time Series Data Using the OMOP Common Data Model
    Maier, Christian
    Kapsner, Lorenz A.
    Mate, Sebastian
    Prokosch, Hans-Ulrich
    Kraus, Stefan
    APPLIED CLINICAL INFORMATICS, 2021, 12 (01): : 57 - 64
  • [26] Transformation and Evaluation of the MIMIC Database in the OMOP Common Data Model: Development and Usability Study
    Paris, Nicolas
    Lamer, Antoine
    Parrot, Adrien
    JMIR MEDICAL INFORMATICS, 2021, 9 (12)
  • [27] A Realism-Based View on Counts in OMOP's Common Data Model
    Ceusters, Werner
    Blaisure, Jonathan
    PHEALTH 2017, 2017, 237 : 55 - 62
  • [28] Expanding the OMOP Common Data Model to Support Perinatal Research in Network Studies
    Abellan, Alicia
    Burn, Edward
    Trinh, Nhung T. H.
    Burkard, Theresa
    Callahan, Alison
    Fernandez-Bertolin, Sergio
    Hurley, Eimir
    Rodriguez, Clara
    Segundo, Elena
    Morales, Daniel R.
    Nordeng, Hedvig M. E.
    Duarte-Salles, Talita
    PHARMACOEPIDEMIOLOGY AND DRUG SAFETY, 2025, 34 (02)
  • [29] Data harmonization and federated learning for multi-cohort dementia research using the OMOP common data model: A Netherlands consortium of dementia cohorts case study
    Mateus, Pedro
    Moonen, Justine
    Beran, Magdalena
    Jaarsma, Eva
    van der Landen, Sophie M.
    Heuvelink, Joost
    Birhanu, Mahlet
    Harms, Alexander G. J.
    Bron, Esther
    Wolters, Frank J.
    Cats, Davy
    Mei, Hailiang
    Oomens, Julie
    Jansen, Willemijn
    Schram, Miranda T.
    Dekker, Andre
    Bermejo, Inigo
    JOURNAL OF BIOMEDICAL INFORMATICS, 2024, 155
  • [30] The OMOP common data model in Australian primary care data: Building a quality research ready harmonised dataset
    Ward, Roger
    Hallinan, Christine Mary
    Ormiston-Smith, David
    Chidgey, Christine
    Boyle, Dougie
    PLOS ONE, 2024, 19 (04):