Transforming Estonian health data to the Observational Medical Outcomes Partnership (OMOP) Common Data Model: lessons learned

被引:9
|
作者
Oja, Marek [1 ,3 ]
Tamm, Sirli [1 ]
Mooses, Kerli [1 ]
Pajusalu, Maarja [1 ]
Talvik, Harry-Anton [1 ,2 ]
Ott, Anne [1 ]
Laht, Marianna [1 ]
Malk, Maria [1 ]
Loo, Marcus [1 ]
Holm, Johannes [1 ]
Haug, Markus [1 ]
Suvalov, Hendrik [1 ]
Saerg, Dage [1 ,2 ]
Vilo, Jaak [1 ,2 ]
Laur, Sven [1 ]
Kolde, Raivo [1 ]
Reisberg, Sulev [1 ,2 ]
机构
[1] Univ Tartu, Inst Comp Sci, Tartu 51009, Estonia
[2] STACC, Tartu 51009, Estonia
[3] Univ Tartu, Inst Comp Sci, Narva mnt 18, Tartu 51009, Estonia
基金
欧盟地平线“2020”;
关键词
OMOP; electronic health record; EHR; ETL; mapping; FEASIBILITY; RECORDS;
D O I
10.1093/jamiaopen/ooad100
中图分类号
R19 [保健组织与事业(卫生事业管理)];
学科分类号
摘要
Objective To describe the reusable transformation process of electronic health records (EHR), claims, and prescriptions data into Observational Medical Outcome Partnership (OMOP) Common Data Model (CDM), together with challenges faced and solutions implemented.Materials and Methods We used Estonian national health databases that store almost all residents' claims, prescriptions, and EHR records. To develop and demonstrate the transformation process of Estonian health data to OMOP CDM, we used a 10% random sample of the Estonian population (n = 150 824 patients) from 2012 to 2019 (MAITT dataset). For the sample, complete information from all 3 databases was converted to OMOP CDM version 5.3. The validation was performed using open-source tools.Results In total, we transformed over 100 million entries to standard concepts using standard OMOP vocabularies with the average mapping rate 95%. For conditions, observations, drugs, and measurements, the mapping rate was over 90%. In most cases, SNOMED Clinical Terms were used as the target vocabulary.Discussion During the transformation process, we encountered several challenges, which are described in detail with concrete examples and solutions.Conclusion For a representative 10% random sample, we successfully transferred complete records from 3 national health databases to OMOP CDM and created a reusable transformation process. Our work helps future researchers to transform linked databases into OMOP CDM more efficiently, ultimately leading to better real-world evidence. Health data can be found in various sources and formats, making it challenging for researchers. To address this issue, one possible approach is to transform the data into a standardized common data model (CDM). In this study, we describe the process of converting electronic health records (EHR), claims, and prescriptions data into the Observational Medical Outcome Partnership (OMOP) CDM, along with the challenges faced and solutions implemented. We used Estonian national health databases containing information on claims, prescriptions, and EHR records of 10% of Estonian residents (MAITT dataset). The study describes how data were mapped to standardized vocabulary and successfully converted to the OMOP CDM. We discuss the encountered difficulties and problems and propose solutions to help future researchers transform linked databases into OMOP CDM more efficiently, leading to better real-world evidence.
引用
收藏
页数:10
相关论文
共 47 条
  • [31] Can We Rely on Results From IQVIA Medical Research Data UK Converted to the Observational Medical Outcome Partnership Common Data Model? A Validation Study Based on Prescribing Codeine in Children
    Candore, Gianmario
    Hedenmalm, Karin
    Slattery, Jim
    Cave, Alison
    Kurz, Xavier
    Arlett, Peter
    CLINICAL PHARMACOLOGY & THERAPEUTICS, 2020, 107 (04) : 915 - 925
  • [32] Optimization of Electronic Medical Records for Data Mining Using a Common Data Model
    Kwong, Manlik
    Gardner, Heather L.
    Dieterle, Neil
    Rentko, Virginia
    TOPICS IN COMPANION ANIMAL MEDICINE, 2019, 37
  • [33] Harmonizing population health data into OMOP common data model: a demonstration using COVID-19 sero-surveillance data from Nairobi Urban Health and Demographic Surveillance System
    Ochola, Michael
    Kiwuwa-Muyingo, Sylvia
    Bhattacharjee, Tathagata
    Amadi, David
    Ng'etich, Maureen
    Kadengye, Damazo
    Owoko, Henry
    Igumba, Boniface
    Greenfield, Jay
    Todd, Jim
    Kiragga, Agnes
    INSPIRE Network
    FRONTIERS IN DIGITAL HEALTH, 2025, 7
  • [34] Transforming the Information System for Research in Primary Care (SIDIAP) in Catalonia to the OMOP Common Data Model and Its Use for COVID-19 Research
    Raventos, Berta
    Fernandez-Bertolin, Sergio
    Aragon, Maria
    Voss, Erica A.
    Blacketer, Clair
    Mendez-Boo, Leonardo
    Recalde, Martina
    Roel, Elena
    Pistillo, Andrea
    Reyes, Carlen
    van Sandijk, Sebastiaan
    Halvorsen, Lars
    Rijnbeek, Peter R.
    Burn, Edward
    Duarte-Salles, Talita
    CLINICAL EPIDEMIOLOGY, 2023, 15 : 969 - 986
  • [35] Data Quality- and Utility-Compliant Anonymization of Common Data Model-Harmonized Electronic Health Record Data: Protocol for a Scoping Review
    Wabo, Gaetan Kamdje
    Prasser, Fabian
    Gierend, Kerstin
    Siegel, Fabian
    Ganslandt, Thomas
    JMIR RESEARCH PROTOCOLS, 2023, 12
  • [36] Coronary Artery Computed Tomography Angiography for Preventing Cardio-Cerebrovascular Disease: Observational Cohort Study Using the Observational Health Data Sciences and Informatics' Common Data Model
    Bae, Woo Kyung
    Cho, Jihoon
    Kim, Seok
    Kim, Borham
    Baek, Hyunyoung
    Song, Wongeun
    Yoo, Sooyoung
    JMIR MEDICAL INFORMATICS, 2022, 10 (10)
  • [37] IncidencePrevalence: An R package to calculate population-level incidence rates and prevalence using the OMOP common data model
    Raventos, Berta
    Catala, Marti
    Du, Mike
    Guo, Yuchen
    Black, Adam
    Inberg, Ger
    Li, Xintong
    Lopez-Guell, Kim
    Newby, Danielle
    de Ridder, Maria
    Barboza, Cesar
    Duarte-Salles, Talita
    Verhamme, Katia
    Rijnbeek, Peter
    Alhambra, Daniel Prieto
    Burn, Edward
    PHARMACOEPIDEMIOLOGY AND DRUG SAFETY, 2024, 33 (01)
  • [38] Harmonizing Norwegian registries onto OMOP common data model: Mapping challenges and opportunities for pregnancy and COVID-19 research
    Trinh, Nhung T. H.
    Houghtaling, Jared
    Bernal, Fabian L. M.
    Hayati, Saeed
    Maglanoc, Luigi A.
    Lupattelli, Angela
    Halvorsen, Lars
    Nordeng, Hedvig M. E.
    INTERNATIONAL JOURNAL OF MEDICAL INFORMATICS, 2024, 191
  • [39] Data Collection Using the Electronic Health Record: Lessons Learned From the Chart Review Process
    Spratling, Regena
    Powers, Erin
    JOURNAL OF PEDIATRIC HEALTH CARE, 2015, 29 (03) : 294 - 296
  • [40] Common Problems, Common Data Model Solutions: Evidence Generation for Health Technology Assessment
    Kent, Seamus
    Burn, Edward
    Dawoud, Dalia
    Jonsson, Pall
    Ostby, Jens Torup
    Hughes, Nigel
    Rijnbeek, Peter
    Bouvy, Jacoline C.
    PHARMACOECONOMICS, 2021, 39 (03) : 275 - 285