Understanding data provenance when using electronic medical records for research: Lessons learned from the Deliver Primary Healthcare Information (DELPHI) database

被引:0
|
作者
Black, Jason Edward [1 ]
Terry, Amanda L. [1 ,2 ,3 ]
Cejic, Sonny [1 ]
Freeman, Tom [1 ]
Lizotte, Dan [2 ,3 ,4 ]
McKay, Scott [1 ]
Speechley, Mark [2 ,3 ]
Ryan, Bridget [1 ,2 ,3 ]
机构
[1] Western Univ, Schulich Sch Med & Dent, Dept Family Med, London, ON, Canada
[2] Western Univ, Schulich Sch Med & Dent, Dept Epidemiol & Biostat, London, ON, Canada
[3] Western Univ, Schulich Sch Med & Dent, Schulich Interfac Program Publ Hlth, London, ON, Canada
[4] Western Univ, Dept Comp Sci, Fac Sci, London, ON, Canada
来源
INTERNATIONAL JOURNAL OF POPULATION DATA SCIENCE (IJPDS) | 2023年 / 8卷 / 05期
关键词
electronic medical records; DELPHI; Choosing Wisely Canada; longitudinal analysis; interrupted time series; data quality; data provenance; data linkage;
D O I
10.23889/ijpds.v8i5.2177
中图分类号
R19 [保健组织与事业(卫生事业管理)];
学科分类号
摘要
Introduction We set out to assess the impact of Choosing Wisely Canada recommendations (2014) on reducing unnecessary health investigations and interventions in primary care across Southwestern Ontario. Methods We used the Deliver Primary Healthcare Information (DELPHI) database, which stores deidentified electronic medical records (EMR) of nearly 65,000 primary care patients across Southwestern Ontario. When conducting research using EMR data, data provenance (i.e., how the data came to be) should first be established. We first considered DELPHI data provenance in relation to longitudinal analyses, flagging a change in EMR software that occurred during 2012 and 2013. We attempted to link records between EMR databases produced by different software using probabilistic linkage and inspected 10 years of data in the DELPHI database (2009 to 2019) for data quality issues, including comparability over time. Results We encountered several issues resulting from this change in EMR software. These included limited linkage of records between software without a common identifier; data migration issues that distorted procedure dates; and unusual changes in laboratory test and medication prescription volumes. Conclusion This study reinforces the necessity of assessing data provenance and quality for new research projects. By understanding data provenance, we can anticipate related data quality issues such as changes in EMR data over time-which represent a growing concern as longitudinal data analyses increase in feasibility and popularity.
引用
收藏
页数:8
相关论文
共 9 条
  • [1] Identifying musculoskeletal conditions in electronic medical records: a prevalence and validation study using the Deliver Primary Healthcare Information (DELPHI) database
    Ryan, Bridget L.
    Maddocks, Heather L.
    McKay, Scott
    Petrella, Robert
    Terry, Amanda L.
    Stewart, Moira
    BMC MUSCULOSKELETAL DISORDERS, 2019, 20 (1)
  • [2] Identifying musculoskeletal conditions in electronic medical records: a prevalence and validation study using the Deliver Primary Healthcare Information (DELPHI) database
    Bridget L. Ryan
    Heather L. Maddocks
    Scott McKay
    Robert Petrella
    Amanda L. Terry
    Moira Stewart
    BMC Musculoskeletal Disorders, 20
  • [3] Views on health information sharing and privacy from primary care practices using electronic medical records
    Perera, Gihan
    Holbrook, Anne
    Thabane, Lehana
    Foster, Gary
    Willison, Donald J.
    INTERNATIONAL JOURNAL OF MEDICAL INFORMATICS, 2011, 80 (02) : 94 - 101
  • [4] From patient care to research: a validation study examining the factors contributing to data quality in a primary care electronic medical record database
    Nathan Coleman
    Gayle Halas
    William Peeler
    Natalie Casaclang
    Tyler Williamson
    Alan Katz
    BMC Family Practice, 16
  • [5] From patient care to research: a validation study examining the factors contributing to data quality in a primary care electronic medical record database
    Coleman, Nathan
    Halas, Gayle
    Peeler, William
    Casaclang, Natalie
    Williamson, Tyler
    Katz, Alan
    BMC FAMILY PRACTICE, 2015, 16
  • [6] Development and validation of a rheumatoid arthritis case definition: a machine learning approach using data from primary care electronic medical records
    Pham, Anh N. Q.
    Barber, Claire E. H.
    Drummond, Neil
    Jasper, Lisa
    Klein, Doug
    Lindeman, Cliff
    Widdifield, Jessica
    Williamson, Tyler
    Jones, C. Allyson
    BMC MEDICAL INFORMATICS AND DECISION MAKING, 2024, 24 (01)
  • [7] Using Electronic Medical Record Data for Research in a Healthcare Information and Management Systems Society (HIMSS) Analytics Electronic Medical Record Adoption Model (EMRAM) Stage 7 Hospital in Beijing: Cross-sectional Study
    Li, Rui
    Niu, Yue
    Scott, Sarah Robbins
    Zhou, Chu
    Lan, Lan
    Liang, Zhigang
    Li, Jia
    JMIR MEDICAL INFORMATICS, 2021, 9 (08)
  • [8] Identification of Dyslipidemic Patients Attending Primary Care Clinics Using Electronic Medical Record (EMR) Data from the Canadian Primary Care Sentinel Surveillance Network (CPCSSN) Database
    Erfan Aref-Eshghi
    Justin Oake
    Marshall Godwin
    Kris Aubrey-Bassler
    Pauline Duke
    Masoud Mahdavian
    Shabnam Asghari
    Journal of Medical Systems, 2017, 41
  • [9] Identification of Dyslipidemic Patients Attending Primary Care Clinics Using Electronic Medical Record (EMR) Data from the Canadian Primary Care Sentinel Surveillance Network (CPCSSN) Database
    Aref-Eshghi, Erfan
    Oake, Justin
    Godwin, Marshall
    Aubrey-Bassler, Kris
    Duke, Pauline
    Mahdavian, Masoud
    Asghari, Shabnam
    JOURNAL OF MEDICAL SYSTEMS, 2017, 41 (03)