Biases introduced by filtering electronic health records for patients with "complete data"

被引:60
作者
Weber, Griffin M. [1 ,2 ]
Adams, William G. [3 ]
Bernstam, Elmer V. [4 ]
Bickel, Jonathan P. [5 ]
Fox, Kathe P. [6 ]
Marsolo, Keith [7 ]
Raghavan, Vijay A. [8 ]
Turchin, Alexander [9 ]
Zhou, Xiaobo [10 ]
Murphy, Shawn N. [11 ]
Mandl, Kenneth D. [1 ,5 ]
机构
[1] Harvard Med Sch, Dept Biomed Informat, Boston, MA USA
[2] Beth Israel Deaconess Med Ctr, Dept Med, Boston, MA 02215 USA
[3] Boston Med Ctr, Dept Pediat, Boston, MA USA
[4] Univ Texas Hlth Sci Ctr Houston, Sch Biomed Informat, McGovern Med Sch, Dept Internal Med, Houston, TX 77030 USA
[5] Boston Childrens Hosp, Computat Hlth Informat Program, Boston, MA USA
[6] Aetna, Dept Analyt & Behav Change, Hartford, CT USA
[7] Univ Cincinnati, Coll Med, Dept Pediat, Div Biomed Informat,Cincinnati Childrens Hosp Med, Cincinnati, OH USA
[8] Merck, Sci Informat Management, Boston, MA USA
[9] Brigham & Womens Hosp, Div Endocrinol, 75 Francis St, Boston, MA 02115 USA
[10] Wake Forest Univ, Bowman Gray Sch Med, Dept Radiol, Winston Salem, NC USA
[11] Massachusetts Gen Hosp, Dept Neurol, Boston, MA 02114 USA
基金
美国国家卫生研究院;
关键词
electronic health records; claims data; data accuracy; information storage and retrieval; selection bias; ACCOUNTABLE CARE ORGANIZATIONS; TYPE-2; DIABETES-MELLITUS; INFORMATION EXCHANGE; MEDICAID CLAIMS; FRAGMENTATION; ACCURACY; CENTERS; MARKET;
D O I
10.1093/jamia/ocx071
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
One promise of nationwide adoption of electronic health records (EHRs) is the availability of data for large-scale clinical research studies. However, because the same patient could be treated at multiple health care institutions, data from only a single site might not contain the complete medical history for that patient, meaning that critical events could be missing. In this study, we evaluate how simple heuristic checks for data "completeness" affect the number of patients in the resulting cohort and introduce potential biases. We began with a set of 16 filters that check for the presence of demographics, laboratory tests, and other types of data, and then systematically applied all 2(16) possible combinations of these filters to the EHR data for 12 million patients at 7 health care systems and a separate payor claims database of 7 million members. EHR data showed considerable variability in data completeness across sites and high correlation between data types. For example, the fraction of patients with diagnoses increased from 35.0% in all patients to 90.9% in those with at least 1 medication. An unrelated claims dataset independently showed that most filters select members who are older and more likely female and can eliminate large portions of the population whose data are actually complete. As investigators design studies, they need to balance their confidence in the completeness of the data with the effects of placing requirements on the data on the resulting patient cohort.
引用
收藏
页码:1134 / 1141
页数:8
相关论文
共 22 条
[1]   Health information exchange among U.S. hospitals: who's in, who's out, and why? [J].
Adler-Milstein, Julia ;
Jha, Ashish K. .
HEALTHCARE-THE JOURNAL OF DELIVERY SCIENCE AND INNOVATION, 2014, 2 (01) :26-32
[2]   Operational Health Information Exchanges Show Substantial Growth, But Long-Term Funding Remains A Concern [J].
Adler-Milstein, Julia ;
Bates, David W. ;
Jha, Ashish K. .
HEALTH AFFAIRS, 2013, 32 (08) :1486-1492
[3]  
Botsis Taxiarchis, 2010, Summit Transl Bioinform, V2010, P1
[4]   HEALTH CARE REFORM Patients Treated at Multiple Acute Health Care Facilities Quantifying Information Fragmentation [J].
Bourgeois, Fabienne C. ;
Olson, Karen L. ;
Mandl, Kenneth D. .
ARCHIVES OF INTERNAL MEDICINE, 2010, 170 (22) :1989-1995
[5]   PCORnet: turning a dream into reality [J].
Collins, Francis S. ;
Hudson, Kathy L. ;
Briggs, Josephine P. ;
Lauer, Michael S. .
JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, 2014, 21 (04) :576-577
[6]   Electronic Health Records vs Medicaid Claims: Completeness of Diabetes Preventive Care Data in Community Health Centers [J].
DeVoe, Jennifer E. ;
Gold, Rachel ;
McIntire, Patti ;
Puro, Jon ;
Chauvie, Susan ;
Gallia, Charles A. .
ANNALS OF FAMILY MEDICINE, 2011, 9 (04) :351-358
[7]  
Finnell John T, 2011, AMIA Annu Symp Proc, V2011, P409
[8]   Agreement of Medicaid claims and electronic health records for assessing preventive care quality among adults [J].
Heintzman, John ;
Bailey, Steffani R. ;
Hoopes, Megan J. ;
Thuy Le ;
Gold, Rachel ;
O'Malley, Jean P. ;
Cowburn, Stuart ;
Marino, Miguel ;
Krist, Alex ;
DeVoe, Jennifer E. .
JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, 2014, 21 (04) :720-724
[9]   Caveats for the Use of Operational Electronic Health Record Data in Comparative Effectiveness Research [J].
Hersh, William R. ;
Weiner, Mark G. ;
Embi, Peter J. ;
Logan, Judith R. ;
Payne, Philip R. O. ;
Bernstam, Elmer V. ;
Lehmann, Harold P. ;
Hripcsak, George ;
Hartzog, Timothy H. ;
Cimino, James J. ;
Saltz, Joel H. .
MEDICAL CARE, 2013, 51 (08) :S30-S37
[10]  
Lau EC, 2011, CLIN EPIDEMIOL, V3, P259, DOI DOI 10.2147/CLEP.S23690]