Linking mothers and infants within electronic health records: a comparison of deterministic and probabilistic algorithms

被引:29
作者
Baldwin, Eric [1 ]
Johnson, Karin [1 ]
Berthoud, Heidi [1 ]
Dublin, Sascha [1 ]
机构
[1] Grp Hlth Cooperat Puget Sound, Grp Hlth Res Inst, Seattle, WA 98109 USA
关键词
pregnancy outcome; epidemiology; medical record linkage; pharmacoepidemiology; LINKED BIRTH; DATA LINKAGE; PREGNANCY; UTAH;
D O I
10.1002/pds.3728
中图分类号
R1 [预防医学、卫生学];
学科分类号
1004 ; 120402 ;
摘要
PurposeTo compare probabilistic and deterministic algorithms for linking mothers and infants within electronic health records (EHRs) to support pregnancy outcomes research. MethodsThe study population was women enrolled in Group Health (Washington State, USA) delivering a liveborn infant from 2001 through 2008 (N=33093 deliveries) and infant members born in these years. We linked women to infants by surname, address, and dates of birth and delivery using deterministic and probabilistic algorithms. In a subset previously linked using gold standard identifiers (N=14449), we assessed each approach's sensitivity and positive predictive value (PPV). For deliveries with no gold standard linkage (N=18644), we compared the algorithms' linkage proportions. We repeated our analyses in an independent test set of deliveries from 2009 through 2013. We reviewed medical records to validate a sample of pairs apparently linked by one algorithm but not the other (N=51 or 1.4% of discordant pairs). ResultsIn the 2001-2008 gold standard population, the probabilistic algorithm's sensitivity was 84.1% (95% CI, 83.5-84.7) and PPV 99.3% (99.1-99.4), while the deterministic algorithm had sensitivity 74.5% (73.8-75.2) and PPV 95.7% (95.4-96.0). In the test set, the probabilistic algorithm again had higher sensitivity and PPV. For deliveries in 2001-2008 with no gold standard linkage, the probabilistic algorithm found matched infants for 58.3% and the deterministic algorithm, 52.8%. On medical record review, 100% of linked pairs appeared valid. ConclusionsA probabilistic algorithm improved linkage proportion and accuracy compared to a deterministic algorithm. Better linkage methods can increase the value of EHRs for pregnancy outcomes research. Copyright (c) 2014 John Wiley & Sons, Ltd.
引用
收藏
页码:45 / 51
页数:7
相关论文
共 19 条
[1]   Medication Exposure in Pregnancy Risk Evaluation Program [J].
Andrade, Susan E. ;
Davis, Robert L. ;
Cheetham, T. Craig ;
Cooper, William O. ;
Li, De-Kun ;
Amini, Thushi ;
Beaton, Sarah J. ;
Dublin, Sascha ;
Hammad, Tarek A. ;
Pawloski, Pamala A. ;
Raebel, Marsha A. ;
Smith, David H. ;
Staffa, Judy A. ;
Toh, Sengwee ;
Dashevsky, Inna ;
Haffenreffer, Katherine ;
Lane, Kimberly ;
Platt, Richard ;
Scott, Pamela E. .
MATERNAL AND CHILD HEALTH JOURNAL, 2012, 16 (07) :1349-1354
[2]   Investigating linkage rates among probabilistically linked birth and hospitalization records [J].
Bentley, Jason P. ;
Ford, Jane B. ;
Taylor, Lee K. ;
Irvine, Katie A. ;
Roberts, Christine L. .
BMC MEDICAL RESEARCH METHODOLOGY, 2012, 12
[3]   Data Linkage: A powerful research tool with potential problems [J].
Bohensky, Megan A. ;
Jolley, Damien ;
Sundararajan, Vijaya ;
Evans, Sue ;
Pilcher, David V. ;
Scott, Ian ;
Brand, Caroline A. .
BMC HEALTH SERVICES RESEARCH, 2010, 10
[4]  
Campbell Kevin M, 2008, Health Informatics J, V14, P5, DOI 10.1177/1460458208088855
[5]  
Clark D E, 1995, Proc Annu Symp Comput Appl Med Care, P397
[6]   Accuracy of probabilistic record linkage applied to health databases: systematic review [J].
da Silveira, Daniele Pinto ;
Artmann, Elizabeth .
REVISTA DE SAUDE PUBLICA, 2009, 43 (05) :875-882
[7]   Probabilistic Record Linkage for Monitoring the Safety of Artemisinin-Based Combination Therapy in the First Trimester of Pregnancy in Senegal [J].
Dellicour, Stephanie ;
Brasseur, Philippe ;
Thorn, Per ;
Gaye, Oumar ;
Olliaro, Piero ;
Badiane, Malik ;
Stergachis, Andy ;
ter Kuile, Feiko O. .
DRUG SAFETY, 2013, 36 (07) :505-513
[8]   Evaluation of record linkage between a large healthcare provider and the Utah Population Database [J].
DuVall, Scott L. ;
Fraser, Alison M. ;
Rowe, Kerry ;
Thomas, Alun ;
Mineau, Geraldine P. .
JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, 2012, 19 (E1) :E54-E59
[9]   Data linkage methods used in maternally-linked birth and infant death surveillance data sets from the United States (Georgia, Missouri, Utah and Washington State), Israel, Norway, Scotland and Western Australia [J].
Herman, AA ;
McCarthy, BJ ;
Bakewell, JM ;
Ward, RH ;
Mueller, BA ;
Maconochie, NE ;
Read, AW ;
Zadka, P ;
Skjaerven, R .
PAEDIATRIC AND PERINATAL EPIDEMIOLOGY, 1997, 11 :5-22
[10]   Methods of linking mothers and infants using health plan data for studies of pregnancy outcomes [J].
Johnson, Karin E. ;
Beaton, Sarah J. ;
Andrade, Susan E. ;
Cheetham, T. Craig ;
Scott, Pamela E. ;
Hammad, Tarek A. ;
Dashevsky, Inna ;
Cooper, William O. ;
Davis, Robert L. ;
Pawloski, Pamala A. ;
Raebel, Marsha A. ;
Smith, David H. ;
Toh, Sengwee ;
Li, De-Kun ;
Haffenreffer, Katherine ;
Dublin, Sascha .
PHARMACOEPIDEMIOLOGY AND DRUG SAFETY, 2013, 22 (07) :776-782