Error adjustments for file linking methods using encrypted unique client identifier (eUCI) with application to recently released prisoners who are HIV+

Citations: 13
Authors
Gutman, R. [1]
Sammartino, C. J. [2]
Green, T. C. [3, 4]
Montague, B. T. [5]
Affiliations
[1] Brown Univ, Dept Biostat, Providence, RI 02912 USA
[2] Brown Univ, Dept Hlth Serv Policy & Practice, Providence, RI 02912 USA
[3] Brown Univ, Rhode Isl Hosp, Dept Emergency Med, Warren Alpert Sch Med, Providence, RI 02912 USA
[4] Brown Univ, Rhode Isl Hosp, Dept Epidemiol, Warren Alpert Sch Med, Providence, RI 02912 USA
[5] Brown Univ, Warren Alpert Sch Med, Miriam Hosp, Providence, RI 02912 USA
Funding
National Institutes of Health (USA)
Keywords
file linking; multiple imputation; eUCI; mixture models; MULTIPLE IMPUTATION; ANTIRETROVIRAL THERAPY; CATEGORICAL VARIABLES; INFECTED PRISONERS; LINKAGE; CARE; INFERENCE; MIXTURES;
DOI
10.1002/sim.6586
Chinese Library Classification
Q [Biological Sciences]
Subject classification codes
07; 0710; 09
Abstract
Incarceration provides an opportunity to test for HIV, to provide treatment such as highly active anti-retroviral therapy, and to link infected persons to comprehensive HIV care upon their release. A key factor in assessing the success of a program that links released individuals to care is the time from release to receiving care in the community (linkage time). To estimate the linkage time, records from correction systems are linked to Ryan White Clinic data using the encrypted Unique Client Identifier (eUCI). Most record pairs linked using eUCI belong to the same individual; however, in some cases eUCI links records incorrectly or fails to identify records that should have been linked. We propose a Bayesian procedure that identifies the correctly linked records among all linked records by relying on the relationships between variables that appear in only one of the data sources, as well as variables that exist in both. The procedure generates K datasets in which each pair of linked records is identified as a true link or a false link. The K datasets are analyzed independently, and the results are combined using Rubin's multiple imputation rules. A small validation dataset is used to examine different statistical models and to inform the prior distributions of the parameters. In comparison with previously proposed methods, the proposed method utilizes all of the available data and is both flexible and computationally efficient. In addition, this approach can be applied in other file linking applications. Copyright (C) 2015 John Wiley & Sons, Ltd.
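The combining step mentioned in the abstract can be illustrated with a minimal sketch of Rubin's multiple imputation rules for a scalar estimate. This is not the authors' code: the function name `pool_rubin`, the `PooledEstimate` container, and the numbers in the usage example are hypothetical, and the sketch assumes each of the K analyses returns one point estimate and its variance.

```python
import math
from statistics import mean, variance
from typing import NamedTuple, Sequence


class PooledEstimate(NamedTuple):
    estimate: float            # pooled point estimate (mean over the K datasets)
    total_variance: float      # within-imputation + inflated between-imputation variance
    degrees_of_freedom: float  # Rubin's (1987) reference t degrees of freedom


def pool_rubin(estimates: Sequence[float], variances: Sequence[float]) -> PooledEstimate:
    """Combine K per-dataset estimates and variances with Rubin's rules."""
    k = len(estimates)
    if k < 2 or k != len(variances):
        raise ValueError("need K >= 2 estimates and one variance per estimate")

    q_bar = mean(estimates)   # pooled point estimate
    u_bar = mean(variances)   # average within-imputation variance
    b = variance(estimates)   # between-imputation variance (divisor K - 1)
    total = u_bar + (1 + 1 / k) * b

    if b == 0:
        # No between-imputation variability: the reference t is effectively normal.
        df = math.inf
    else:
        df = (k - 1) * (1 + u_bar / ((1 + 1 / k) * b)) ** 2
    return PooledEstimate(q_bar, total, df)


if __name__ == "__main__":
    # Hypothetical mean linkage times (days) and their variances from K = 5 datasets.
    est = [41.2, 39.8, 42.5, 40.1, 41.0]
    var = [4.0, 4.4, 3.9, 4.2, 4.1]
    pooled = pool_rubin(est, var)
    print(pooled.estimate, math.sqrt(pooled.total_variance), pooled.degrees_of_freedom)
```

In this sketch the total variance exceeds the average within-dataset variance whenever the K estimates disagree, which is how uncertainty about which links are true would carry through to the final inference.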
Pages: 115-129
Page count: 15