Error adjustments for file linking methods using encrypted unique client identifier (eUCI) with application to recently released prisoners who are HIV

被引:13
作者
Gutman, R. [1 ]
Sammartino, C. J. [2 ]
Green, T. C. [3 ,4 ]
Montague, B. T. [5 ]
机构
[1] Brown Univ, Dept Biostat, Providence, RI 02912 USA
[2] Brown Univ, Dept Hlth Serv Policy & Practice, Providence, RI 02912 USA
[3] Brown Univ, Rhode Isl Hosp, Dept Emergency Med, Warren Alpert Sch Med, Providence, RI 02912 USA
[4] Brown Univ, Rhode Isl Hosp, Dept Epidemiol, Warren Alpert Sch Med, Providence, RI 02912 USA
[5] Brown Univ, Warren Alpert Sch Med, Miriam Hosp, Providence, RI 02912 USA
基金
美国国家卫生研究院;
关键词
file linking; multiple imputation; eUCI; mixture models; MULTIPLE IMPUTATION; ANTIRETROVIRAL THERAPY; CATEGORICAL VARIABLES; INFECTED PRISONERS; LINKAGE; CARE; INFERENCE; MIXTURES;
D O I
10.1002/sim.6586
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Incarceration provides an opportunity to test for HIV, provide treatment such as highly active anti-retroviral therapy, as well as link infected persons to comprehensive HIV care upon their release. A key factor in assessing the success of a program that links released individuals to care is the time from release to receiving care in the community (linkage time). To estimate the linkage time, records from correction systems are linked to Ryan White Clinic data using encrypted Unique Client Identifier (eUCI). Most of the records that were linked using eUCI belong to the same individual; however, in some cases, it may link records incorrectly, or not identify records that should have been linked. We propose a Bayesian procedure that relies on the relationships between variables that appear in either of the data sources, as well as variables that exists in both to identify correctly linked records among all linked records. The procedure generates K datasets in which each pair of linked records is identified as a true link or a false link. The K datasets are analyzed independently, and the results are combined using Rubin's multiple imputation rules. A small validation dataset is used to examine different statistical models and to inform the prior distributions of the parameters. In comparison with previously proposed methods, the proposed method utilizes all of the available data and is both flexible and computationally efficient. In addition, this approach can be applied in other file linking applications. Copyright (C) 2015 John Wiley & Sons, Ltd.
引用
收藏
页码:115 / 129
页数:15
相关论文
共 61 条
  • [1] Accessing Antiretroviral Therapy Following Release From Prison
    Baillargeon, Jacques
    Giordano, Thomas P.
    Rich, Josiah D.
    Wu, Z. Helen
    Wells, Katherine
    Pollock, Brad H.
    Paar, David P.
    [J]. JAMA-JOURNAL OF THE AMERICAN MEDICAL ASSOCIATION, 2009, 301 (08): : 848 - 857
  • [2] Small-sample degrees of freedom with multiple imputation
    Barnard, J
    Rubin, DB
    [J]. BIOMETRIKA, 1999, 86 (04) : 948 - 955
  • [3] BELIN TR, 1995, J AM STAT ASSOC, V90, P694
  • [4] Campbell Kevin M, 2008, Health Informatics J, V14, P5, DOI 10.1177/1460458208088855
  • [5] Chambers R, 2009, 1809 U WOLL CTR STAT
  • [6] Data and Reporting Team (DART), 2011, THE EUCI AND YOU
  • [7] Data and Reporting Team (DART), 2014, TECHNICAL REPORT
  • [8] MAXIMUM LIKELIHOOD FROM INCOMPLETE DATA VIA EM ALGORITHM
    DEMPSTER, AP
    LAIRD, NM
    RUBIN, DB
    [J]. JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-METHODOLOGICAL, 1977, 39 (01): : 1 - 38
  • [9] A THEORY FOR RECORD LINKAGE
    FELLEGI, IP
    SUNTER, AB
    [J]. JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 1969, 64 (328) : 1183 - &
  • [10] Fortini M., 2001, Research in Official Statistics, V4, P185