Using the Bayesian Improved Surname Geocoding Method (BISG) to Create a Working Classification of Race and Ethnicity in a Diverse Managed Care Population: A Validation Study

被引:43
作者
Adjaye-Gbewonyo, Dzifa [1 ]
Bednarczyk, Robert A. [1 ,2 ]
Davis, Robert L. [1 ]
Omer, Saad B. [1 ,2 ]
机构
[1] Kaiser Permanente, Ctr Hlth Res South East, Atlanta, GA 30305 USA
[2] Emory Univ, Rollins Sch Publ Hlth, Atlanta, GA 30322 USA
关键词
CHINESE ETHNICITY; COLLECTION;
D O I
10.1111/1475-6773.12089
中图分类号
R19 [保健组织与事业(卫生事业管理)];
学科分类号
摘要
Objective To validate classification of race/ethnicity based on the Bayesian Improved Surname Geocoding method (BISG) and assess variations in validity by gender and age. Data Sources/Study Setting Secondary data on members of Kaiser Permanente Georgia, an integrated managed care organization, through 2010. Study Design For 191,494 members with self-reported race/ethnicity, probabilities for belonging to each of six race/ethnicity categories predicted from the BISG algorithm were used to assign individuals to a race/ethnicity category over a range of cutoffs greater than a probability of 0.50. Overall as well as gender- and age-stratified sensitivity, specificity, positive predictive value (PPV), and negative predictive value (NPV) were calculated. Receiver operating characteristic (ROC) curves were generated and used to identify optimal cutoffs for race/ethnicity assignment. Principal Findings The overall cutoffs for assignment that optimized sensitivity and specificity ranged from 0.50 to 0.57 for the four main racial/ethnic categories (White, Black, Asian/Pacific Islander, Hispanic). Corresponding sensitivity, specificity, PPV, and NPV ranged from 64.4 to 81.4 percent, 80.8 to 99.7 percent, 75.0 to 91.6 percent, and 79.4 to 98.0 percent, respectively. Accuracy of assignment was better among males and individuals of 65 years or older. Conclusions BISG may be useful for classifying race/ethnicity of health plan members when needed for health care studies. © Health Research and Educational Trust.
引用
收藏
页码:268 / 283
页数:16
相关论文
共 22 条
  • [1] Understanding diagnostic tests 3: receiver operating characteristic curves
    Akobeng, Anthony K.
    [J]. ACTA PAEDIATRICA, 2007, 96 (05) : 644 - 647
  • [2] America's Health Insurance Plans and Robert Wood Johnson Foundation, 2004, COLL RAC ETHN DAT HL
  • [3] Elliott M. N., 2009, USE INDIRECT MEASURE
  • [4] A new method for estimating race/ethnicity and associated disparities where administrative records lack self-reported race/ethnicity
    Elliott, Marc N.
    Fremont, Allen
    Morrison, Peter A.
    Pantoja, Philip
    Lurie, Nicole
    [J]. HEALTH SERVICES RESEARCH, 2008, 43 (05) : 1722 - 1736
  • [5] Using the Census Bureau's surname list to improve estimates of race/ ethnicity and associated disparities
    Elliott M.N.
    Morrison P.A.
    Fremont A.
    McCaffrey D.F.
    Pantoja P.
    Lurie N.
    [J]. Health Services and Outcomes Research Methodology, 2009, 9 (2) : 69 - 83
  • [6] Collection Of Race And Ethnicity Data By Health Plans Has Grown Substantially, But Opportunities Remain To Expand Efforts
    Escarce, Jose J.
    Carreon, Rita
    Veselovskiy, German
    Lawson, Elisa H.
    [J]. HEALTH AFFAIRS, 2011, 30 (10) : 1984 - 1991
  • [7] Esri, 2012, DEM DAT INCL US POP
  • [8] Use of geocoding and surname analysis to estimate race and ethnicity
    Fiscella, Kevin
    Fremont, Allen M.
    [J]. HEALTH SERVICES RESEARCH, 2006, 41 (04) : 1482 - 1500
  • [9] Gazmararian J, 2012, AM J MANAG CARE, V18, pE254
  • [10] Gonen M., 2007, Analyzing receiver operating characteristic curves with SAS