coarsened data mechanism;
EM algorithm;
logistic regression;
maximum likelihood estimation;
Newton-Raphson algorithm;
D O I:
10.1046/j.1467-9876.2003.05009.x
中图分类号:
O21 [概率论与数理统计];
C8 [统计学];
学科分类号:
020208 ;
070103 ;
0714 ;
摘要:
We consider generalized linear models with a coarsened covariate. The term 'coarsened' is used here to refer to the case where the exact value of the covariate of interest is not fully observed. Instead, only some set or grouping that contains the exact value is observed. In particular, we propose a likelihood-based method for estimating regression parameters in a generalized linear model relating the mean of the outcome to covariates. We outline Newton-Raphson and EM algorithms for obtaining maximum likelihood estimates of the regression parameters. We also compare and contrast this likelihood-based approach with two somewhat ad hoc procedures: a complete-case analysis in which individuals with coarsened data are excluded and estimation is based on the remaining 'complete cases', and a coarsened data regression model in which the covariate values for all the complete cases are coarsened and then included in a regression model relating the mean to the coarsened covariate. The methodology that is presented is motivated by coarsened data on the racial-ethnicity categorization of patients in the US's National Ambulatory Medical Care Survey, a study to examine the medical care that is provided to a patient in a physician's office. In this study, the outcome of interest is the level of tests (none, non-invasive tests or invasive tests) ordered for the patient at the doctor's visit. One of the covariates of interest is the patient's four-level discrete covariate comprised of four racial-ethnicity categories: white-Hispanic, white-non-Hispanic, African-American-Hispanic and African-American-non-Hispanic. However, of the 19 095 patients in the sample, 14 955 (or 78%) have the exact category of race-ethnicity recorded and 4140 (or 22%) have race-ethnicity coarsened. For the latter group of 4140 individuals, the ethnicity is not recorded, but we know that 3683 are white and 457 are African-American.