coarsened data mechanism;
EM algorithm;
logistic regression;
maximum likelihood estimation;
Newton-Raphson algorithm;
DOI:
10.1046/j.1467-9876.2003.05009.x
Chinese Library Classification:
O21 [Probability Theory and Mathematical Statistics];
C8 [Statistics];
Discipline classification codes:
020208;
070103;
0714;
Abstract:
We consider generalized linear models with a coarsened covariate. The term 'coarsened' is used here to refer to the case where the exact value of the covariate of interest is not fully observed. Instead, only some set or grouping that contains the exact value is observed. In particular, we propose a likelihood-based method for estimating regression parameters in a generalized linear model relating the mean of the outcome to covariates. We outline Newton-Raphson and EM algorithms for obtaining maximum likelihood estimates of the regression parameters. We also compare and contrast this likelihood-based approach with two somewhat ad hoc procedures: a complete-case analysis in which individuals with coarsened data are excluded and estimation is based on the remaining 'complete cases', and a coarsened data regression model in which the covariate values for all the complete cases are coarsened and then included in a regression model relating the mean to the coarsened covariate. The methodology that is presented is motivated by coarsened data on the racial-ethnicity categorization of patients in the US's National Ambulatory Medical Care Survey, a study to examine the medical care that is provided to a patient in a physician's office. In this study, the outcome of interest is the level of tests (none, non-invasive tests or invasive tests) ordered for the patient at the doctor's visit. One of the covariates of interest is the patient's four-level discrete covariate comprised of four racial-ethnicity categories: white-Hispanic, white-non-Hispanic, African-American-Hispanic and African-American-non-Hispanic. However, of the 19 095 patients in the sample, 14 955 (or 78%) have the exact category of race-ethnicity recorded and 4140 (or 22%) have race-ethnicity coarsened. For the latter group of 4140 individuals, the ethnicity is not recorded, but we know that 3683 are white and 457 are African-American.
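The EM idea described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: it assumes a binary outcome (the study's outcome has three levels), a single categorical covariate with a saturated logistic model, and coarsening at random. Under those assumptions the M-step has a closed form, so no Newton-Raphson step is needed inside it. The function name `em_coarsened_logistic` and the `allowed` indicator matrix are hypothetical names introduced for illustration.

```python
import numpy as np

def em_coarsened_logistic(y, allowed, n_iter=200, tol=1e-10):
    """EM sketch for a binary outcome with one coarsened categorical covariate.

    y       : (n,) array of 0/1 outcomes.
    allowed : (n, K) boolean array; allowed[i, k] is True when category k is
              compatible with what was observed for subject i. A fully
              observed subject has exactly one True; a coarsened subject has
              several. Every row must contain at least one True.
    Returns (pi, p, beta): category probabilities P(x = k), outcome
    probabilities P(y = 1 | x = k), and logistic coefficients with
    category 0 as the reference level.
    """
    y = np.asarray(y, dtype=float)
    allowed = np.asarray(allowed, dtype=bool)
    n, K = allowed.shape
    pi = np.full(K, 1.0 / K)   # starting values for P(x = k)
    p = np.full(K, 0.5)        # starting values for P(y = 1 | x = k)

    for _ in range(n_iter):
        # E-step: posterior probability of each compatible category,
        # proportional to P(x = k) * P(y_i | x = k).
        lik = np.where(y[:, None] == 1, p[None, :], 1.0 - p[None, :])
        w = allowed * pi[None, :] * lik
        w /= w.sum(axis=1, keepdims=True)

        # M-step: weighted proportions (closed form for a saturated model).
        new_pi = w.mean(axis=0)
        new_p = (w * y[:, None]).sum(axis=0) / w.sum(axis=0)

        change = max(np.abs(new_pi - pi).max(), np.abs(new_p - p).max())
        pi, p = new_pi, new_p
        if change < tol:
            break

    # Convert cell probabilities to intercept + dummy-variable coefficients.
    p_safe = np.clip(p, 1e-12, 1.0 - 1e-12)
    logit = np.log(p_safe / (1.0 - p_safe))
    beta = np.concatenate(([logit[0]], logit[1:] - logit[0]))
    return pi, p, beta
```

For data like those in the abstract, `allowed` would have four columns, one per race-ethnicity category. A patient recorded only as white would have True in the white-Hispanic and white-non-Hispanic columns and False in the other two, while a patient with the exact category recorded would have a single True. A model with additional covariates or a multi-level outcome would need an iterative weighted fit in the M-step instead of the closed-form proportions used here.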