Methods for improving regression analysis for skewed continuous or counted responses

被引:224
作者
Afifi, Abdelmonem A. [1 ]
Kotlerman, Jenny B.
Ettner, Susan L.
Cowan, Marie
机构
[1] Univ Calif Los Angeles, Sch Publ Hlth, Los Angeles, CA 90095 USA
[2] Univ Calif Los Angeles, Sch Nursing, Los Angeles, CA 90095 USA
[3] Univ Calif Los Angeles, Sch Med, Los Angeles, CA 90095 USA
关键词
smear factor; multiple imputation; bootstrap; attrition weight; zero inflation; two-part models;
D O I
10.1146/annurev.publhealth.28.082206.094100
中图分类号
R1 [预防医学、卫生学];
学科分类号
1004 ; 120402 ;
摘要
Standard inference procedures for regression analysis make assumptions that are rarely satisfied in practice. Adjustments must be made to insure the validity of statistical inference. These adjustments, known for many years, are used routinely by some health researchers but not by others. We review some of these methods and give an example of their use in a health services study for a continuous and a count outcome. For the continuous outcome, we describe retransformation using the smear factor, accounting for missing cases via multiple imputation and attrition weights and improving results with bootstrap methods. For the count outcome, we describe zero inflated Poisson and negative binomial models and the two-part model to account for overabundance of zero values. Recent advances in computing and software development have produced user-friendly computer programs that enable the data analyst to improve prediction and inference based on regression analysis.
引用
收藏
页码:95 / 111
页数:19
相关论文
共 68 条