A comparison of two approaches for power and sample size calculations in logistic regression models

Cited by: 13
Authors
Shieh, G [1]
Affiliations
[1] Natl Chiao Tung Univ, Dept Management Sci, Hsinchu 30050, Taiwan
Keywords
likelihood ratio test; logistic regression; maximum likelihood estimate; power; sample size
DOI
10.1080/03610910008813639
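Chinese Library Classification (CLC) number
O21 [Probability theory and mathematical statistics]; C8 [Statistics]
Subject classification codes
020208; 070103; 0714
Abstract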
Whittemore (1981) proposed an approach for calculating the sample size needed to test hypotheses with specified significance and power against a given alternative for logistic regression with small response probability. Based on the distribution of the covariate, which may be either discrete or continuous, this approach first provides a simple closed-form approximation to the asymptotic covariance matrix of the maximum likelihood estimates, and then uses it to calculate the sample size needed to test a hypothesis about the parameter. Self et al. (1992) described a general approach for power and sample size calculations within the framework of generalized linear models, which include logistic regression as a special case. Their approach is based on an approximation to the distribution of the likelihood ratio statistic. Unlike the Whittemore approach, it is not limited to situations of small response probability. However, it is restricted to models with a finite number of covariate configurations. This study compares the accuracy of the two approaches for power and sample size calculations in logistic regression models with various response probabilities and covariate distributions. The results indicate that the Whittemore approach has a slight advantage in achieving the nominal power in only one case, with small response probability. It is outperformed in all other cases, which have larger response probabilities. In general, the approach proposed in Self et al. (1992) is recommended for all values of the response probability. However, its extension to logistic regression models with an infinite number of covariate configurations involves an arbitrary choice of categorization and leads to a discrete approximation. As shown in this paper, the discrete approximations examined appear to be sufficiently accurate for practical purposes.
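To illustrate the general idea of deriving a sample size from the asymptotic covariance matrix of the maximum likelihood estimates, the following is a minimal sketch, assuming a single covariate with a finite discrete distribution and a two-sided Wald test of a zero slope. It uses the exact expected Fisher information evaluated at the alternative; it is not the small-response-probability approximation of Whittemore (1981), nor the likelihood-ratio-based method of Self et al. (1992). The function names `logistic`, `slope_variance`, and `wald_sample_size` are hypothetical and chosen only for this example.

```python
from math import ceil, exp
from statistics import NormalDist


def logistic(t):
    """Inverse logit: maps a linear predictor to a response probability."""
    return 1.0 / (1.0 + exp(-t))


def slope_variance(b0, b1, xs, probs):
    """Per-observation asymptotic variance of the slope MLE.

    xs and probs give the covariate values and their probabilities
    (an assumed finite covariate distribution).
    """
    i00 = i01 = i11 = 0.0
    for x, w in zip(xs, probs):
        p = logistic(b0 + b1 * x)
        v = w * p * (1.0 - p)      # weight times Bernoulli variance
        i00 += v
        i01 += v * x
        i11 += v * x * x
    det = i00 * i11 - i01 * i01    # determinant of the 2x2 information matrix
    return i00 / det               # slope entry of the inverse information


def wald_sample_size(b0, b1, xs, probs, alpha=0.05, power=0.9):
    """Approximate n for a two-sided Wald test of H0: slope = 0."""
    z = NormalDist()
    z_alpha = z.inv_cdf(1.0 - alpha / 2.0)
    z_beta = z.inv_cdf(power)
    return ceil((z_alpha + z_beta) ** 2 * slope_variance(b0, b1, xs, probs) / b1 ** 2)
```

For example, `wald_sample_size(-2.0, 0.5, [0, 1], [0.5, 0.5])` gives the approximate total sample size for a balanced binary covariate under these assumptions. Evaluating the variance only at the alternative is a simplification; the approaches compared in the paper treat the null and alternative distributions more carefully.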
Pages: 763-791
Number of pages: 29
Related papers
7 in total
[1] HSIEH, FY. SAMPLE-SIZE TABLES FOR LOGISTIC-REGRESSION [J]. STATISTICS IN MEDICINE, 1989, 8 (07): 795-802
[2] HULLEY, SB; ROSENMAN, RH; BAWOL, RD; BRAND, RJ. EPIDEMIOLOGY AS A GUIDE TO CLINICAL DECISIONS - THE ASSOCIATION BETWEEN TRIGLYCERIDE AND CORONARY HEART-DISEASE [J]. NEW ENGLAND JOURNAL OF MEDICINE, 1980, 302 (25): 1383-1389
[3] *SAS I, 1989, SAS IML SOFTW US REF
[4] SELF, SG; MAURITSEN, RH. POWER SAMPLE-SIZE CALCULATIONS FOR GENERALIZED LINEAR-MODELS [J]. BIOMETRICS, 1988, 44 (01): 79-86
[5] SELF, SG; MAURITSEN, RH; OHARA, J. POWER CALCULATIONS FOR LIKELIHOOD RATIO TESTS IN GENERALIZED LINEAR-MODELS [J]. BIOMETRICS, 1992, 48 (01): 31-39
[6] SHIEH G, 1998, ANN JOINT STAT M AM