Improving semiparametric estimation by using surrogate data

被引:28
作者
Chen, Song Xi [1 ,2 ]
Leung, Denis H. Y. [3 ]
Qin, Jing [4 ]
机构
[1] Iowa State Univ, Dept Stat, Ames, IA 50011 USA
[2] Peking Univ, Beijing 100871, Peoples R China
[3] Singapore Management Univ, Singapore, Singapore
[4] NIH, Bethesda, MD 20892 USA
关键词
empirical likelihood; estimating equations; missing values; surrogate outcome;
D O I
10.1111/j.1467-9868.2008.00662.x
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
The paper considers estimating a parameter beta that defines an estimating function U(y, x, beta) for an outcome variable y and its covariate x when the outcome is missing in some of the observations. We assume that, in addition to the outcome and the covariate, a surrogate outcome is available in every observation. The efficiency of existing estimators for beta depends critically on correctly specifying the conditional expectation of U given the surrogate and the covariate. When the conditional expectation is not correctly specified, which is the most likely scenario in practice, the efficiency of estimation can be severely compromised even if the propensity function (of missingness) is correctly specified. We propose an estimator that is robust against the choice of the conditional expectation via an empirical likelihood. We demonstrate that the estimator proposed achieves a gain in efficiency whether the conditional score is correctly specified or not. When the conditional score is correctly specified, the estimator reaches the semiparametric variance bound within the class of estimating functions that are generated by U. The practical performance of the estimator is evaluated by using simulation and a data set that is based on the 1996 US presidential election.
引用
收藏
页码:803 / 823
页数:21
相关论文
共 34 条
[1]   Economics, entitlements, and social issues: Voter choice in the 1996 presidential election [J].
Alvarez, RM ;
Nagler, J .
AMERICAN JOURNAL OF POLITICAL SCIENCE, 1998, 42 (04) :1349-1363
[2]  
[Anonymous], 2002, STAT ANAL MISSING VA
[3]   Resolving paradoxes involving surrogate end points [J].
Baker, SG ;
Izmirlian, G ;
Kipnis, V .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES A-STATISTICS IN SOCIETY, 2005, 168 :753-762
[4]   Surrogate endpoints: Wishful thinking or reality? [J].
Baker, Stuart G. .
JNCI-JOURNAL OF THE NATIONAL CANCER INSTITUTE, 2006, 98 (08) :502-503
[5]   On the use of surrogate end points in randomized trials [J].
Begg, CB ;
Leung, DHY .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES A-STATISTICS IN SOCIETY, 2000, 163 :15-24
[6]  
Burzykowski T., 2005, EVALUATION SURROGATE
[7]  
CASSEL CM, 1976, BIOMETRIKA, V63, P615, DOI 10.1093/biomet/63.3.615
[8]   Semiparametric efficient estimation for the auxiliary outcome problem with the conditional mean model [J].
Chen, JB ;
Breslow, NE .
CANADIAN JOURNAL OF STATISTICS-REVUE CANADIENNE DE STATISTIQUE, 2004, 32 (04) :359-372
[9]   Information recovery in a study with surrogate endpoints [J].
Chen, SX ;
Leung, DHY ;
Qin, J .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2003, 98 (464) :1052-1062
[10]   Measurement error models with auxiliary data [J].
Chen, XH ;
Hong, H ;
Tamer, E .
REVIEW OF ECONOMIC STUDIES, 2005, 72 (02) :343-366