A SEMIPARAMETRIC MODEL FOR CLUSTER DATA

被引:28
|
作者
Zhang, Wenyang [1 ]
Fan, Jianqing [2 ]
Sun, Yan [3 ]
机构
[1] Univ Bath, Dept Math Sci, Bath BA2 7AY, Avon, England
[2] Princeton Univ, Dept Operat Res & Financial Engn, Princeton, NJ 08544 USA
[3] Shanghai Univ Finance & Econ, Sch Econ, Shanghai, Peoples R China
来源
ANNALS OF STATISTICS | 2009年 / 37卷 / 5A期
关键词
Varying-coefficient models; local linear modeling; cluster level variable; cluster effect; VARYING-COEFFICIENT MODELS; LONGITUDINAL DATA; NONPARAMETRIC REGRESSION; CONFIDENCE BANDS; INFERENCE; SELECTION; SPLINE;
D O I
10.1214/08-AOS662
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
In the analysis of cluster data, the regression coefficients are frequently assumed to be the same across all clusters. This hampers the ability to Study the varying impacts of factors on each cluster. In this paper, a semiparametric model is introduced to account for varying impacts of factors over clusters by using cluster-level covariates. It achieves the parsimony of parametrization and allows the explorations of nonlinear interactions. The random effect ill the semiparametric model also accounts for within-cluster correlation. Local. linear-based estimation procedure is proposed for estimating functional coefficients, residual variance and within-cluster correlation matrix. The asymptotic properties of the proposed estimators are established, and the method for constructing Simultaneous confidence bands are proposed and studied. In addition, relevant hypothesis testing problems ire addressed. Simulation studies are carried out to demonstrate the methodological power of the proposed methods in the finite sample. The proposed model and methods are used to analyse the second birth interval in Bangladesh, leading to some interesting findings.
引用
收藏
页码:2377 / 2408
页数:32
相关论文
共 50 条