Estimating and Identifying Unspecified Correlation Structure for Longitudinal Data

被引:5
|
作者
Hu, Jianhua [1 ]
Wang, Peng [2 ]
Qu, Annie [3 ]
机构
[1] Univ Texas MD Anderson Canc Ctr, Dept Biostat, Houston, TX 77030 USA
[2] Univ Cincinnati, Dept Operat Business Analyt & Informat Syst, Cincinnati, OH 45221 USA
[3] Univ Illinois, Dept Stat, Champaign, IL 61820 USA
基金
美国国家科学基金会;
关键词
Eigenvector decomposition; Correlated data; Oracle property; Quadratic inference function; SCAD penalty; LARGE COVARIANCE MATRICES; NONCONCAVE PENALIZED LIKELIHOOD; GENERALIZED LINEAR-MODELS; ESTIMATING EQUATIONS; SPARSE ESTIMATION; ORACLE PROPERTIES; REGRESSION; SELECTION; LASSO; INVERSE;
D O I
10.1080/10618600.2014.909733
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
Identifying correlation structure is important to achieving estimation efficiency in analyzing longitudinal data, and is also crucial for drawing valid statistical inference for large-size clustered data. In this article, we propose a nonparametric method to estimate the correlation structure, which is applicable for discrete longitudinal data. We use eigenvector-based basis matrices to approximate the inverse of the empirical correlation matrix and determine the number of basis matrices via model selection. A penalized objective function based on the difference between the empirical and model approximation of the correlation matrices is adopted to select an informative structure for the correlation matrix. The eigenvector representation of the correlation estimation is capable of reducing the risk of model misspecification, and also provides useful information on the specific within-cluster correlation pattern of the data. We show that the proposed method possesses the oracle property and selects the true correlation structure consistently. The proposed method is illustrated through simulations and two data examples on air pollution and sonar signal studies .
引用
收藏
页码:455 / 476
页数:22
相关论文
共 50 条
  • [21] Estimating missing reference evapotranspiration data by correlation analysis
    Eching, SO
    PROCEEDINGS OF THE IVTH INTERNATIONAL SYMPOSIUM ON IRRIGATION OF HORTICULTURAL CROPS, 2004, (664): : 181 - 187
  • [22] Empirical likelihood analysis of longitudinal data involving within-subject correlation
    Hu, Shuang
    Lin, Lu
    ACTA MATHEMATICAE APPLICATAE SINICA-ENGLISH SERIES, 2012, 28 (04): : 731 - 744
  • [23] Selection of working correlation structure in generalized estimating equations
    Wang, You-Gan
    Fu, Liya
    STATISTICS IN MEDICINE, 2017, 36 (14) : 2206 - 2219
  • [24] Smoothing combined estimating equations in quantile regression for longitudinal data
    Chenlei Leng
    Weiping Zhang
    Statistics and Computing, 2014, 24 : 123 - 136
  • [25] Conditional generalized estimating equations for the analysis of clustered and longitudinal data
    Goetgeluk, Sylvie
    Vansteelandt, Stijn
    BIOMETRICS, 2008, 64 (03) : 772 - 780
  • [26] Weighted estimating equation: modified GEE in longitudinal data analysis
    Liu, Tianqing
    Bai, Zhidong
    Zhang, Baoxue
    FRONTIERS OF MATHEMATICS IN CHINA, 2014, 9 (02) : 329 - 353
  • [27] Penalized joint generalized estimating equations for longitudinal binary data
    Huang, Youjun
    Pan, Jianxin
    BIOMETRICAL JOURNAL, 2022, 64 (01) : 57 - 73
  • [28] Smoothing combined estimating equations in quantile regression for longitudinal data
    Leng, Chenlei
    Zhang, Weiping
    STATISTICS AND COMPUTING, 2014, 24 (01) : 123 - 136
  • [29] Testing ignorable missingness in estimating equation approaches for longitudinal data
    Qu, A
    Song, PXK
    BIOMETRIKA, 2002, 89 (04) : 841 - 850
  • [30] Generalized estimating equations with stabilized working correlation structure
    Kwon, Yongchan
    Choi, Young-Geun
    Park, Taesung
    Ziegler, Andreas
    Paik, Myunghee Cho
    COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2017, 106 : 1 - 11