Estimating and Identifying Unspecified Correlation Structure for Longitudinal Data

被引:5
|
作者
Hu, Jianhua [1 ]
Wang, Peng [2 ]
Qu, Annie [3 ]
机构
[1] Univ Texas MD Anderson Canc Ctr, Dept Biostat, Houston, TX 77030 USA
[2] Univ Cincinnati, Dept Operat Business Analyt & Informat Syst, Cincinnati, OH 45221 USA
[3] Univ Illinois, Dept Stat, Champaign, IL 61820 USA
基金
美国国家科学基金会;
关键词
Eigenvector decomposition; Correlated data; Oracle property; Quadratic inference function; SCAD penalty; LARGE COVARIANCE MATRICES; NONCONCAVE PENALIZED LIKELIHOOD; GENERALIZED LINEAR-MODELS; ESTIMATING EQUATIONS; SPARSE ESTIMATION; ORACLE PROPERTIES; REGRESSION; SELECTION; LASSO; INVERSE;
D O I
10.1080/10618600.2014.909733
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
Identifying correlation structure is important to achieving estimation efficiency in analyzing longitudinal data, and is also crucial for drawing valid statistical inference for large-size clustered data. In this article, we propose a nonparametric method to estimate the correlation structure, which is applicable for discrete longitudinal data. We use eigenvector-based basis matrices to approximate the inverse of the empirical correlation matrix and determine the number of basis matrices via model selection. A penalized objective function based on the difference between the empirical and model approximation of the correlation matrices is adopted to select an informative structure for the correlation matrix. The eigenvector representation of the correlation estimation is capable of reducing the risk of model misspecification, and also provides useful information on the specific within-cluster correlation pattern of the data. We show that the proposed method possesses the oracle property and selects the true correlation structure consistently. The proposed method is illustrated through simulations and two data examples on air pollution and sonar signal studies .
引用
收藏
页码:455 / 476
页数:22
相关论文
共 50 条
  • [31] Performance of weighted estimating equations for longitudinal binary data with drop-outs missing at random
    Preisser, JS
    Lohman, KK
    Rathouz, PJ
    STATISTICS IN MEDICINE, 2002, 21 (20) : 3035 - 3054
  • [32] Robust estimation and model identification for longitudinal data varying-coefficient model
    Liu, Shu
    Lian, Heng
    COMMUNICATIONS IN STATISTICS-THEORY AND METHODS, 2018, 47 (11) : 2701 - 2719
  • [33] Selection Criterion of Working Correlation Structure for Spatially Correlated Data
    dos Santos, Marcelo
    De Bastiani, Fernanda
    Uribe-Opazo, Miguel A.
    Galea, Manuel
    AMERICAN STATISTICIAN, 2023, 77 (03) : 283 - 291
  • [34] Working-correlation-structure identification in generalized estimating equations
    Hin, Lin-Yee
    Wang, You-Gan
    STATISTICS IN MEDICINE, 2009, 28 (04) : 642 - 658
  • [35] A Comparison of Correlation Structure Selection Penalties for Generalized Estimating Equations
    Westgate, Philip M.
    Burchett, Woodrow W.
    AMERICAN STATISTICIAN, 2017, 71 (04) : 344 - 353
  • [36] Variable selection and estimation for longitudinal survey data
    Wang, Li
    Wang, Suojin
    Wang, Guannan
    JOURNAL OF MULTIVARIATE ANALYSIS, 2014, 130 : 409 - 424
  • [37] OPTICAL CROSS-CORRELATION FILTERS: AN ECONOMICAL APPROACH FOR IDENTIFYING SNe Ia AND ESTIMATING THEIR REDSHIFTS
    Scolnic, Daniel M.
    Riess, Adam G.
    Huber, Mark E.
    Rest, Armin
    Stubbs, Christoper W.
    Tonry, John L.
    ASTROPHYSICAL JOURNAL, 2009, 706 (01) : 94 - 107
  • [38] Estimating Haplotype Structure and Frequencies: A Bayesian Approach to Unknown Design in Pooled Genomic Data
    Wang, Yuexuan
    Dutta, Ritabrata
    Futschik, Andreas
    JOURNAL OF COMPUTATIONAL BIOLOGY, 2024, 31 (08) : 708 - 726
  • [39] Estimating Network Structure from Incomplete Event Data
    Mark, Benjamin
    Raskutti, Garvesh
    Willett, Rebecca
    22ND INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 89, 2019, 89
  • [40] Analysis of data with multiple sources of correlation in the framework of generalized estimating equations
    Shults, J
    Whitt, MC
    Kumanyika, S
    STATISTICS IN MEDICINE, 2004, 23 (20) : 3209 - 3226