Estimating and Identifying Unspecified Correlation Structure for Longitudinal Data

被引:5
|
作者
Hu, Jianhua [1 ]
Wang, Peng [2 ]
Qu, Annie [3 ]
机构
[1] Univ Texas MD Anderson Canc Ctr, Dept Biostat, Houston, TX 77030 USA
[2] Univ Cincinnati, Dept Operat Business Analyt & Informat Syst, Cincinnati, OH 45221 USA
[3] Univ Illinois, Dept Stat, Champaign, IL 61820 USA
基金
美国国家科学基金会;
关键词
Eigenvector decomposition; Correlated data; Oracle property; Quadratic inference function; SCAD penalty; LARGE COVARIANCE MATRICES; NONCONCAVE PENALIZED LIKELIHOOD; GENERALIZED LINEAR-MODELS; ESTIMATING EQUATIONS; SPARSE ESTIMATION; ORACLE PROPERTIES; REGRESSION; SELECTION; LASSO; INVERSE;
D O I
10.1080/10618600.2014.909733
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
Identifying correlation structure is important to achieving estimation efficiency in analyzing longitudinal data, and is also crucial for drawing valid statistical inference for large-size clustered data. In this article, we propose a nonparametric method to estimate the correlation structure, which is applicable for discrete longitudinal data. We use eigenvector-based basis matrices to approximate the inverse of the empirical correlation matrix and determine the number of basis matrices via model selection. A penalized objective function based on the difference between the empirical and model approximation of the correlation matrices is adopted to select an informative structure for the correlation matrix. The eigenvector representation of the correlation estimation is capable of reducing the risk of model misspecification, and also provides useful information on the specific within-cluster correlation pattern of the data. We show that the proposed method possesses the oracle property and selects the true correlation structure consistently. The proposed method is illustrated through simulations and two data examples on air pollution and sonar signal studies .
引用
收藏
页码:455 / 476
页数:22
相关论文
共 50 条
  • [1] Correlation structure selection for longitudinal data with diverging cluster size
    Wang, Peng
    Zhou, Jianhui
    Qu, Annie
    CANADIAN JOURNAL OF STATISTICS-REVUE CANADIENNE DE STATISTIQUE, 2016, 44 (03): : 343 - 360
  • [2] Informative Estimation and Selection of Correlation Structure for Longitudinal Data
    Zhou, Jianhui
    Qu, Annie
    JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2012, 107 (498) : 701 - 710
  • [3] Improving the correlation structure selection approach for generalized estimating equations and balanced longitudinal data
    Westgate, Philip M.
    STATISTICS IN MEDICINE, 2014, 33 (13) : 2222 - 2237
  • [4] NONPARAMETRIC INDEPENDENCE SCREENING AND STRUCTURE IDENTIFICATION FOR ULTRA-HIGH DIMENSIONAL LONGITUDINAL DATA
    Cheng, Ming-Yen
    Honda, Toshio
    Li, Jialiang
    Peng, Heng
    ANNALS OF STATISTICS, 2014, 42 (05) : 1819 - 1849
  • [5] Exploration of model misspecification in latent class methods for longitudinal data: Correlation structure matters
    Neely, Megan L.
    Pieper, Carl F.
    Gu, Bida
    Dmitrieva, Natalia O.
    Pendergast, Jane F.
    STATISTICS IN MEDICINE, 2023, 42 (14) : 2420 - 2438
  • [6] Longitudinal data estimating equations for dispersion models
    Artes, R
    Jorgensen, B
    SCANDINAVIAN JOURNAL OF STATISTICS, 2000, 27 (02) : 321 - 334
  • [7] SPARSE VARYING COEFFICIENT MODELS FOR LONGITUDINAL DATA
    Noh, Hoh Suk
    Park, Byeong U.
    STATISTICA SINICA, 2010, 20 (03) : 1183 - 1202
  • [8] Optimal model averaging estimation for correlation structure in generalized estimating equations
    Fang, Fang
    Li, Jialiang
    Wang, Jingli
    COMMUNICATIONS IN STATISTICS-SIMULATION AND COMPUTATION, 2019, 48 (05) : 1574 - 1593
  • [9] Estimating missing data using novel correlation maximization based methods
    Sefidian, Amir Masoud
    Daneshpour, Negin
    APPLIED SOFT COMPUTING, 2020, 91
  • [10] Conditional mix-GEE models for longitudinal data with unspecified random-effects distributions
    Xing, Yanchun
    Xu, Lili
    Ma, Wenqing
    Zhu, Zhichuan
    COMMUNICATIONS IN STATISTICS-THEORY AND METHODS, 2018, 47 (04) : 862 - 876