Estimating and Identifying Unspecified Correlation Structure for Longitudinal Data

被引:5
|
作者
Hu, Jianhua [1 ]
Wang, Peng [2 ]
Qu, Annie [3 ]
机构
[1] Univ Texas MD Anderson Canc Ctr, Dept Biostat, Houston, TX 77030 USA
[2] Univ Cincinnati, Dept Operat Business Analyt & Informat Syst, Cincinnati, OH 45221 USA
[3] Univ Illinois, Dept Stat, Champaign, IL 61820 USA
基金
美国国家科学基金会;
关键词
Eigenvector decomposition; Correlated data; Oracle property; Quadratic inference function; SCAD penalty; LARGE COVARIANCE MATRICES; NONCONCAVE PENALIZED LIKELIHOOD; GENERALIZED LINEAR-MODELS; ESTIMATING EQUATIONS; SPARSE ESTIMATION; ORACLE PROPERTIES; REGRESSION; SELECTION; LASSO; INVERSE;
D O I
10.1080/10618600.2014.909733
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
Identifying correlation structure is important to achieving estimation efficiency in analyzing longitudinal data, and is also crucial for drawing valid statistical inference for large-size clustered data. In this article, we propose a nonparametric method to estimate the correlation structure, which is applicable for discrete longitudinal data. We use eigenvector-based basis matrices to approximate the inverse of the empirical correlation matrix and determine the number of basis matrices via model selection. A penalized objective function based on the difference between the empirical and model approximation of the correlation matrices is adopted to select an informative structure for the correlation matrix. The eigenvector representation of the correlation estimation is capable of reducing the risk of model misspecification, and also provides useful information on the specific within-cluster correlation pattern of the data. We show that the proposed method possesses the oracle property and selects the true correlation structure consistently. The proposed method is illustrated through simulations and two data examples on air pollution and sonar signal studies .
引用
收藏
页码:455 / 476
页数:22
相关论文
共 50 条
  • [41] Generalized estimating equations for ordinal data: A note on working correlation structures
    Lumley, T
    BIOMETRICS, 1996, 52 (01) : 354 - 361
  • [42] A conditional estimating equation approach for recurrent event data with additional longitudinal information
    Shen, Ye
    Huang, Hui
    Guan, Yongtao
    STATISTICS IN MEDICINE, 2016, 35 (24) : 4306 - 4319
  • [43] Efficient quantile marginal regression for longitudinal data with dropouts
    Cho, Hyunkeun
    Hong, Hyokyoung Grace
    Kim, Mi-Ok
    BIOSTATISTICS, 2016, 17 (03) : 561 - 575
  • [44] Penalized Generalized Estimating Equations for High-Dimensional Longitudinal Data Analysis
    Wang, Lan
    Zhou, Jianhui
    Qu, Annie
    BIOMETRICS, 2012, 68 (02) : 353 - 360
  • [45] A general framework for estimating volume-outcome associations from longitudinal data
    French, Benjamin
    Farjah, Farhood
    Flum, David R.
    Heagerty, Patrick J.
    STATISTICS IN MEDICINE, 2012, 31 (04) : 366 - 382
  • [46] A Generalized Estimating Equation in Longitudinal Data to Determine an Efficiency Indicator for Football Teams
    Crisci, Anna
    D'Ambra, Luigi
    Esposito, Vincenzo
    SOCIAL INDICATORS RESEARCH, 2019, 146 (1-2) : 249 - 261
  • [47] ESTIMATING LARGE CORRELATION MATRICES FOR INTERNATIONAL MIGRATION
    Azose, Jonathan J.
    Raftery, Adrian E.
    ANNALS OF APPLIED STATISTICS, 2018, 12 (02) : 940 - 970
  • [48] Robust statistical inference for longitudinal data with nonignorable dropouts
    Shao, Yujing
    Ma, Wei
    Wang, Lei
    STATISTICS, 2022, 56 (05) : 1072 - 1094
  • [49] Weighted Generalized Estimating Functions for Longitudinal Response and Covariate Data That Are Missing at Random
    Chen, Baojiang
    Yi, Grace Y.
    Cook, Richard J.
    JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2010, 105 (489) : 336 - 353
  • [50] Joint modeling of multivariate nonparametric longitudinal data and survival data: A local smoothing approach
    You, Lu
    Qiu, Peihua
    STATISTICS IN MEDICINE, 2021, 40 (29) : 6689 - 6706