STATISTICAL INFERENCE FOR GENETIC RELATEDNESS BASED ON HIGH-DIMENSIONAL LOGISTIC REGRESSION

被引:1
|
作者
Ma, Rong [1 ]
Guo, Zijian [2 ]
Cai, T. Tony [3 ]
Li, Hongzhe [4 ]
机构
[1] Stanford Univ, Dept Stat, Stanford, CA 02135 USA
[2] Rutgers State Univ, Dept Stat, Piscataway, NJ 08854 USA
[3] Univ Penn, Dept Stat & Data Sci, Philadelphia, PA 19104 USA
[4] Univ Penn, Perelman Sch Med, Dept Biostat Epidemiol & Informat, Philadelphia, PA 19104 USA
关键词
Confidence interval; debiasing methods; functional estimation; genetic correlation; hypothesis testing; GENERALIZED LINEAR-MODELS; CONFIDENCE-INTERVALS; HERITABILITY; ARCHITECTURE; METAANALYSIS; COVARIANCE; DISEASES; REGIONS; COMMON; TESTS;
D O I
10.5705/ss.202021.0386
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
We examine statistical inference for genetic relatedness between binary traits, based on individual -level genome-wide association data. Specifically, for high -dimensional logistic regression models, we define parameters characterizing the cross -trait genetic correlation, genetic covariance, and trait -specific genetic variance. We develop a novel weighted debiasing method for the logistic Lasso estimator and propose computationally efficient debiased estimators. Further more, we study the rates of convergence for these estimators and establish their asymptotic normality under mild conditions. Moreover, we construct confidence intervals and statistical tests for these parameters, and provide theoretical justifications for the methods, including the coverage probability and expected length of the confidence intervals, and the size and power of the proposed tests. Numerical studies under both modelgenerated data and simulated genetic data show the superiority of the proposed methods. By analyzing a real data set on autoimmune diseases, we demonstrate their ability to obtain novel insights about the shared genetic architecture between 10 pediatric autoimmune diseases.
引用
收藏
页码:1023 / 1043
页数:21
相关论文
共 50 条
  • [1] Debiased inference for heterogeneous subpopulations in a high-dimensional logistic regression model
    Kim, Hyunjin
    Lee, Eun Ryung
    Park, Seyoung
    SCIENTIFIC REPORTS, 2023, 13 (01)
  • [2] On inference in high-dimensional regression
    Battey, Heather S.
    Reid, Nancy
    JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 2023, 85 (01) : 149 - 175
  • [3] Statistical inference and large-scale multiple testing for high-dimensional regression models
    Cai, T. Tony
    Guo, Zijian
    Xia, Yin
    TEST, 2023, 32 (04) : 1135 - 1171
  • [4] HIGH-DIMENSIONAL GAUSSIAN COPULA REGRESSION: ADAPTIVE ESTIMATION AND STATISTICAL INFERENCE
    Cai, T. Tony
    Zhang, Linjun
    STATISTICA SINICA, 2018, 28 (02) : 963 - 993
  • [5] Statistical Inference for High-Dimensional Generalized Linear Models With Binary Outcomes
    Cai, T. Tony
    Guo, Zijian
    Ma, Rong
    JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2023, 118 (542) : 1319 - 1332
  • [6] On inference in high-dimensional logistic regression models with separated data
    Lewis, R. M.
    Battey, H. S.
    BIOMETRIKA, 2024, 111 (03) : 989 - 1011
  • [7] Statistical Inference for High-Dimensional Models via Recursive Online-Score Estimation
    Shi, Chengchun
    Song, Rui
    Lu, Wenbin
    Li, Runze
    JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2021, 116 (535) : 1307 - 1318
  • [8] Post-selection Inference of High-dimensional Logistic Regression Under Case-Control Design
    Lin, Yuanyuan
    Xie, Jinhan
    Han, Ruijian
    Tang, Niansheng
    JOURNAL OF BUSINESS & ECONOMIC STATISTICS, 2023, 41 (02) : 624 - 635
  • [9] Markov neighborhood regression for statistical inference of high-dimensional generalized linear models
    Sun, Lizhe
    Liang, Faming
    STATISTICS IN MEDICINE, 2022, 41 (20) : 4057 - 4078
  • [10] Optimal Estimation of Genetic Relatedness in High-Dimensional Linear Models
    Guo, Zijian
    Wang, Wanjie
    Cai, T. Tony
    Li, Hongzhe
    JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2019, 114 (525) : 358 - 369