STATISTICAL INFERENCE FOR GENETIC RELATEDNESS BASED ON HIGH-DIMENSIONAL LOGISTIC REGRESSION

Citations: 1
Authors
Ma, Rong [1]
Guo, Zijian [2]
Cai, T. Tony [3]
Li, Hongzhe [4]
Affiliations
[1] Stanford Univ, Dept Stat, Stanford, CA 02135 USA
[2] Rutgers State Univ, Dept Stat, Piscataway, NJ 08854 USA
[3] Univ Penn, Dept Stat & Data Sci, Philadelphia, PA 19104 USA
[4] Univ Penn, Perelman Sch Med, Dept Biostat Epidemiol & Informat, Philadelphia, PA 19104 USA
Keywords
Confidence interval; debiasing methods; functional estimation; genetic correlation; hypothesis testing; GENERALIZED LINEAR-MODELS; CONFIDENCE-INTERVALS; HERITABILITY; ARCHITECTURE; METAANALYSIS; COVARIANCE; DISEASES; REGIONS; COMMON; TESTS;
DOI
10.5705/ss.202021.0386
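Chinese Library Classification (CLC) number
O21 [Probability Theory and Mathematical Statistics]; C8 [Statistics];
Subject classification codes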
020208 ; 070103 ; 0714 ;
Abstract
We examine statistical inference for genetic relatedness between binary traits, based on individual-level genome-wide association data. Specifically, for high-dimensional logistic regression models, we define parameters characterizing the cross-trait genetic correlation, genetic covariance, and trait-specific genetic variance. We develop a novel weighted debiasing method for the logistic Lasso estimator and propose computationally efficient debiased estimators. Furthermore, we study the rates of convergence for these estimators and establish their asymptotic normality under mild conditions. Moreover, we construct confidence intervals and statistical tests for these parameters, and provide theoretical justifications for the methods, including the coverage probability and expected length of the confidence intervals, and the size and power of the proposed tests. Numerical studies under both model-generated data and simulated genetic data show the superiority of the proposed methods. By analyzing a real data set on autoimmune diseases, we demonstrate their ability to obtain novel insights about the shared genetic architecture between 10 pediatric autoimmune diseases.
Pages: 1023-1043
Page count: 21