STATISTICAL INFERENCE FOR GENETIC RELATEDNESS BASED ON HIGH-DIMENSIONAL LOGISTIC REGRESSION

被引:1
|
作者
Ma, Rong [1 ]
Guo, Zijian [2 ]
Cai, T. Tony [3 ]
Li, Hongzhe [4 ]
机构
[1] Stanford Univ, Dept Stat, Stanford, CA 02135 USA
[2] Rutgers State Univ, Dept Stat, Piscataway, NJ 08854 USA
[3] Univ Penn, Dept Stat & Data Sci, Philadelphia, PA 19104 USA
[4] Univ Penn, Perelman Sch Med, Dept Biostat Epidemiol & Informat, Philadelphia, PA 19104 USA
关键词
Confidence interval; debiasing methods; functional estimation; genetic correlation; hypothesis testing; GENERALIZED LINEAR-MODELS; CONFIDENCE-INTERVALS; HERITABILITY; ARCHITECTURE; METAANALYSIS; COVARIANCE; DISEASES; REGIONS; COMMON; TESTS;
D O I
10.5705/ss.202021.0386
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
We examine statistical inference for genetic relatedness between binary traits, based on individual -level genome-wide association data. Specifically, for high -dimensional logistic regression models, we define parameters characterizing the cross -trait genetic correlation, genetic covariance, and trait -specific genetic variance. We develop a novel weighted debiasing method for the logistic Lasso estimator and propose computationally efficient debiased estimators. Further more, we study the rates of convergence for these estimators and establish their asymptotic normality under mild conditions. Moreover, we construct confidence intervals and statistical tests for these parameters, and provide theoretical justifications for the methods, including the coverage probability and expected length of the confidence intervals, and the size and power of the proposed tests. Numerical studies under both modelgenerated data and simulated genetic data show the superiority of the proposed methods. By analyzing a real data set on autoimmune diseases, we demonstrate their ability to obtain novel insights about the shared genetic architecture between 10 pediatric autoimmune diseases.
引用
收藏
页码:1023 / 1043
页数:21
相关论文
共 50 条
  • [41] Two-Stage Online Debiased Lasso Estimation and Inference for High-Dimensional Quantile Regression with Streaming Data
    Peng, Yanjin
    Wang, Lei
    JOURNAL OF SYSTEMS SCIENCE & COMPLEXITY, 2024, 37 (03) : 1251 - 1270
  • [42] A LIKELIHOOD RATIO FRAMEWORK FOR HIGH-DIMENSIONAL SEMIPARAMETRIC REGRESSION
    Ning, Yang
    Zhao, Tianqi
    Liu, Han
    ANNALS OF STATISTICS, 2017, 45 (06) : 2299 - 2327
  • [43] Overview of High-Dimensional Measurement Error Regression Models
    Luo, Jingxuan
    Yue, Lili
    Li, Gaorong
    MATHEMATICS, 2023, 11 (14)
  • [44] Two-sample inference for high-dimensional Markov networks
    Kim, Byol
    Liu, Song
    Kolar, Mladen
    JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 2021, 83 (05) : 939 - 962
  • [45] In Defense of the Indefensible: A Very Naive Approach to High-Dimensional Inference
    Zhao, Sen
    Witten, Daniela
    Shojaie, Ali
    STATISTICAL SCIENCE, 2021, 36 (04) : 562 - 577
  • [46] Optimal statistical inference for individualized treatment effects in high-dimensional models
    Cai, Tianxi
    Cai, T. Tony
    Guo, Zijian
    JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 2021, 83 (04) : 669 - 719
  • [47] HIGH-DIMENSIONAL ASYMPTOTIC BEHAVIOR OF INFERENCE BASED ON GWAS SUMMARY STATISTIC
    Jiang, Jiming
    Jiang, Wei
    Paul, Debashis
    Zhang, Yiliang
    Zhao, Hongyu
    STATISTICA SINICA, 2023, 33 : 1555 - 1576
  • [48] On High-Dimensional Constrained Maximum Likelihood Inference
    Zhu, Yunzhang
    Shen, Xiaotong
    Pan, Wei
    JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2020, 115 (529) : 217 - 230
  • [49] High-dimensional inference in misspecified linear models
    Buehlmann, Peter
    van de Geer, Sara
    ELECTRONIC JOURNAL OF STATISTICS, 2015, 9 (01): : 1449 - 1473
  • [50] Inference for high-dimensional linear models with locally stationary error processes
    Xia, Jiaqi
    Chen, Yu
    Guo, Xiao
    JOURNAL OF TIME SERIES ANALYSIS, 2024, 45 (01) : 78 - 102