Pathway-based genetic association analysis for overdispersed count data

Cited by: 0
Author
Liu, Yang [1]
Affiliation
[1] Wright State Univ, Dept Math & Stat, 3640 Colonel Glenn Hwy, Dayton, OH 45435 USA
Funding
National Institutes of Health (USA)
Keywords
Overdispersion; association analysis; negative binomial regression; mixed effects; somatic mutations; DIFFERENTIAL EXPRESSION ANALYSIS; RARE-VARIANT ASSOCIATION; TESTS;
DOI
10.1080/02664763.2025.2460073
Chinese Library Classification
O21 [Probability theory and mathematical statistics]; C8 [Statistics]
Subject classification codes
020208 ; 070103 ; 0714 ;
Abstract
Overdispersion is a common phenomenon in genetic data, such as gene expression count data. In genetic association studies, it is important to investigate the association between a gene expression and a set of genetic variants from a pathway. However, existing approaches for pathway analysis are primarily designed for continuous and binary outcomes and are not applicable to overdispersed count data. In this paper, we propose a hierarchical approach to analyze the association between an overdispersed count response and a set of low-frequency genetic variants in negative binomial regression. We derive score-type test statistics for both fixed and random effects of genetic variants, and further introduce a novel procedure for efficiently combining these two statistics for global testing. Through simulation studies, we demonstrate that the proposed method tends to be more powerful than existing methods under a wide range of scenarios. Additionally, we apply the proposed method to a colorectal cancer study, demonstrating its power in identifying associations between gene expression and somatic mutations.
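To make the setting concrete, here is a minimal Python sketch of a burden-type score test for a single fixed effect in a negative binomial model. It is not the paper's test statistics or combination procedure, which the abstract does not spell out. The sketch assumes an intercept-only null model, a method-of-moments dispersion estimate, and variants collapsed into one burden score per sample; the function names `nb_burden_score_test` and `simulate` and all parameter values are hypothetical.

```python
import math
import numpy as np


def nb_burden_score_test(y, G):
    """Score test of H0: beta = 0 in log(mu_i) = b0 + beta * burden_i.

    y : overdispersed counts, shape (n,)
    G : variant indicator matrix, shape (n, m); rows are samples
    Returns (statistic, p_value, dispersion_estimate).
    """
    y = np.asarray(y, dtype=float)
    burden = np.asarray(G, dtype=float).sum(axis=1)  # collapse variants per sample
    if y.size < 2 or y.size != burden.size:
        raise ValueError("y and G must have the same number (>= 2) of samples")

    mu = y.mean()  # null MLE of the mean in an intercept-only model
    centered = burden - burden.mean()
    ss_burden = np.sum(centered ** 2)
    if mu <= 0 or ss_burden == 0:
        raise ValueError("need a positive mean count and a non-constant burden")

    # Method-of-moments dispersion (an assumption; truncated at zero = Poisson).
    alpha = max((y.var(ddof=1) - mu) / mu ** 2, 0.0)
    var_y = mu * (1.0 + alpha * mu)  # NB2 variance: mu + alpha * mu^2

    score = np.sum(centered * (y - mu))
    stat = score ** 2 / (var_y * ss_burden)
    p_value = math.erfc(math.sqrt(stat / 2.0))  # upper tail of chi-square(1)
    return stat, p_value, alpha


def simulate(n=500, m=10, maf=0.03, beta=0.8, alpha=0.5, seed=0):
    """Draw gamma-Poisson (negative binomial) counts with a burden effect."""
    rng = np.random.default_rng(seed)
    G = rng.binomial(1, maf, size=(n, m))  # low-frequency variant indicators
    mu = np.exp(1.0 + beta * G.sum(axis=1))
    lam = rng.gamma(shape=1.0 / alpha, scale=alpha * mu)  # mean mu, var alpha*mu^2
    return rng.poisson(lam), G


if __name__ == "__main__":
    y, G = simulate()
    stat, p_value, alpha_hat = nb_burden_score_test(y, G)
    print(f"score statistic = {stat:.2f}, p = {p_value:.3g}, alpha = {alpha_hat:.2f}")
```

Collapsing variants into a burden score is only one way to test a fixed effect. A random-effects (variance-component) test would instead be sensitive to effects of mixed sign, which is the usual motivation for combining the two kinds of statistics.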
Pages: 15
Related papers
50 in total
  • [1] ANALYSIS OF OVERDISPERSED COUNT DATA: AN APPLICATION ON ACAR (ACARINA) COUNTS
    Akkol, Suna
    Denizhan, Evsel
    COMPTES RENDUS DE L'ACADEMIE BULGARE DES SCIENCES, 2016, 69 (08) : 1091 - 1100
  • [2] Regression to the mean for overdispersed count data
    Iftikhar, Kiran
    Khan, Manzoor
    Olivier, Jake
    JOURNAL OF STATISTICAL PLANNING AND INFERENCE, 2025, 234
  • [3] Distributions to model overdispersed count data
    Coly, Sylvain
    Yao, Anne-Françoise
    Abrial, David
    Charras-Garrido, Myriam
    JOURNAL OF THE SFDS, 2016, 157 (02) : 39 - 63
  • [4] A generalized model for overdispersed count data
    Okamura, Hiroshi
    Punt, Andre E.
    Amano, Tatsuya
    POPULATION ECOLOGY, 2012, 54 (03) : 467 - 474
  • [5] Dealing with overdispersed count data in applied ecology
    Richards, Shane A.
    JOURNAL OF APPLIED ECOLOGY, 2008, 45 (01) : 218 - 227
  • [6] Flexible models for overdispersed and underdispersed count data
    Cahoy, Dexter
    Di Nardo, Elvira
    Polito, Federico
    STATISTICAL PAPERS, 2021, 62 (06) : 2969 - 2990
  • [7] Estimation of hurdle models for overdispersed count data
    Farbmacher, Helmut
    STATA JOURNAL, 2011, 11 (01) : 82 - 94
  • [8] Efficient regression modeling for correlated and overdispersed count data
    Niu, Xiaomeng
    Cho, Hyunkeun Ryan
    COMMUNICATIONS IN STATISTICS-THEORY AND METHODS, 2019, 48 (24) : 6005 - 6018
  • [9] A New Regression Model for the Analysis of Overdispersed and Zero-Modified Count Data
    Bertoli, Wesley
    Conceicao, Katiane S.
    Andrade, Marinho G.
    Louzada, Francisco
    ENTROPY, 2021, 23 (06)