Pathway-based genetic association analysis for overdispersed count data

Authors
Liu, Yang [1]
Affiliations
[1] Wright State Univ, Dept Math & Stat, 3640 Colonel Glenn Hwy, Dayton, OH 45435 USA
Funding
National Institutes of Health (NIH), USA
Keywords
Overdispersion; association analysis; negative binomial regression; mixed effects; somatic mutations; differential expression analysis; rare-variant association; tests
DOI
10.1080/02664763.2025.2460073
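Chinese Library Classification (CLC) number
O21 [Probability and Mathematical Statistics]; C8 [Statistics]
Discipline classification codes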
020208 ; 070103 ; 0714 ;
Abstract
Overdispersion is a common phenomenon in genetic data, such as gene expression count data. In genetic association studies, it is important to investigate the association between the expression of a gene and a set of genetic variants from a pathway. However, existing approaches for pathway analysis are primarily designed for continuous and binary outcomes and are not applicable to overdispersed count data. In this paper, we propose a hierarchical approach to analyze the association between an overdispersed count response and a set of low-frequency genetic variants in negative binomial regression. We derive score-type test statistics for both fixed and random effects of genetic variants, and further introduce a novel procedure for efficiently combining these two statistics for global testing. Through simulation studies, we demonstrate that the proposed method tends to be more powerful than existing methods under a wide range of scenarios. Additionally, we apply the proposed method to a colorectal cancer study, demonstrating its power in identifying associations between gene expression and somatic mutations.
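To make the notion of overdispersion in the abstract concrete, the following is a minimal illustrative sketch and not the paper's method. It assumes the common NB2 parameterization of the negative binomial distribution, in which a count with mean mu and dispersion phi has variance mu + phi * mu^2; the abstract does not state which parameterization the author uses. The function names and the values mu = 5, phi = 0.5 are hypothetical, chosen only for illustration.

```python
import math
import random
import statistics


def sample_negative_binomial(mu, phi, rng):
    """Draw one count from a gamma-Poisson mixture (assumed NB2 form)."""
    # Gamma-distributed Poisson rate with mean mu and variance phi * mu**2
    # (shape = 1 / phi, scale = mu * phi).
    rate = rng.gammavariate(1.0 / phi, mu * phi)
    # Knuth's multiplication method for a Poisson draw; adequate for the
    # moderate rates used here, not for very large ones.
    threshold = math.exp(-rate)
    count, product = 0, rng.random()
    while product > threshold:
        count += 1
        product *= rng.random()
    return count


def simulate(mu=5.0, phi=0.5, n=20000, seed=0):
    """Return the sample mean and variance of n simulated counts."""
    rng = random.Random(seed)
    counts = [sample_negative_binomial(mu, phi, rng) for _ in range(n)]
    return statistics.fmean(counts), statistics.pvariance(counts)


mean, variance = simulate()
# Under a Poisson model the variance would equal the mean; under the
# assumed NB2 form it should be close to mu + phi * mu**2 = 17.5.
print(f"sample mean     = {mean:.2f}")
print(f"sample variance = {variance:.2f}")
```

In this sketch the sample variance is several times the sample mean, which is the pattern a Poisson model cannot capture and which motivates modelling such counts with negative binomial regression.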
Pages: 15
Related papers
50 in total
  • [41] Semi-parametric approach for modelling overdispersed count data with application to Industry 4.0
    Bonnini, S.
    Borghesi, M.
    Giacalone, M.
    SOCIO-ECONOMIC PLANNING SCIENCES, 2024, 95
  • [42] Simultaneous confidence intervals for comparing biodiversity indices estimated from overdispersed count data
    Scherer, Ralph
    Schaarschmidt, Frank
    Prescher, Sabine
    Priesnitz, Kai U.
    BIOMETRICAL JOURNAL, 2013, 55 (02) : 246 - 263
  • [43] Modelling multivariate, overdispersed count data with correlated and non-normal heterogeneity effects
    Kazemi, Iraj
    Hassanzadeh, Fatemeh
    SORT-STATISTICS AND OPERATIONS RESEARCH TRANSACTIONS, 2020, 44 (02) : 335 - 356
  • [44] Analyzing Overdispersed Antenatal Care Count Data in Bangladesh: Mixed Poisson Regression with Individual-Level Random Effects
    Hossain, Zakir
    Maria
    AUSTRIAN JOURNAL OF STATISTICS, 2021, 50 (04) : 78 - 90
  • [45] A variable clustering approach for overdispersed high-dimensional count data using a copula-based mixture model
    Brini, Alberto
    Manju, Abu
    van den Heuvel, Edwin R.
    COMMUNICATIONS IN STATISTICS-SIMULATION AND COMPUTATION, 2024,
  • [46] Revisiting the analysis pipeline for overdispersed Poisson and binomial data
    Lee, Woojoo
    Kim, Jeonghwan
    Lee, Donghwan
    JOURNAL OF APPLIED STATISTICS, 2023, 50 (07) : 1455 - 1476
  • [47] Modelling zero-truncated overdispersed antenatal health care count data of women in Bangladesh
    Hossain, Zakir
    Akter, Rozina
    Sultana, Nasrin
    Kabir, Enamul
    PLOS ONE, 2020, 15 (01):
  • [48] Multilevel modeling in single-case studies with zero-inflated and overdispersed count data
    Li, Haoran
    Luo, Wen
    Baek, Eunkyeng
    BEHAVIOR RESEARCH METHODS, 2024, 56 (04) : 2765 - 2781
  • [49] A non-parametric model to address overdispersed count response in a longitudinal data setting with missingness
    Zhang, Hui
    He, Hua
    Lu, Naiji
    Zhu, Liang
    Zhang, Bo
    Zhang, Zhiwei
    Tang, Li
    STATISTICAL METHODS IN MEDICAL RESEARCH, 2017, 26 (03) : 1461 - 1475
  • [50] Modeling overdispersed or underdispersed count data with generalized Poisson integer-valued autoregressive processes
    Yang, Kai
    Kang, Yao
    Wang, Dehui
    Li, Han
    Diao, Yajing
    METRIKA, 2019, 82 (07) : 863 - 889