Detecting sparse microbial association signals adaptively from longitudinal microbiome data based on generalized estimating equations

被引:1
作者
Sun, Han [2 ]
Huang, Xiaoyun [3 ]
Huo, Ban [1 ]
Tan, Yuting [2 ]
He, Tingting [1 ]
Jiang, Xingpeng [1 ,4 ]
机构
[1] Cent China Normal Univ, Sch Comp, Wuhan 430079, Peoples R China
[2] Cent China Normal Univ, Sch Math & Stat, Wuhan, Peoples R China
[3] Cent China Normal Univ, Collaborat & Innovat Ctr Educ Technol, Wuhan, Peoples R China
[4] Hubei Prov Key Lab Artificial Intelligence & Smar, Wuhan, Peoples R China
基金
中国国家自然科学基金;
关键词
microbiome-based association test; longitudinal microbiome data; generalized estimating equations; sparse microbial association signals; higher criticism; HIGHER CRITICISM; GUT MICROBIOME; POWERFUL; REGRESSION; FRAMEWORK; CANCER; MODEL;
D O I
10.1093/bib/bbac149
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
The association between the compositions of microbial communities and various host phenotypes is an important research topic. Microbiome association research addresses multiple domains, such as human disease and diet. Statistical methods for testing microbiome-phenotype associations have been studied recently to determine their ability to assess longitudinal microbiome data. However, existing methods fail to detect sparse association signals in longitudinal microbiome data. In this paper, we developed a novel method, namely aGEEMIHC, which is a data-driven adaptive microbiome higher criticism analysis based on generalized estimating equations to detect sparse microbial association signals from longitudinal microbiome data. aGEEMiHC adopts generalized estimating equations framework that fully considers the correlation among different observations from the same subject in longitudinal data. To be robust to diverse correlation structures for longitudinal data, aGEEMiHC integrates multiple microbiome higher criticism analyses based on generalized estimating equations with different working correlation structures. Extensive simulation experiments demonstrate that aGEEMiHC can control the type I error correctly and achieve superior performance according to a statistical power comparison. We also applied it to longitudinal microbiome data with various types of host phenotypes to demonstrate the stability of our method. aGEEMiHC is also utilized for real longitudinal microbiome data, and we found a significant association between the gut microbiome and Crohn's disease. In addition, our method ranks the significant factors associated with the host phenotype to provide potential biomarkers.
引用
收藏
页数:14
相关论文
共 73 条
  • [1] AN INTRODUCTION TO KERNEL AND NEAREST-NEIGHBOR NONPARAMETRIC REGRESSION
    ALTMAN, NS
    [J]. AMERICAN STATISTICIAN, 1992, 46 (03) : 175 - 185
  • [2] Anderson MJ, 2001, AUSTRAL ECOL, V26, P32, DOI 10.1111/j.1442-9993.2001.01070.pp.x
  • [3] GLOBAL TESTING UNDER SPARSE ALTERNATIVES: ANOVA, MULTIPLE COMPARISONS AND THE HIGHER CRITICISM
    Arias-Castro, Ery
    Candes, Emmanuel J.
    Plan, Yaniv
    [J]. ANNALS OF STATISTICS, 2011, 39 (05) : 2533 - 2556
  • [4] Salivary microbiota reflects changes in gut microbiota in cirrhosis with hepatic encephalopathy
    Bajaj, Jasmohan S.
    Betrapally, Naga S.
    Hylemon, Phillip B.
    Heuman, Douglas M.
    Daita, Kalyani
    White, Melanie B.
    Unser, Ariel
    Thacker, Leroy R.
    Sanyal, Arun J.
    Kang, Dae Joong
    Sikaroodi, Masoumeh
    Gillevet, Patrick M.
    [J]. HEPATOLOGY, 2015, 62 (04) : 1260 - 1271
  • [5] The Generalized Higher Criticism for Testing SNP-Set Effects in Genetic Association Studies
    Barnett, Ian
    Mukherjee, Rajarshi
    Lin, Xihong
    [J]. JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2017, 112 (517) : 64 - 76
  • [6] Analytical p-value calculation for the higher criticism test in finite-d problems
    Barnett, Ian J.
    Lin, Xihong
    [J]. BIOMETRIKA, 2014, 101 (04) : 964 - 970
  • [7] The Role of the Microbiome in Cancer Development and Therapy
    Bhatt, Aadra P.
    Redinbo, Matthew R.
    Bultman, Scott J.
    [J]. CA-A CANCER JOURNAL FOR CLINICIANS, 2017, 67 (04) : 327 - 344
  • [8] Random forests
    Breiman, L
    [J]. MACHINE LEARNING, 2001, 45 (01) : 5 - 32
  • [9] Chaganty N.R., 2004, J R STAT SOC, V66, P851
  • [10] Disordered Microbial Communities in the Upper Respiratory Tract of Cigarette Smokers
    Charlson, Emily S.
    Chen, Jun
    Custers-Allen, Rebecca
    Bittinger, Kyle
    Li, Hongzhe
    Sinha, Rohini
    Hwang, Jennifer
    Bushman, Frederic D.
    Collman, Ronald G.
    [J]. PLOS ONE, 2010, 5 (12):