Microbiome compositional data analysis for survival studies

被引:1
|
作者
Pujolassos, Meritxell [1 ]
Susin, Antoni [2 ]
Calle, M. Luz [1 ,3 ]
机构
[1] Univ Vic Cent Univ Catalunya, Fac Sci Technol & Engn, Biosci Dept, Vic 08500, Spain
[2] UPC Barcelona Tech, Math Dept, Barcelona 08034, Spain
[3] Inst Recerca & Innovacio Ciencies Vida & Salut Cat, Pharm Dept, Vic 08500, Spain
关键词
STATISTICAL-ANALYSIS; MODELS;
D O I
10.1093/nargab/lqae038
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
The growing interest in studying the relationship between the human microbiome and our health has also extended to time-to-event studies where researchers explore the connection between the microbiome and the occurrence of a specific event of interest. The analysis of microbiome obtained through high throughput sequencing techniques requires the use of specialized Compositional Data Analysis (CoDA) methods designed to accommodate its compositional nature. There is a limited availability of statistical tools for microbiome analysis that incorporate CoDA, and this is even more pronounced in the context of survival analysis. To fill this methodological gap, we present coda4microbiome for survival studies, a new methodology for the identification of microbial signatures in time-to-event studies. The algorithm implements an elastic-net penalized Cox regression model adapted to compositional covariates. We illustrate coda4microbiome algorithm for survival studies with a case study about the time to develop type 1 diabetes for non-obese diabetic mice. Our algorithm identified a bacterial signature composed of 21 genera associated with diabetes development. coda4microbiome for survival studies is integrated in the R package coda4microbiome as an extension of the existing functions for cross-sectional and longitudinal studies. Graphical Abstract
引用
收藏
页数:10
相关论文
共 50 条
  • [1] Variable selection in microbiome compositional data analysis
    Susin, Antoni
    Wang, Yiwen
    Cao, Kim-Anh Le
    Calle, M. Luz
    NAR GENOMICS AND BIOINFORMATICS, 2020, 2 (02)
  • [2] Microbiome, Metagenomics, and High-Dimensional Compositional Data Analysis
    Li, Hongzhe
    ANNUAL REVIEW OF STATISTICS AND ITS APPLICATION, VOL 2, 2015, 2 : 73 - 94
  • [3] Data Augmentation for Compositional Data: Advancing Predictive Models of the Microbiome
    Gordon-Rodriguez, Elliott
    Quinn, Thomas P.
    Cunninghham, John P.
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35, NEURIPS 2022, 2022,
  • [4] Bayesian compositional generalized linear models for analyzing microbiome data
    Zhang, Li
    Zhang, Xinyan
    Yi, Nengjun
    STATISTICS IN MEDICINE, 2024, 43 (01) : 141 - 155
  • [5] Robust logistic zero-sum regression for microbiome compositional data
    Monti, G. S.
    Filzmoser, P.
    ADVANCES IN DATA ANALYSIS AND CLASSIFICATION, 2022, 16 (02) : 301 - 324
  • [6] Mediation effect selection in high-dimensional and compositional microbiome data
    Zhang, Haixiang
    Chen, Jun
    Feng, Yang
    Wang, Chan
    Li, Huilin
    Liu, Lei
    STATISTICS IN MEDICINE, 2021, 40 (04) : 885 - 896
  • [7] Compositional Data Analysis Tutorial
    Smithson, Michael
    Broomell, Stephen B.
    PSYCHOLOGICAL METHODS, 2024, 29 (02) : 362 - 378
  • [8] Aitchison's Compositional Data Analysis 40 Years on: A Reappraisal
    Greenacre, Michael
    Grunsky, Eric
    Bacon-Shone, John
    Erb, Ionas
    Quinn, Thomas
    STATISTICAL SCIENCE, 2023, 38 (03) : 386 - 410
  • [9] A folded model for compositional data analysis
    Tsagris, Michail
    Stewart, Connie
    AUSTRALIAN & NEW ZEALAND JOURNAL OF STATISTICS, 2020, 62 (02) : 249 - 277
  • [10] Compositional Data Analysis of 16S rRNA Gene Sequencing Results from Hospital Airborne Microbiome Samples
    Perrone, Maria Rita
    Romano, Salvatore
    De Maria, Giuseppe
    Tundo, Paolo
    Bruno, Anna Rita
    Tagliaferro, Luigi
    Maffia, Michele
    Fragola, Mattia
    INTERNATIONAL JOURNAL OF ENVIRONMENTAL RESEARCH AND PUBLIC HEALTH, 2022, 19 (16)