A self-knowledge distillation-driven CNN-LSTM model for predicting disease outcomes using longitudinal microbiome data

被引:5
作者
Fung, Daryl L. X. [1 ]
Li, Xu [2 ]
Leung, Carson K. [1 ]
Hu, Pingzhao [1 ,2 ,3 ,4 ]
机构
[1] Univ Manitoba, Dept Comp Sci, Winnipeg, MB R3T 2N2, Canada
[2] Univ Toronto, Dalla Lana Sch Publ Hlth, Div Biostat, Toronto, ON M5T 3M7, Canada
[3] Univ Manitoba, Dept Biochem & Med Genet, Winnipeg, MB R3E 0J9, Canada
[4] Western Univ, Dept Biochem, London, ON N6A 5C1, Canada
来源
BIOINFORMATICS ADVANCES | 2023年 / 3卷 / 01期
基金
加拿大自然科学与工程研究理事会;
关键词
D O I
10.1093/bioadv/vbad059
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Motivation Human microbiome is complex and highly dynamic in nature. Dynamic patterns of the microbiome can capture more information than single point inference as it contains the temporal changes information. However, dynamic information of the human microbiome can be hard to be captured due to the complexity of obtaining the longitudinal data with a large volume of missing data that in conjunction with heterogeneity may provide a challenge for the data analysis.Results We propose using an efficient hybrid deep learning architecture convolutional neural network-long short-term memory, which combines with self-knowledge distillation to create highly accurate models to analyze the longitudinal microbiome profiles to predict disease outcomes. Using our proposed models, we analyzed the datasets from Predicting Response to Standardized Pediatric Colitis Therapy (PROTECT) study and DIABIMMUNE study. We showed the significant improvement in the area under the receiver operating characteristic curve scores, achieving 0.889 and 0.798 on PROTECT study and DIABIMMUNE study, respectively, compared with state-of-the-art temporal deep learning models. Our findings provide an effective artificial intelligence-based tool to predict disease outcomes using longitudinal microbiome profiles from collected patients.Availability and implementation The data and source code can be accessed at https://github.com/darylfung96/UC-disease-TL.
引用
收藏
页数:10
相关论文
共 29 条
  • [1] Adebayo J, 2018, ADV NEUR IN, V31
  • [2] On Pixel-Wise Explanations for Non-Linear Classifier Decisions by Layer-Wise Relevance Propagation
    Bach, Sebastian
    Binder, Alexander
    Montavon, Gregoire
    Klauschen, Frederick
    Mueller, Klaus-Robert
    Samek, Wojciech
    [J]. PLOS ONE, 2015, 10 (07):
  • [3] Brendel W., 2019, P INT C LEARN REPR
  • [4] Data, disease and diplomacy: GISAID's innovative contribution to global health
    Elbe, Stefan
    Buckland-Merrett, Gemma
    [J]. GLOBAL CHALLENGES, 2017, 1 (01) : 33 - 46
  • [5] Nextstrain: real-time tracking of pathogen evolution
    Hadfield, James
    Megill, Colin
    Bell, Sidney M.
    Huddleston, John
    Potter, Barney
    Callender, Charlton
    Sagulenko, Pavel
    Bedford, Trevor
    Neher, Richard A.
    [J]. BIOINFORMATICS, 2018, 34 (23) : 4121 - 4123
  • [6] He X, 2005, P ADV NEUR INF PROC, P507
  • [7] Hooker S, 2019, ADV NEUR IN, V32
  • [8] MAFFT: a novel method for rapid multiple sequence alignment based on fast Fourier transform
    Katoh, K
    Misawa, K
    Kuma, K
    Miyata, T
    [J]. NUCLEIC ACIDS RESEARCH, 2002, 30 (14) : 3059 - 3066
  • [9] The art of using t-SNE for single-cell transcriptomics
    Kobak, Dmitry
    Berens, Philipp
    [J]. NATURE COMMUNICATIONS, 2019, 10 (1)
  • [10] Quality assessment of dimensionality reduction: Rank-based criteria
    Lee, John A.
    Verleysen, Michel
    [J]. NEUROCOMPUTING, 2009, 72 (7-9) : 1431 - 1443