An additive Gaussian process regression model for interpretable non-parametric analysis of longitudinal data

被引:49
作者
Cheng, Lu [1 ,2 ]
Ramchandran, Siddharth [1 ]
Vatanen, Tommi [3 ,4 ]
Lietzen, Niina [5 ,6 ]
Lahesmaa, Riitta [5 ,6 ]
Vehtari, Aki [1 ]
Lahdesmaki, Harri [1 ]
机构
[1] Aalto Univ, Dept Comp Sci, Sch Sci, FI-00076 Aalto, Finland
[2] Cardiff Univ, Sch Biosci, Organisms & Environm Div, Microbiomes Microbes & Informat Grp, Cardiff CF10 3AX, S Glam, Wales
[3] Broad Inst MIT & Harvard, Cambridge, MA 02142 USA
[4] Univ Auckland, Liggins Inst, Auckland 1023, New Zealand
[5] Univ Turku, Turku Ctr Biotechnol, FI-20520 Turku, Finland
[6] Abo Akad Univ, FI-20520 Turku, Finland
基金
芬兰科学院;
关键词
INFERENCE;
D O I
10.1038/s41467-019-09785-8
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Biomedical research typically involves longitudinal study designs where samples from individuals are measured repeatedly over time and the goal is to identify risk factors (covariates) that are associated with an outcome value. General linear mixed effect models are the standard workhorse for statistical analysis of longitudinal data. However, analysis of longitudinal data can be complicated for reasons such as difficulties in modelling correlated outcome values, functional (time-varying) covariates, nonlinear and non-stationary effects, and model inference. We present LonGP, an additive Gaussian process regression model that is specifically designed for statistical analysis of longitudinal data, which solves these commonly faced challenges. LonGP can model time-varying random effects and non-stationary signals, incorporate multiple kernel learning, and provide interpretable results for the effects of individual covariates and their interactions. We demonstrate LonGP's performance and accuracy by analysing various simulated and real longitudinal -omics datasets.
引用
收藏
页数:11
相关论文
共 29 条
  • [1] [Anonymous], 1999, Behaviormetrika
  • [2] [Anonymous], 2018, P INT C ART INT STAT
  • [3] [Anonymous], 2013, P 30 INT C MACH LEAR
  • [4] Proteomics analysis of insulin secretory granules
    Brunner, Yannick
    Coute, Yohann
    Iezzi, Mariella
    Foti, Michelangelo
    Fukuda, Mitsonuri
    Hochstrasser, Denis F.
    Wollheim, Claes B.
    Sanchez, Jean-Charles
    [J]. MOLECULAR & CELLULAR PROTEOMICS, 2007, 6 (06) : 1007 - 1017
  • [5] Advances in Analysis of Longitudinal Data
    Gibbons, Robert D.
    Hedeker, Donald
    DuToit, Stephen
    [J]. ANNUAL REVIEW OF CLINICAL PSYCHOLOGY, VOL 6, 2010, 6 : 79 - 107
  • [6] Gilboa Elad, 2013, P 30 INT C MACH LEAR
  • [7] Heinonen M, 2016, JMLR WORKSH CONF PRO, V51, P732
  • [8] Characterization and non-parametric modeling of the developing serum proteome during infancy and early childhood
    Lietzen, Niina
    Cheng, Lu
    Moulder, Robert
    Siljander, Heli
    Laajala, Essi
    Harkonen, Taina
    Peet, Aleksandr
    Vehtari, Aki
    Tillmann, Vallo
    Knip, Mikael
    Lahdesmaki, Harri
    Lahesmaa, Riitta
    [J]. SCIENTIFIC REPORTS, 2018, 8
  • [9] Temporal expression profiling of plasma proteins reveals oxidative stress in early stages of Type 1 Diabetes progression
    Liu, Chih-Wei
    Bramer, Lisa
    Webb-Robertson, Bobbie-Jo
    Waugh, Kathleen
    Rewers, Marian J.
    Zhang, Qibin
    [J]. JOURNAL OF PROTEOMICS, 2018, 172 : 100 - 110
  • [10] Liu JZ, 2017, ADV NEUR IN, V30