A statistical framework for biomarker discovery in metabolomic time course data

被引:34
作者
Berk, Maurice [1 ]
Ebbels, Timothy [2 ]
Montana, Giovanni [1 ]
机构
[1] Univ London Imperial Coll Sci Technol & Med, Dept Math, Stat Sect, London SW7 2AZ, England
[2] Univ London Imperial Coll Sci Technol & Med, Dept Surg & Canc, London SW7 2AZ, England
基金
英国惠康基金;
关键词
HYDRAZINE TOXICITY; METABONOMICS;
D O I
10.1093/bioinformatics/btr289
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: Metabolomics is the study of the complement of small molecule metabolites in cells, biofluids and tissues. Many metabolomic experiments are designed to compare changes observed over time under two experimental conditions or groups (e. g. a control and drug-treated group) with the goal of identifying discriminatory metabolites or biomarkers that characterize each condition. A common study design consists of repeated measurements taken on each experimental unit thus producing time courses of all metabolites. We describe a statistical framework for estimating time-varying metabolic profiles and their within-group variability and for detecting between-group differences. Specifically, we propose (i) a smoothing splines mixed effects (SME) model that treats each longitudinal measurement as a smooth function of time and (ii) an associated functional test statistic. Statistical significance is assessed by a non-parametric bootstrap procedure. Results: The methodology has been extensively evaluated using simulated data and has been applied to real nuclear magnetic resonance spectroscopy data collected in a preclinical toxicology study as part of a larger project lead by the COMET (Consortium for Metabonomic Toxicology). Our findings are compatible with the previously published studies.
引用
收藏
页码:1979 / 1985
页数:7
相关论文
共 31 条
  • [1] [Anonymous], 1994, Nonparametric Regression and Generalized Linear Models: A Roughness Penalty
  • [2] Batch statistical processing of 1H NMR-derived urinary spectral data
    Antti, H
    Bollard, ME
    Ebbels, T
    Keun, H
    Lindon, JC
    Nicholson, JK
    Holmes, E
    [J]. JOURNAL OF CHEMOMETRICS, 2002, 16 (8-10) : 461 - 468
  • [3] Comparing the continuous representation of time-series expression profiles to identify differentially expressed genes
    Bar-Joseph, Z
    Gerber, G
    Simon, L
    Gifford, DK
    Jaakkola, TS
    Jaakkola, TS
    [J]. PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2003, 100 (18) : 10146 - 10151
  • [4] CONTROLLING THE FALSE DISCOVERY RATE - A PRACTICAL AND POWERFUL APPROACH TO MULTIPLE TESTING
    BENJAMINI, Y
    HOCHBERG, Y
    [J]. JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 1995, 57 (01) : 289 - 300
  • [5] Comparative metabonomics of differential hydrazine toxicity in the rat and mouse
    Bollard, ME
    Keun, HC
    Beckonert, O
    Ebbels, TMD
    Antti, H
    Nicholls, AW
    Shockcor, JP
    Cantor, GH
    Stevens, G
    Lindon, JC
    Holmes, E
    Nicholson, JK
    [J]. TOXICOLOGY AND APPLIED PHARMACOLOGY, 2005, 204 (02) : 135 - 151
  • [6] Box G.E.P., 1994, Time Series Analysis, Forecasting and Control, V3rd
  • [7] Pointwise testing with functional data using the Westfall-Young randomization method
    Cox, Dennis D.
    Lee, Jong Soo
    [J]. BIOMETRIKA, 2008, 95 (03) : 621 - 634
  • [8] An anova test for functional data
    Cuevas, A
    Febrero, M
    Fraiman, R
    [J]. COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2004, 47 (01) : 111 - 122
  • [9] De Boor C., 1978, A practical guide to splines
  • [10] MAXIMUM LIKELIHOOD FROM INCOMPLETE DATA VIA EM ALGORITHM
    DEMPSTER, AP
    LAIRD, NM
    RUBIN, DB
    [J]. JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-METHODOLOGICAL, 1977, 39 (01): : 1 - 38