Reflections on univariate and multivariate analysis of metabolomics data

Cited by: 443
Authors
Saccenti, Edoardo [1, 2]
Hoefsloot, Huub C. J. [1, 2]
Smilde, Age K. [1, 2]
Westerhuis, Johan A. [1, 2]
Hendriks, Margriet M. W. B. [2, 3]
Affiliations
[1] Univ Amsterdam, Swammerdam Inst Life Sci, Biosyst Data Anal Grp, NL-1098 XH Amsterdam, Netherlands
[2] Netherlands Metabol Ctr, NL-2333 CL Leiden, Netherlands
[3] Leiden Acad Ctr Drug Res, NL-2333 CL Leiden, Netherlands
Keywords
Univariate analysis; Multivariate analysis; Hypothesis testing; Multiple test correction; Overfitting; Consistency at large; NMR-BASED METABOLOMICS; STATISTICAL VALIDATION; DISCRIMINANT-ANALYSIS; SHRUNKEN CENTROIDS; POWERFUL APPROACH; FEATURE-SELECTION; HIGHER CRITICISM; GENE-EXPRESSION; DATA SETS; CLASSIFICATION;
DOI
10.1007/s11306-013-0598-6
Chinese Library Classification
R5 [Internal Medicine]
Subject classification code
1002; 100201
Abstract
Metabolomics experiments usually result in a large quantity of data. Univariate and multivariate analysis techniques are routinely used to extract relevant information from the data with the aim of providing biological knowledge on the problem studied. Despite the fact that statistical tools like the t test, analysis of variance, principal component analysis, and partial least squares discriminant analysis constitute the backbone of the statistical part of the vast majority of metabolomics papers, it seems that many basic but rather fundamental questions are still often asked, like: Why do the results of univariate and multivariate analyses differ? Why apply univariate methods if you have already applied a multivariate method? Why if I do not see something univariately I see something multivariately? In the present paper we address some aspects of univariate and multivariate analysis, with the scope of clarifying in simple terms the main differences between the two approaches. Applications of the t test, analysis of variance, principal component analysis and partial least squares discriminant analysis will be shown on both real and simulated metabolomics data examples to provide an overview on fundamental aspects of univariate and multivariate methods.
Pages: 361-374
Page count: 14