Two-way analysis of high-dimensional collinear data

Authors
Ilkka Huopaniemi
Tommi Suvitaival
Janne Nikkilä
Matej Orešič
Samuel Kaski
Affiliations
[1] Helsinki University of Technology (TKK), Department of Information and Computer Science
[2] University of Helsinki, Department of Basic Veterinary Sciences (Division of Microbiology and Epidemiology), Faculty of Veterinary Medicine
[3] VTT Technical Research Centre of Finland (VTT)
Source
Data Mining and Knowledge Discovery | 2009 / Vol. 19
Keywords
ANOVA; Factor analysis; Hierarchical model; Metabolomics; Multi-way analysis; Small sample-size
Abstract
We present a Bayesian model for two-way ANOVA-type analysis of high-dimensional, small sample-size datasets with highly correlated groups of variables. Modern cellular measurement methods are a main application area; typically the task is differential analysis between diseased and healthy samples, complicated by additional covariates requiring a multi-way analysis. The main complication is the combination of high dimensionality and low sample size, which renders classical multivariate techniques useless. We introduce a hierarchical model which does dimensionality reduction by assuming that the input variables come in similarly-behaving groups, and performs an ANOVA-type decomposition for the set of reduced-dimensional latent variables. We apply the methods to study lipidomic profiles of a recent large-cohort human diabetes study.
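The two steps named in the abstract, reducing groups of similarly-behaving variables to latent variables and then decomposing those latent variables ANOVA-style, can be illustrated with a small sketch. This is not the authors' Bayesian hierarchical model: it assumes the variable-to-group assignment is already known (`cluster_of_var`), estimates each latent variable by a plain group average, and uses a classical cell-means decomposition on simulated data. All names, sizes, and effect values below are hypothetical.

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical balanced 2x2 design: factor a (e.g. disease), factor b (e.g. a covariate).
n_per_cell, n_clusters, vars_per_cluster = 5, 3, 20
a = np.repeat([0, 0, 1, 1], n_per_cell)
b = np.repeat([0, 1, 0, 1], n_per_cell)
n = a.size  # 20 samples, far fewer than the 60 observed variables

# Simulated latent variables: cluster 0 responds to a, cluster 2 responds to b.
effect_a = np.array([2.0, 0.0, 0.0])
effect_b = np.array([0.0, 0.0, -1.5])
z = a[:, None] * effect_a + b[:, None] * effect_b
z = z + 0.3 * rng.standard_normal((n, n_clusters))

# Each observed variable is a noisy copy of its cluster's latent variable.
# Assumption: the grouping is known here; the paper's model treats it as unknown.
cluster_of_var = np.repeat(np.arange(n_clusters), vars_per_cluster)
X = z[:, cluster_of_var] + 0.5 * rng.standard_normal((n, cluster_of_var.size))

# Step 1: dimensionality reduction by averaging the variables in each group.
z_hat = np.stack(
    [X[:, cluster_of_var == k].mean(axis=1) for k in range(n_clusters)], axis=1
)

def two_way_effects(latent, a, b):
    """Cell-means decomposition: grand mean, main effects, interaction."""
    levels_a, levels_b = np.unique(a), np.unique(b)
    cell = np.array(
        [[latent[(a == i) & (b == j)].mean(axis=0) for j in levels_b]
         for i in levels_a]
    )  # shape: (levels of a, levels of b, latent variables)
    mu = cell.mean(axis=(0, 1))
    alpha = cell.mean(axis=1) - mu
    beta = cell.mean(axis=0) - mu
    inter = cell - mu - alpha[:, None, :] - beta[None, :, :]
    return mu, alpha, beta, inter

# Step 2: ANOVA-type decomposition on the reduced-dimensional latent variables.
mu, alpha, beta, inter = two_way_effects(z_hat, a, b)
print("effect of a per latent variable:", alpha[1] - alpha[0])
print("effect of b per latent variable:", beta[1] - beta[0])
```

Working on 3 group averages instead of 60 correlated variables is what makes the decomposition feasible with 20 samples in this toy setting; a Bayesian treatment as described in the abstract would additionally quantify uncertainty in both the grouping and the effects.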
Pages: 261-276
Number of pages: 15
Related papers
50 in total
  • [31] High-dimensional maximum marginal likelihood item factor analysis by adaptive quadrature
    Stephen Schilling
    R. Darrell Bock
    Psychometrika, 2005, 70 : 533 - 555
  • [32] Robust factor modelling for high-dimensional time series: An application to air pollution data
    Reisen, Valderio Anselmo
    Sgrancio, Adriano Marcio
    Levy-Leduc, Celine
    Bondon, Pascal
    Monte, Edson Zambon
    Aranda Cotta, Higor Henrique
    Ziegelmann, Flavio Augusto
    APPLIED MATHEMATICS AND COMPUTATION, 2019, 346 : 842 - 852
  • [33] High-dimensional maximum marginal likelihood item factor analysis by adaptive quadrature
    Schilling, S
    Bock, RD
    PSYCHOMETRIKA, 2005, 70 (03) : 533 - 555
  • [34] Artificial neural networks compared to factor analysis for low-dimensional classification of high-dimensional body fat topography data of healthy and diabetic subjects
    Tafeit, E
    Möller, R
    Sudi, K
    Reibnegger, G
    COMPUTERS AND BIOMEDICAL RESEARCH, 2000, 33 (05) : 365 - 374
  • [35] MS-electronic nose performance improvement using the retention time dimension and two-way and three-way data processing methods
    Burian, Cosmin
    Brezmes, Jesus
    Vinaixa, Maria
    Canellas, Nicolau
    Llobet, Eduard
    Vilanova, Xavier
    Correig, Xavier
    SENSORS AND ACTUATORS B-CHEMICAL, 2010, 143 (02) : 759 - 768
  • [36] Global sensitivity analysis for the Rothermel model based on high-dimensional model representation
    Liu, Yaning
    Hussaini, Yousuff
    Oekten, Giray
    CANADIAN JOURNAL OF FOREST RESEARCH, 2015, 45 (11) : 1474 - 1479
  • [37] TWO-SAMPLE TESTING OF HIGH-DIMENSIONAL LINEAR REGRESSION COEFFICIENTS VIA COMPLEMENTARY SKETCHING
    Gao, Fengnan
    Wang, Tengyao
    ANNALS OF STATISTICS, 2022, 50 (05) : 2950 - 2972
  • [38] Approximate tests in unbalanced two-way random models without interaction
    Guven, Bilgehan
    STATISTICAL PAPERS, 2012, 53 (03) : 753 - 766
  • [39] Testing for trend in two-way crossed effects model under heteroscedasticity
    Mondal, Anjana
    Sattler, Paavo
    Kumar, Somesh
    TEST, 2023, 32 (04) : 1434 - 1458
  • [40] A Clustering-based Test for Nonadditivity in an Unreplicated Two-way Layout
    Malik, W. A.
    Moehring, J.
    Piepho, H. P.
    COMMUNICATIONS IN STATISTICS-SIMULATION AND COMPUTATION, 2016, 45 (02) : 660 - 670