Two-way analysis of high-dimensional collinear data

被引:0
|
作者
Ilkka Huopaniemi
Tommi Suvitaival
Janne Nikkilä
Matej Orešič
Samuel Kaski
机构
[1] Helsinki University of Technology (TKK),Department of Information and Computer Science
[2] University of Helsinki,Department of Basic Veterinary Sciences (Division of Microbiology and Epidemiology), Faculty of Veterinary Medicine
[3] VTT Technical Research Centre of Finland (VTT),undefined
来源
Data Mining and Knowledge Discovery | 2009年 / 19卷
关键词
ANOVA; Factor analysis; Hierarchical model; Metabolomics; Multi-way analysis; Small sample-size;
D O I
暂无
中图分类号
学科分类号
摘要
We present a Bayesian model for two-way ANOVA-type analysis of high-dimensional, small sample-size datasets with highly correlated groups of variables. Modern cellular measurement methods are a main application area; typically the task is differential analysis between diseased and healthy samples, complicated by additional covariates requiring a multi-way analysis. The main complication is the combination of high dimensionality and low sample size, which renders classical multivariate techniques useless. We introduce a hierarchical model which does dimensionality reduction by assuming that the input variables come in similarly-behaving groups, and performs an ANOVA-type decomposition for the set of reduced-dimensional latent variables. We apply the methods to study lipidomic profiles of a recent large-cohort human diabetes study.
引用
收藏
页码:261 / 276
页数:15
相关论文
共 50 条
  • [1] Two-Way Analysis of High-Dimensional Collinear Data
    Huopaniemi, Ilkka
    Suvitaival, Tommi
    Nikkila, Janne
    Oresic, Matej
    Kaski, Samuel
    MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES, PT I, 2009, 5781 : 33 - 33
  • [2] Two-way analysis of high-dimensional collinear data
    Huopaniemi, Ilkka
    Suvitaival, Tommi
    Nikkila, Janne
    Oresic, Matej
    Kaski, Samuel
    DATA MINING AND KNOWLEDGE DISCOVERY, 2009, 19 (02) : 261 - 276
  • [3] CANONICAL VARIATE ANALYSIS OF HIGH-DIMENSIONAL SPECTRAL DATA
    KIIVERI, HT
    TECHNOMETRICS, 1992, 34 (03) : 321 - 331
  • [5] Factor analysis of high-dimensional heterogeneous data for structural characterization
    Machado, AMC
    Gee, JC
    Campos, MFM
    MEDICAL IMAGING: 2001: IMAGE PROCESSING, PTS 1-3, 2001, 4322 : 995 - 1004
  • [6] Adaptive Testing for High-Dimensional Data
    Zhang, Yangfan
    Wang, Runmin
    Shao, Xiaofeng
    JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2025,
  • [7] Interpretable Approximation of High-Dimensional Data
    Potts, Daniel
    Schmischke, Michael
    SIAM JOURNAL ON MATHEMATICS OF DATA SCIENCE, 2021, 3 (04): : 1301 - 1323
  • [8] Robustness Analysis and Forecasting of High-Dimensional Financial Time Series Data
    Li, Junchen
    Song, Shuai
    Bian, Ce
    COMPUTATIONAL ECONOMICS, 2025,
  • [9] A Visualization Tool for One- and Two-Way Analysis of Variance
    Sturm-Beiss, Rachel
    JOURNAL OF STATISTICS EDUCATION, 2005, 13 (01):
  • [10] Flexible High-Dimensional Unsupervised Learning with Missing Data
    Wei, Yuhong
    Tang, Yang
    McNicholas, Paul D.
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2020, 42 (03) : 610 - 621