Two-way analysis of high-dimensional collinear data

被引:0
|
作者
Ilkka Huopaniemi
Tommi Suvitaival
Janne Nikkilä
Matej Orešič
Samuel Kaski
机构
[1] Helsinki University of Technology (TKK),Department of Information and Computer Science
[2] University of Helsinki,Department of Basic Veterinary Sciences (Division of Microbiology and Epidemiology), Faculty of Veterinary Medicine
[3] VTT Technical Research Centre of Finland (VTT),undefined
来源
Data Mining and Knowledge Discovery | 2009年 / 19卷
关键词
ANOVA; Factor analysis; Hierarchical model; Metabolomics; Multi-way analysis; Small sample-size;
D O I
暂无
中图分类号
学科分类号
摘要
We present a Bayesian model for two-way ANOVA-type analysis of high-dimensional, small sample-size datasets with highly correlated groups of variables. Modern cellular measurement methods are a main application area; typically the task is differential analysis between diseased and healthy samples, complicated by additional covariates requiring a multi-way analysis. The main complication is the combination of high dimensionality and low sample size, which renders classical multivariate techniques useless. We introduce a hierarchical model which does dimensionality reduction by assuming that the input variables come in similarly-behaving groups, and performs an ANOVA-type decomposition for the set of reduced-dimensional latent variables. We apply the methods to study lipidomic profiles of a recent large-cohort human diabetes study.
引用
收藏
页码:261 / 276
页数:15
相关论文
共 50 条
  • [21] Multimodal Data Fusion in High-Dimensional Heterogeneous Datasets Via Generative Models
    Yilmaz, Yasin
    Aktukmak, Mehmet
    Hero, Alfred
    IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2021, 69 : 5175 - 5188
  • [22] Constructing metabolic association networks using high-dimensional mass spectrometry data
    Koo, Imhoi
    Wei, Xiaoli
    Shi, Xue
    Zhou, Zhanxiang
    Kim, Seongho
    Zhang, Xiang
    CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS, 2014, 138 : 193 - 202
  • [23] Diagonal likelihood ratio test for equality of mean vectors in high-dimensional data
    Hu, Zongliang
    Tong, Tiejun
    Genton, Marc G.
    BIOMETRICS, 2019, 75 (01) : 256 - 267
  • [24] limpca: An R package for the linear modeling of high-dimensional designed data based on ASCA/APCA family of methods
    Thiel, Michel
    Benaiche, Nadia
    Martin, Manon
    Franceschini, Sebastien
    Van Oirbeek, Robin
    Govaerts, Bernadette
    JOURNAL OF CHEMOMETRICS, 2023, 37 (07)
  • [25] Adaptive Bayesian Spectral Analysis of High-Dimensional Nonstationary Time Series
    Li, Zeda
    Rosen, Ori
    Ferrarelli, Fabio
    Krafty, Robert T.
    JOURNAL OF COMPUTATIONAL AND GRAPHICAL STATISTICS, 2021, 30 (03) : 794 - 807
  • [26] GENERALIZED CONFIDENCE REGIONS OF FIXED EFFECTS IN THE TWO-WAY ANOVA
    Weiyan MU Department of Mathematics
    Journal of Systems Science & Complexity, 2008, 21 (02) : 276 - 282
  • [27] Generalized confidence regions of fixed effects in the two-way ANOVA*
    Weiyan MU
    Shifeng XIONG
    Xingzhong XU
    Journal of Systems Science and Complexity, 2008, 21 : 276 - 282
  • [28] Big-Data Tensor Recovery for High-Dimensional Uncertainty Quantification of Process Variations
    Zhang, Zheng
    Weng, Tsui-Wei
    Daniel, Luca
    IEEE TRANSACTIONS ON COMPONENTS PACKAGING AND MANUFACTURING TECHNOLOGY, 2017, 7 (05): : 687 - 697
  • [29] Imputation for incomplete high-dimensional multivariate normal data using a common factor model
    Song, JW
    Belin, TR
    STATISTICS IN MEDICINE, 2004, 23 (18) : 2827 - 2843
  • [30] Generalized confidence regions of fixed effects in the two-way ANOVA
    Mu, Weiyan
    Xiong, Shifeng
    Xu, Xingzhong
    JOURNAL OF SYSTEMS SCIENCE & COMPLEXITY, 2008, 21 (02) : 276 - 282