Nonlinear Sparse Component Analysis with a Reference: Variable Selection in Genomics and Proteomics

被引:0
作者
Kopriva, Ivica [1 ]
Kapitanovic, Sanja [2 ]
Cacev, Tamara [2 ]
机构
[1] Rudjer Boskovic Inst, Div Laser & Atom R&D, Zagreb 10000, Croatia
[2] Rudjer Boskovic Inst, Div Mol Med, Zagreb 10000, Croatia
来源
LATENT VARIABLE ANALYSIS AND SIGNAL SEPARATION, LVA/ICA 2015 | 2015年 / 9237卷
关键词
Variable selection; Nonlinear mixture model; Empirical kernel maps; Sparse component analysis; CANCER; CLASSIFICATION; ALGORITHMS; PATTERNS; DISCOVERY; SERUM;
D O I
10.1007/978-3-319-22482-4_19
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Many scenarios occurring in genomics and proteomics involve small number of labeled data and large number of variables. To create prediction models robust to overfitting variable selection is necessary. We propose variable selection method using nonlinear sparse component analysis with a reference representing either negative (healthy) or positive (cancer) class. Thereby, component comprised of cancer related variables is automatically inferred from the geometry of nonlinear mixture model with a reference. Proposed method is compared with 3 supervised and 2 unsupervised variable selection methods on two-class problems using 2 genomic and 2 proteomic datasets. Obtained results, which include analysis of biological relevance of selected genes, are comparable with those achieved by supervised methods. Thus, proposed method can possibly perform better on unseen data of the same cancer type.
引用
收藏
页码:168 / 175
页数:8
相关论文
共 50 条
  • [21] SuRF: A new method for sparse variable selection, with application in microbiome data analysis
    Liu, Lihui
    Gu, Hong
    Van Limbergen, Johan
    Kenney, Toby
    [J]. STATISTICS IN MEDICINE, 2021, 40 (04) : 897 - 919
  • [22] Simultaneous variable and factor selection via sparse group lasso in factor analysis
    Dang, Yuanchu
    Wang, Qing
    [J]. JOURNAL OF STATISTICAL COMPUTATION AND SIMULATION, 2019, 89 (14) : 2744 - 2764
  • [23] COORDINATE-INDEPENDENT SPARSE SUFFICIENT DIMENSION REDUCTION AND VARIABLE SELECTION
    Chen, Xin
    Zou, Changliang
    Cook, R. Dennis
    [J]. ANNALS OF STATISTICS, 2010, 38 (06) : 3696 - 3723
  • [24] Component selection and variable selection for mixture regression models
    Qi, Xuefei
    Xu, Xingbai
    Feng, Zhenghui
    Peng, Heng
    [J]. COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2025, 206
  • [25] SPARSE BOUNDED COMPONENT ANALYSIS
    Babatas, Eren
    Erdogan, Alper T.
    [J]. 2016 IEEE 26TH INTERNATIONAL WORKSHOP ON MACHINE LEARNING FOR SIGNAL PROCESSING (MLSP), 2016,
  • [26] A comparison of simulated annealing algorithms for variable selection in principal component analysis and discriminant analysis
    Brusco, Michael J.
    [J]. COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2014, 77 : 38 - 53
  • [27] Finite mixture regression: A sparse variable selection by model selection for clustering
    Devijver, Emilie
    [J]. ELECTRONIC JOURNAL OF STATISTICS, 2015, 9 (02): : 2642 - 2674
  • [28] Instance Selection Using Nonlinear Sparse Modeling
    Dornaika, Fadi
    Kamal Aldine, Ihab
    [J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2018, 28 (06) : 1457 - 1461
  • [29] Bayesian variable selection with sparse and correlation priors for high-dimensional data analysis
    Yang, Aijun
    Jiang, Xuejun
    Shu, Lianjie
    Lin, Jinguan
    [J]. COMPUTATIONAL STATISTICS, 2017, 32 (01) : 127 - 143
  • [30] SPOT: Sparse Optimal Transformations for High Dimensional Variable Selection and Exploratory Regression Analysis
    Huang, Qiming
    Zhu, Michael
    [J]. KDD'17: PROCEEDINGS OF THE 23RD ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2017, : 857 - 865