Nonlinear Sparse Component Analysis with a Reference: Variable Selection in Genomics and Proteomics

被引:0
作者
Kopriva, Ivica [1 ]
Kapitanovic, Sanja [2 ]
Cacev, Tamara [2 ]
机构
[1] Rudjer Boskovic Inst, Div Laser & Atom R&D, Zagreb 10000, Croatia
[2] Rudjer Boskovic Inst, Div Mol Med, Zagreb 10000, Croatia
来源
LATENT VARIABLE ANALYSIS AND SIGNAL SEPARATION, LVA/ICA 2015 | 2015年 / 9237卷
关键词
Variable selection; Nonlinear mixture model; Empirical kernel maps; Sparse component analysis; CANCER; CLASSIFICATION; ALGORITHMS; PATTERNS; DISCOVERY; SERUM;
D O I
10.1007/978-3-319-22482-4_19
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Many scenarios occurring in genomics and proteomics involve small number of labeled data and large number of variables. To create prediction models robust to overfitting variable selection is necessary. We propose variable selection method using nonlinear sparse component analysis with a reference representing either negative (healthy) or positive (cancer) class. Thereby, component comprised of cancer related variables is automatically inferred from the geometry of nonlinear mixture model with a reference. Proposed method is compared with 3 supervised and 2 unsupervised variable selection methods on two-class problems using 2 genomic and 2 proteomic datasets. Obtained results, which include analysis of biological relevance of selected genes, are comparable with those achieved by supervised methods. Thus, proposed method can possibly perform better on unseen data of the same cancer type.
引用
收藏
页码:168 / 175
页数:8
相关论文
共 50 条
[21]   Simultaneous variable and factor selection via sparse group lasso in factor analysis [J].
Dang, Yuanchu ;
Wang, Qing .
JOURNAL OF STATISTICAL COMPUTATION AND SIMULATION, 2019, 89 (14) :2744-2764
[22]   Component selection and variable selection for mixture regression models [J].
Qi, Xuefei ;
Xu, Xingbai ;
Feng, Zhenghui ;
Peng, Heng .
COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2025, 206
[23]   COORDINATE-INDEPENDENT SPARSE SUFFICIENT DIMENSION REDUCTION AND VARIABLE SELECTION [J].
Chen, Xin ;
Zou, Changliang ;
Cook, R. Dennis .
ANNALS OF STATISTICS, 2010, 38 (06) :3696-3723
[24]   SPARSE BOUNDED COMPONENT ANALYSIS [J].
Babatas, Eren ;
Erdogan, Alper T. .
2016 IEEE 26TH INTERNATIONAL WORKSHOP ON MACHINE LEARNING FOR SIGNAL PROCESSING (MLSP), 2016,
[25]   A comparison of simulated annealing algorithms for variable selection in principal component analysis and discriminant analysis [J].
Brusco, Michael J. .
COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2014, 77 :38-53
[26]   Lithium exploration targeting through robust variable selection and deep anomaly detection: An integrated application of sparse principal component analysis and stacked autoencoders [J].
Esmaeiloghli, Saeid ;
Lima, Alexandre ;
Sadeghi, Behnam .
GEOCHEMISTRY, 2024, 84 (04)
[27]   Finite mixture regression: A sparse variable selection by model selection for clustering [J].
Devijver, Emilie .
ELECTRONIC JOURNAL OF STATISTICS, 2015, 9 (02) :2642-2674
[28]   Instance Selection Using Nonlinear Sparse Modeling [J].
Dornaika, Fadi ;
Kamal Aldine, Ihab .
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2018, 28 (06) :1457-1461
[29]   SPOT: Sparse Optimal Transformations for High Dimensional Variable Selection and Exploratory Regression Analysis [J].
Huang, Qiming ;
Zhu, Michael .
KDD'17: PROCEEDINGS OF THE 23RD ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2017, :857-865
[30]   Bayesian variable selection with sparse and correlation priors for high-dimensional data analysis [J].
Yang, Aijun ;
Jiang, Xuejun ;
Shu, Lianjie ;
Lin, Jinguan .
COMPUTATIONAL STATISTICS, 2017, 32 (01) :127-143