Tensor-Decomposition-Based Unsupervised Feature Extraction Applied to Prostate Cancer Multiomics Data

被引:3
|
作者
Taguchi, Y-h [1 ]
Turki, Turki [2 ]
机构
[1] Chuo Univ, Dept Phys, Tokyo 1128551, Japan
[2] King Abdulaziz Univ, Dept Comp Sci, Jeddah 21589, Saudi Arabia
关键词
prostate cancer; gene expression; genomic regions; protien-coding genes; tensor decomposition; unsupervised learning; MODEL;
D O I
10.3390/genes11121493
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
The large p small n problem is a challenge without a de facto standard method available to it. In this study, we propose a tensor-decomposition (TD)-based unsupervised feature extraction (FE) formalism applied to multiomics datasets, in which the number of features is more than 100,000 whereas the number of samples is as small as about 100, hence constituting a typical large p small n problem. The proposed TD-based unsupervised FE outperformed other conventional supervised feature selection methods, random forest, categorical regression (also known as analysis of variance, or ANOVA), penalized linear discriminant analysis, and two unsupervised methods, multiple non-negative matrix factorization and principal component analysis (PCA) based unsupervised FE when applied to synthetic datasets and four methods other than PCA based unsupervised FE when applied to multiomics datasets. The genes selected by TD-based unsupervised FE were enriched in genes known to be related to tissues and transcription factors measured. TD-based unsupervised FE was demonstrated to be not only the superior feature selection method but also the method that can select biologically reliable genes. To our knowledge, this is the first study in which TD-based unsupervised FE has been successfully applied to the integration of this variety of multiomics measurements.
引用
收藏
页码:1 / 23
页数:23
相关论文
共 50 条
  • [31] CANONICAL POLYADIC DECOMPOSITION FOR UNSUPERVISED LINEAR FEATURE EXTRACTION FROM PROTEIN PROFILES
    Jukic, A.
    Kopriva, I.
    Cichocki, A.
    2013 PROCEEDINGS OF THE 21ST EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2013,
  • [32] Principal component analysis- and tensor decomposition-based unsupervised feature extraction to select more suitable differentially methylated cytosines: Optimization of standard deviation versus state-of-the-art methods
    Taguchi, Y. -H.
    Turki, Turki
    GENOMICS, 2023, 115 (02)
  • [33] Despeckling Multitemporal Polarimetric SAR Data Based on Tensor Decomposition
    Luo, Jiayin
    Zhang, Lu
    Dong, Jie
    Lopez-Sanchez, Juan M.
    Wang, Yian
    Feng, Hao
    Liao, Mingsheng
    IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2023, 16 : 10285 - 10300
  • [34] Principal component analysis based unsupervised feature extraction applied to budding yeast temporally periodic gene expression
    Taguchi, Y-h
    BIODATA MINING, 2016, 9
  • [35] Principal component analysis based unsupervised feature extraction applied to budding yeast temporally periodic gene expression
    Y-h Taguchi
    BioData Mining, 9
  • [36] Unsupervised EEG feature extraction based on echo state network
    Sun, Leilei
    Jin, Bo
    Yang, Haoyu
    Tong, Jianing
    Liu, Chuanren
    Xiong, Hui
    INFORMATION SCIENCES, 2019, 475 : 1 - 17
  • [37] Epileptic Seizures Prediction Based on Unsupervised Learning for Feature Extraction
    Wang, Ruyan
    Wang, Linhai
    He, Peng
    Cui, Yaping
    Wu, Dapeng
    IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS (ICC 2022), 2022, : 4643 - 4648
  • [38] Robust tensor decomposition with kernel rescaled error loss for feature extraction and dimensionality reduction
    Zhang, Shuaishuai
    Wang, Xiaofeng
    EXPERT SYSTEMS WITH APPLICATIONS, 2025, 270
  • [39] Graph Based Unsupervised Feature Selection for Microarray Data
    Swarnkar, Tripti
    Mitra, Pabitra
    2012 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE WORKSHOPS (BIBMW), 2012,
  • [40] Image Feature Extraction Method with SIFT to Diagnose Prostate Cancer
    Toki, Yoshiteru
    Tanaka, Toshiyuki
    2012 PROCEEDINGS OF SICE ANNUAL CONFERENCE (SICE), 2012, : 2185 - 2188