AVBAE-MODFR: A novel deep learning framework of embedding and feature selection on multi-omics data for pan-cancer classification

被引:1
|
作者
Li M. [1 ]
Guo H. [1 ]
Wang K. [1 ]
Kang C. [1 ]
Yin Y. [2 ]
Zhang H. [1 ]
机构
[1] National Key Laboratory of Intelligent Tracking and Forecasting for Infectious Diseases, Engineering Research Center of Trusted Behavior Intelligence, Ministry of Education, College of Artificial Intelligence, Nankai University, Tongyan Road, Tianjin
[2] Department of Food Science and Technology, University of Nebraska - Lincoln, NE
基金
中国国家自然科学基金;
关键词
Deep learning; Feature importance ranking; Multi-omics; Pan-cancer classification; Variational autoencoders;
D O I
10.1016/j.compbiomed.2024.108614
中图分类号
学科分类号
摘要
Integration analysis of cancer multi-omics data for pan-cancer classification has the potential for clinical applications in various aspects such as tumor diagnosis, analyzing clinically significant features, and providing precision medicine. In these applications, the embedding and feature selection on high-dimensional multi-omics data is clinically necessary. Recently, deep learning algorithms become the most promising cancer multi-omic integration analysis methods, due to the powerful capability of capturing nonlinear relationships. Developing effective deep learning architectures for cancer multi-omics embedding and feature selection remains a challenge for researchers in view of high dimensionality and heterogeneity. In this paper, we propose a novel two-phase deep learning model named AVBAE-MODFR for pan-cancer classification. AVBAE-MODFR achieves embedding by a multi2multi autoencoder based on the adversarial variational Bayes method and further performs feature selection utilizing a dual-net-based feature ranking method. AVBAE-MODFR utilizes AVBAE to pre-train the network parameters, which improves the classification performance and enhances feature ranking stability in MODFR. Firstly, AVBAE learns high-quality representation among multiple omics features for unsupervised pan-cancer classification. We design an efficient discriminator architecture to distinguish the latent distributions for updating forward variational parameters. Secondly, we propose MODFR to simultaneously evaluate multi-omics feature importance for feature selection by training a designed multi2one selector network, where the efficient evaluation approach based on the average gradient of random mask subsets can avoid bias caused by input feature drift. We conduct experiments on the TCGA pan-cancer dataset and compare it with four state-of-the-art methods for each phase. The results show the superiority of AVBAE-MODFR over SOTA methods. © 2024 Elsevier Ltd
引用
收藏
相关论文
共 50 条
  • [1] Pan-cancer classification of multi-omics data based on machine learning models
    Cava, Claudia
    Sabetian, Soudabeh
    Salvatore, Christian
    Castiglioni, Isabella
    NETWORK MODELING AND ANALYSIS IN HEALTH INFORMATICS AND BIOINFORMATICS, 2024, 13 (01):
  • [2] A feature extraction framework for discovering pan-cancer driver genes based on multi-omics data
    Xue, Xiaomeng
    Li, Feng
    Shang, Junliang
    Dai, Lingyun
    Ge, Daohui
    Ren, Qianqian
    QUANTITATIVE BIOLOGY, 2024, 12 (02) : 173 - 181
  • [3] A pan-cancer integrative pathway analysis of multi-omics data
    Henry Linder
    Yuping Zhang
    Quantitative Biology, 2020, 8 (02) : 130 - 142
  • [4] A pan-cancer integrative pathway analysis of multi-omics data
    Linder, Henry
    Zhang, Yuping
    QUANTITATIVE BIOLOGY, 2020, 8 (02) : 130 - 142
  • [5] Integration of pan-cancer multi-omics data for novel mixed subgroup identification using machine learning methods
    Khadirnaikar, Seema
    Shukla, Sudhanshu
    Prasanna, S. R. M.
    PLOS ONE, 2023, 18 (10):
  • [6] MetaCancer: A deep learning-based pan-cancer metastasis prediction model developed using multi-omics data
    Albaradei, Somayah
    Napolitano, Francesco
    Thafar, Maha A.
    Gojobori, Takashi
    Essack, Magbubah
    Gao, Xin
    COMPUTATIONAL AND STRUCTURAL BIOTECHNOLOGY JOURNAL, 2021, 19 : 4404 - 4411
  • [7] Identifying individualized risk subpathways reveals pan-cancer molecular classification based on multi-omics data
    Xu, Yanjun
    Wang, Jingwen
    Li, Feng
    Zhang, Chunlong
    Zheng, Xuan
    Cao, Yang
    Shang, Desi
    Hu, Congxue
    Xu, Yingqi
    Mi, Wanqi
    Li, Xia
    Cao, Yan
    Zhang, Yunpeng
    COMPUTATIONAL AND STRUCTURAL BIOTECHNOLOGY JOURNAL, 2022, 20 : 838 - 849
  • [8] Enhancing Lung Cancer Classification and Prediction With Deep Learning and Multi-Omics Data
    Mohamed, Tehnan I. A.
    Ezugwu, Absalom El-Shamir
    IEEE ACCESS, 2024, 12 : 59880 - 59892
  • [9] Classifying the multi-omics data of gastric cancer using a deep feature selection method
    Hu, Yanyu
    Zhao, Long
    Li, Zhao
    Dong, Xiangjun
    Xu, Tiantian
    Zhao, Yuhai
    EXPERT SYSTEMS WITH APPLICATIONS, 2022, 200
  • [10] Identification of Pan-Cancer Prognostic Biomarkers Through Integration of Multi-Omics Data
    Zhao, Ning
    Guo, Maozu
    Wang, Kuanquan
    Zhang, Chunlong
    Liu, Xiaoyan
    FRONTIERS IN BIOENGINEERING AND BIOTECHNOLOGY, 2020, 8