A feature extraction framework for discovering pan-cancer driver genes based on multi-omics data

被引:0
|
作者
Xue, Xiaomeng [1 ]
Li, Feng [1 ]
Shang, Junliang [1 ]
Dai, Lingyun [1 ]
Ge, Daohui [1 ]
Ren, Qianqian [1 ]
机构
[1] Qufu Normal Univ, Sch Comp Sci, Rizhao, Peoples R China
基金
中国国家自然科学基金;
关键词
cancer driver genes; feature extraction; multi-omics data; network propagation; pan-cancer; SOMATIC MUTATIONS; FEATURE-SELECTION; PATHWAYS; TOPSIS;
D O I
10.1002/qub2.40
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
The identification of tumor driver genes facilitates accurate cancer diagnosis and treatment, playing a key role in precision oncology, along with gene signaling, regulation, and their interaction with protein complexes. To tackle the challenge of distinguishing driver genes from a large number of genomic data, we construct a feature extraction framework for discovering pan-cancer driver genes based on multi-omics data (mutations, gene expression, copy number variants, and DNA methylation) combined with protein-protein interaction (PPI) networks. Using a network propagation algorithm, we mine functional information among nodes in the PPI network, focusing on genes with weak node information to represent specific cancer information. From these functional features, we extract distribution features of pan-cancer data, pan-cancer TOPSIS features of functional features using the ideal solution method, and SetExpan features of pan-cancer data from the gene functional features, a method to rank pan-cancer data based on the average inverse rank. These features represent the common message of pan-cancer. Finally, we use the lightGBM classification algorithm for gene prediction. Experimental results show that our method outperforms existing methods in terms of the area under the check precision-recall curve (AUPRC) and demonstrates better performance across different PPI networks. This indicates our framework's effectiveness in predicting potential cancer genes, offering valuable insights for the diagnosis and treatment of tumors.
引用
收藏
页码:173 / 181
页数:9
相关论文
共 50 条
  • [41] Integrated Multi-omics Analysis Using Variational Autoencoders: Application to Pan-cancer Classification
    Zhang, Xiaoyu
    Zhang, Jingqing
    Sun, Kai
    Yang, Xian
    Dai, Chengliang
    Guo, Yike
    2019 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE (BIBM), 2019, : 765 - 769
  • [42] Comparison of Chemometric Explorative Multi-Omics Data Analysis Methods Applied to a Mechanistic Pan-Cancer Cell Model
    Westerhuis, J. A.
    Heintz-Buschart, A.
    Hoefsloot, H. C. J.
    van Der Kloet, F. M.
    van Der Ploeg, G. R.
    White, F. T. G.
    JOURNAL OF CHEMOMETRICS, 2025, 39 (02)
  • [43] Pan-cancer evidence of prognosis, immune infiltration, and immunotherapy efficacy for annexin family using multi-omics data
    Shen, Chong
    Zhang, Siyang
    Zhang, Zhe
    Yang, Shaobo
    Zhang, Yu
    Lin, Yuda
    Fu, Chong
    Li, Zhi
    Wu, Zhouliang
    Wang, Zejin
    Li, Zhuolun
    Guo, Jian
    Li, Peng
    Hu, Hailong
    FUNCTIONAL & INTEGRATIVE GENOMICS, 2023, 23 (03)
  • [44] scCancerExplorer: a comprehensive database for interactively exploring single-cell multi-omics data of human pan-cancer
    Huang, Changzhi
    Liu, Zekai
    Guo, Yunlei
    Wang, Wanchu
    Yuan, Zhen
    Guan, Yusheng
    Pan, Deng
    Hu, Zhibin
    Sun, Linhua
    Fu, Zan
    Bian, Shuhui
    NUCLEIC ACIDS RESEARCH, 2024, 53 (D1) : D1526 - D1535
  • [45] Identifying mutated driver pathways in cancer by integrating multi-omics data
    Wu, Jingli
    Cai, Qirong
    Wang, Jinyan
    Liao, Yuanxiu
    COMPUTATIONAL BIOLOGY AND CHEMISTRY, 2019, 80 (159-167) : 159 - 167
  • [46] ModulOmics: Integrating Multi-Omics Data to Identify Cancer Driver Modules
    Silverbush, Dana
    Cristea, Simona
    Yanovich, Gali
    Geiger, Tamar
    Beerenwinkel, Niko
    Sharan, Roded
    RESEARCH IN COMPUTATIONAL MOLECULAR BIOLOGY, RECOMB 2018, 2018, 10812 : 283 - 284
  • [47] Network-based integration of multi-omics data for prioritizing cancer genes
    Dimitrakopoulos, Christos
    Hindupur, Sravanth Kumar
    Haefliger, Luca
    Behr, Jonas
    Montazeri, Hesam
    Hall, Michael N.
    Beerenwinkel, Niko
    BIOINFORMATICS, 2018, 34 (14) : 2441 - 2448
  • [48] Comprehensive Analysis of Metabolic Genes in Breast Cancer Based on Multi-Omics Data
    Hua, Yu
    Gao, Lihong
    Li, Xiaobo
    PATHOLOGY & ONCOLOGY RESEARCH, 2021, 27
  • [49] Improving existing analysis pipeline to identify and analyze cancer driver genes using multi-omics data
    Nguyen, Quang-Huy
    Le, Duc-Hau
    SCIENTIFIC REPORTS, 2020, 10 (01)
  • [50] Improving existing analysis pipeline to identify and analyze cancer driver genes using multi-omics data
    Quang-Huy Nguyen
    Duc-Hau Le
    Scientific Reports, 10