Combining Mutation and Gene Network Data in a Machine Learning Approach for False-Positive Cancer Driver Gene Discovery

被引:3
|
作者
Cutigi, Jorge Francisco [1 ,2 ]
Evangelista, Renato Feijo [2 ]
Ramos, Rodrigo Henrique [1 ,2 ]
Lage Ferreira, Cynthia de Oliveira [2 ]
Evangelista, Adriane Feijo [3 ]
de Carvalho, Andre C. P. L. F. [2 ]
Simao, Adenilso [2 ]
机构
[1] Fed Inst Sao Paulo, Sao Carlos, SP, Brazil
[2] Univ Sao Paulo, Sao Carlos, SP, Brazil
[3] Barretos Canc Hosp, Barretos, SP, Brazil
来源
ADVANCES IN BIOINFORMATICS AND COMPUTATIONAL BIOLOGY, BSB 2020 | 2020年 / 12558卷
关键词
Cancer bioinformatics; Driver genes; False-positive driver; Complex networks; Machine learning; PATHWAYS;
D O I
10.1007/978-3-030-65775-8_8
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
An increasing interest in Cancer Genomics research emerged from the advent and widespread use of next-generation sequencing technologies, which have generated a large amount of digital biological data. However, not all of this information in fact contributes to cancer studies. For instance, false-positive-driver genes may contain characteristics of cancer genes but are not actually relevant to the cancer initiation and progression. Including this type of genes in cancer studies may lead to identifying unrealistic trends in the data and mislead biomedical decisions. Therefore, proper screening to detect this specific type of gene among genes considered drivers is of utmost importance. This work is focused on the development of models dedicated to this task. Support Vector Machine (SVM) and Random Forest (RF) machine learning algorithms were selected to induce predictive models to classify supposedly driver genes as real drivers or false-positive drivers based on both mutation data and gene network interactions. The results confirmed that the combination of the two sources of information improves the performance of the models. Moreover, SVM and RF models achieved a classification accuracy of 85.0% and 82.4% over labeled data, respectively. Finally, a literature-based analysis was performed over the classification of a new set of genes to further validate the concept.
引用
收藏
页码:81 / 92
页数:12
相关论文
共 50 条
  • [21] Overcoming false-positive gene-category enrichment in the analysis of spatially resolved transcriptomic brain atlas data
    Ben D. Fulcher
    Aurina Arnatkeviciute
    Alex Fornito
    Nature Communications, 12
  • [22] Cancer driver gene discovery in transcriptional regulatory networks using influence maximization approach
    Rahimi, Majid
    Teimourpour, Babak
    Marashi, Sayed-Amir
    COMPUTERS IN BIOLOGY AND MEDICINE, 2019, 114
  • [23] Intratumoral Resolution of Driver Gene Mutation Heterogeneity in Renal Cancer Using Deep Learning
    Acosta, Paul H.
    Panwar, Vandana
    Jarmale, Vipul
    Christie, Alana
    Jasti, Jay
    Margulis, Vitaly
    Rakheja, Dinesh
    Cheville, John
    Leibovich, Bradley C.
    Parker, Alexander
    Brugarolas, James
    Kapur, Payal
    Rajaram, Satwik
    CANCER RESEARCH, 2022, 82 (15) : 2792 - 2806
  • [24] Multi-Network Graph Contrastive Learning for Cancer Driver Gene Identification
    Peng, Wei
    Zhou, Zhengnan
    Dai, Wei
    Yu, Ning
    Wang, Jianxin
    IEEE TRANSACTIONS ON NETWORK SCIENCE AND ENGINEERING, 2024, 11 (04): : 3430 - 3440
  • [26] Advancing cancer driver gene identification through an integrative network and pathway approach
    Song, Junrong
    Song, Zhiming
    Gong, Yuanli
    Ge, Lichang
    Lou, Wenlu
    JOURNAL OF BIOMEDICAL INFORMATICS, 2024, 158
  • [27] Developmental gene regulatory network connections predicted by machine learning from gene expression data alone
    Zhang, Jingyi
    Ibrahim, Farhan
    Najmulski, Emily
    Katholos, George
    Altarawy, Doaa
    Heath, Lenwood S.
    Tulin, Sarah L.
    PLOS ONE, 2021, 16 (12):
  • [28] FCMEDriver: Identifying Cancer Driver Gene by Combining Mutual Exclusivity of Embedded Features and Optimized Mutation Frequency Score
    Yi, Sichen
    Xie, MinZhu
    BIOINFORMATICS RESEARCH AND APPLICATIONS, PT III, ISBRA 2024, 2024, 14956 : 130 - 141
  • [29] A machine learning approach to predict platform specific gene essentiality in cancer
    Gilvary, Coryandar M.
    Madhukar, Neel S.
    Gayvert, Kaitlyn M.
    Rickman, David S.
    Elemento, Olivier
    CANCER RESEARCH, 2017, 77
  • [30] Enhancing Molecular Network-Based Cancer Driver Gene Prediction Using Machine Learning Approaches: Current Challenges and Opportunities
    Zhang, Hao
    Lin, Chaohuan
    Chen, Ying'ao
    Shen, Xianrui
    Wang, Ruizhe
    Chen, Yiqi
    Lyu, Jie
    JOURNAL OF CELLULAR AND MOLECULAR MEDICINE, 2025, 29 (01)