WGCNA and Machine Learning-Based Integrative Bioinformatics Analysis for Identifying Key Genes of Colorectal Cancer

Cited by: 1
Authors
Al Mehedi Hasan, Md. [1]
Maniruzzaman, Md. [2,3]
Shin, Jungpil [3]
Affiliations
[1] Rajshahi Univ Engn & Technol, Dept Comp Sci & Engn, Rajshahi 6204, Bangladesh
[2] Khulna Univ, Stat Discipline, Khulna 9208, Bangladesh
[3] Univ Aizu, Sch Comp Sci & Engn, Aizu Wakamatsu, 9658580, Japan
Source
IEEE ACCESS | 2024 / Vol. 12
Keywords
Training; Bioinformatics; Biomarkers; Proteins; Support vector machines; Object recognition; Network analyzers; Gene expression; Databases; Correlation; Colorectal cancer; Machine learning; WGCNA; machine learning-based models; differentially expressed discriminative genes; bioinformatics analysis; key genes; CARCINOMA; PROGNOSIS; ONTOLOGY;
DOI
10.1109/ACCESS.2024.3472688
Chinese Library Classification
TP [Automation Technology, Computer Technology];
Discipline classification code
0812;
Abstract
Colorectal cancer (CC) is a significant public health concern, which makes it necessary to identify reliable biomarkers and elucidate their molecular and biological mechanisms. This study proposes a system that integrates weighted gene co-expression network analysis (WGCNA) with machine learning-based integrative bioinformatics (ML-IB) analysis to identify key genes for CC. WGCNA was used to construct a gene co-expression network and to identify important genes by intersecting the gene sets obtained using the module membership and gene significance criteria across datasets. WGCNA-based significant genes were then determined by intersecting the important genes between two datasets. The ML-IB approach first identified differentially expressed genes (DEGs), then employed a support vector machine to determine differentially expressed discriminative genes (DEDGs), and retained the DEDGs common across datasets. Protein-protein interaction networks were built, from which hub genes were identified based on degree of connectivity and hub module genes based on MCODE scores. The ML-IB-based significant genes were determined by intersecting the hub genes and the hub module genes. Four common significant genes were found by intersecting the significant genes derived from the WGCNA and ML-IB perspectives. Finally, two genes (AURKA and CCNA2) were determined to be key genes because they showed a strong correlation with the survival of CC patients, and their discriminative capability was validated on an independent test dataset using AUC analysis. The key genes AURKA and CCNA2 may be useful for the early detection of CC. This study should help physicians determine and understand the molecular mechanisms and pathways associated with CC.
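The set-intersection logic described in the abstract can be illustrated with a minimal Python sketch. This is not the authors' implementation: the gene labels (`G1` to `G6`), the toy PPI edge list, the `top_n` cutoff, and the helper names `intersect_all` and `hub_genes_by_degree` are all hypothetical, and the MCODE hub module is replaced by a hand-written stand-in set.

```python
from collections import Counter


def intersect_all(*gene_sets):
    """Return the genes present in every input collection."""
    sets = [set(s) for s in gene_sets]
    return set.intersection(*sets) if sets else set()


def hub_genes_by_degree(edges, top_n):
    """Rank genes by degree in an undirected edge list and keep the top_n.

    Ties are broken alphabetically so the result is deterministic.
    """
    degree = Counter()
    for a, b in edges:
        degree[a] += 1
        degree[b] += 1
    ranked = sorted(degree, key=lambda g: (-degree[g], g))
    return set(ranked[:top_n])


# Hypothetical "important genes" from WGCNA in two datasets.
wgcna_ds1 = {"G1", "G2", "G3", "G4"}
wgcna_ds2 = {"G1", "G2", "G3", "G5"}
wgcna_significant = intersect_all(wgcna_ds1, wgcna_ds2)

# Hypothetical PPI edges among the common DEDGs.
ppi_edges = [
    ("G1", "G2"), ("G1", "G3"), ("G1", "G6"),
    ("G2", "G3"), ("G2", "G6"), ("G6", "G5"),
]
hub_genes = hub_genes_by_degree(ppi_edges, top_n=3)

# Stand-in for the hub module genes that MCODE would report.
hub_module_genes = {"G1", "G2", "G3"}
ml_ib_significant = intersect_all(hub_genes, hub_module_genes)

# Genes supported by both the WGCNA and the ML-IB perspectives.
common_significant = intersect_all(wgcna_significant, ml_ib_significant)
print(sorted(common_significant))
```

In this toy setup a gene survives only if every stage selects it, which is why the final list is short even when each individual stage is permissive.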
Pages: 144350-144363
Number of pages: 14