WGCNA and Machine Learning-Based Integrative Bioinformatics Analysis for Identifying Key Genes of Colorectal Cancer

被引:1
|
作者
Al Mehedi Hasan, Md. [1 ]
Maniruzzaman, Md. [2 ,3 ]
Shin, Jungpil [3 ]
机构
[1] Rajshahi Univ Engn & Technol, Dept Comp Sci & Engn, Rajshahi 6204, Bangladesh
[2] Khulna Univ, Stat Discipline, Khulna 9208, Bangladesh
[3] Univ Aizu, Sch Comp Sci & Engn, Aizu Wakamatsu, 9658580, Japan
来源
IEEE ACCESS | 2024年 / 12卷
关键词
Training; Bioinformatics; Biomarkers; Proteins; Support vector machines; Object recognition; Network analyzers; Gene expression; Databases; Correlation; Colorectal cancer; Machine learning; WGCNA; machine learning-based models; differentially expressed discriminative genes; bioinformatics analysis; key genes; CARCINOMA; PROGNOSIS; ONTOLOGY;
D O I
10.1109/ACCESS.2024.3472688
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Colorectal cancer (CC) is a significant public health concern and make it necessary to identify reliable biomarkers and elucidate their molecular and biological mechanisms. This study proposed a system by integrating weighted gene co-expression network analysis (WGCNA) and machine learning-based integrative bioinformatics (ML-IB) analysis to identify key genes for CC. WGCNA was implemented to find a co-expression network of genes and identify important genes by intersecting gene sets obtained using module membership and gene significance criteria across datasets. WGCNA-based significant genes were determined by intersecting important genes between two datasets. ML-IB based approach primarily identified differentially expressed genes (DEGs), then employed support vector machine to determine differentially expressed discriminative genes (DEDGs) and took their common DEDGs across datasets. Protein-protein interaction networks were built and identified hub genes based on the degrees of connectivity and hub module genes using MCODE scores. The ML-IB based significant genes were determined by intersecting hub genes and hub module genes. Four common significant genes were found by intersecting significant genes derived from WGCNA and ML-IB based perspectives. Finally, two genes (AURKA and CCNA2) were determined as key genes for showing strong correlation with survival of CC patients and validated their discriminative capability on an independent test dataset using AUC analysis. The key genes of AURKA and CCNA2 may be used for the early detection of patients with CC. This study will helpful for physicians and doctors to determine and understand the associated the molecular mechanisms and pathway of patients with CC.
引用
收藏
页码:144350 / 144363
页数:14
相关论文
共 50 条
  • [21] Diagnostic genes and immune infiltration analysis of colorectal cancer determined by LASSO and SVM machine learning methods: a bioinformatics analysis
    Li, Yan-Rong
    Meng, Ke
    Yang, Guang
    Liu, Bao-Hai
    Li, Chu-Qiao
    Zhang, Jia-Yuan
    Zhang, Xiao-Mei
    JOURNAL OF GASTROINTESTINAL ONCOLOGY, 2022, 13 (03) : 1188 - +
  • [22] Integrating machine learning and bioinformatics approaches for identifying novel diagnostic gene biomarkers in colorectal cancer
    Vaziri-Moghadam, Ayoub
    Foroughmand-Araabi, Mohammad-Hadi
    SCIENTIFIC REPORTS, 2024, 14 (01):
  • [23] Identifying key genes for diabetic kidney disease by bioinformatics analysis
    Xu, Yushan
    Li, Lan
    Tang, Ping
    Zhang, Jingrong
    Zhong, Ruxian
    Luo, Jingmei
    Lin, Jie
    Zhang, Lihua
    BMC NEPHROLOGY, 2023, 24 (01)
  • [24] Identifying key genes for diabetic kidney disease by bioinformatics analysis
    Yushan Xu
    Lan Li
    Ping Tang
    Jingrong Zhang
    Ruxian Zhong
    Jingmei Luo
    Jie Lin
    Lihua Zhang
    BMC Nephrology, 24
  • [25] Identifying key genes in rheumatoid arthritis using bioinformatics analysis
    Song, Xing
    Zeng, Xiaofeng
    INTERNATIONAL JOURNAL OF CLINICAL AND EXPERIMENTAL MEDICINE, 2019, 12 (07): : 8662 - +
  • [26] Machine learning-based meta-analysis of colorectal cancer and inflammatory bowel disease
    Sardari, Aria
    Usefi, Hamid
    PLOS ONE, 2023, 18 (12):
  • [27] Identification of potential core genes in colorectal carcinoma and key genes in colorectal cancer liver metastasis using bioinformatics analysis
    Niu, Lipeng
    Gao, Ce
    Li, Yang
    SCIENTIFIC REPORTS, 2021, 11 (01)
  • [28] Identification of key circadian rhythm genes in skin aging based on bioinformatics and machine learning
    Xiao, Xiao
    Feng, Hao
    Liao, Yangying
    Tang, Hua
    Li, Lan
    Li, Ke
    Hu, Feng
    AGING-US, 2023, 15 (20): : 11672 - 11689
  • [29] Identification of potential core genes in colorectal carcinoma and key genes in colorectal cancer liver metastasis using bioinformatics analysis
    Lipeng Niu
    Ce Gao
    Yang Li
    Scientific Reports, 11
  • [30] ASTK: A Machine Learning-Based Integrative Software for Alternative Splicing Analysis
    Huang, Shenghui
    He, Jiangshuang
    Yu, Lei
    Guo, Jun
    Jiang, Shangying
    Sun, Zhaoxia
    Cheng, Linghui
    Chen, Xing
    Ji, Xiang
    Zhang, Yi
    ADVANCED INTELLIGENT SYSTEMS, 2024, 6 (04)