WGCNA and Machine Learning-Based Integrative Bioinformatics Analysis for Identifying Key Genes of Colorectal Cancer

被引:1
|
作者
Al Mehedi Hasan, Md. [1 ]
Maniruzzaman, Md. [2 ,3 ]
Shin, Jungpil [3 ]
机构
[1] Rajshahi Univ Engn & Technol, Dept Comp Sci & Engn, Rajshahi 6204, Bangladesh
[2] Khulna Univ, Stat Discipline, Khulna 9208, Bangladesh
[3] Univ Aizu, Sch Comp Sci & Engn, Aizu Wakamatsu, 9658580, Japan
来源
IEEE ACCESS | 2024年 / 12卷
关键词
Training; Bioinformatics; Biomarkers; Proteins; Support vector machines; Object recognition; Network analyzers; Gene expression; Databases; Correlation; Colorectal cancer; Machine learning; WGCNA; machine learning-based models; differentially expressed discriminative genes; bioinformatics analysis; key genes; CARCINOMA; PROGNOSIS; ONTOLOGY;
D O I
10.1109/ACCESS.2024.3472688
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Colorectal cancer (CC) is a significant public health concern and make it necessary to identify reliable biomarkers and elucidate their molecular and biological mechanisms. This study proposed a system by integrating weighted gene co-expression network analysis (WGCNA) and machine learning-based integrative bioinformatics (ML-IB) analysis to identify key genes for CC. WGCNA was implemented to find a co-expression network of genes and identify important genes by intersecting gene sets obtained using module membership and gene significance criteria across datasets. WGCNA-based significant genes were determined by intersecting important genes between two datasets. ML-IB based approach primarily identified differentially expressed genes (DEGs), then employed support vector machine to determine differentially expressed discriminative genes (DEDGs) and took their common DEDGs across datasets. Protein-protein interaction networks were built and identified hub genes based on the degrees of connectivity and hub module genes using MCODE scores. The ML-IB based significant genes were determined by intersecting hub genes and hub module genes. Four common significant genes were found by intersecting significant genes derived from WGCNA and ML-IB based perspectives. Finally, two genes (AURKA and CCNA2) were determined as key genes for showing strong correlation with survival of CC patients and validated their discriminative capability on an independent test dataset using AUC analysis. The key genes of AURKA and CCNA2 may be used for the early detection of patients with CC. This study will helpful for physicians and doctors to determine and understand the associated the molecular mechanisms and pathway of patients with CC.
引用
收藏
页码:144350 / 144363
页数:14
相关论文
共 50 条
  • [11] Identification of Potential Key Genes and Pathways in Early-Onset Colorectal Cancer Through Bioinformatics Analysis
    Zhao, Bin
    Baloch, Zulqarnain
    Ma, Yunhan
    Wan, Zheng
    Huo, Yani
    Li, Fujun
    Zhao, Yilin
    CANCER CONTROL, 2019, 26 (01)
  • [12] Integrative bioinformatics and machine learning identify key crosstalk genes and immune interactions in head and neck cancer and Hodgkin lymphoma
    Meiling Qin
    Xinxin Li
    Xun Gong
    Yuan Hu
    Min Tang
    Scientific Reports, 15 (1)
  • [13] Identification of key pathways and genes in colorectal cancer using bioinformatics analysis
    Bin Liang
    Chunning Li
    Jianying Zhao
    Medical Oncology, 2016, 33
  • [14] Identification of key pathways and genes in colorectal cancer using bioinformatics analysis
    Liang, Bin
    Li, Chunning
    Zhao, Jianying
    MEDICAL ONCOLOGY, 2016, 33 (10)
  • [15] Bioinformatics Analysis Identifying Key Biomarkers in Bladder Cancer
    Zhang, Chuan
    Berndt-Paetz, Mandy
    Neuhaus, Jochen
    DATA, 2020, 5 (02)
  • [16] Colon cancer diagnosis and staging classification based on machine learning and bioinformatics analysis
    Su, Ying
    Tian, Xuecong
    Gao, Rui
    Guo, Wenjia
    Chen, Cheng
    Chen, Chen
    Jia, Dongfang
    Li, Hongtao
    Lv, Xiaoyi
    COMPUTERS IN BIOLOGY AND MEDICINE, 2022, 145
  • [17] Identification of Key Prognostic Genes of Triple Negative Breast Cancer by LASSO-Based Machine Learning and Bioinformatics Analysis
    Chen, De-Lun
    Cai, Jia-Hua
    Wang, Charles C. N.
    GENES, 2022, 13 (05)
  • [18] Identifying Molecular Biomarkers for Diseases With Machine Learning Based on Integrative Omics
    Shi, Kai
    Lin, Wei
    Zhao, Xing-Ming
    IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2021, 18 (06) : 2514 - 2525
  • [19] The Detection of Colorectal Cancer through Machine Learning-Based Breath Sensor Analysis
    Polaka, Inese
    Mezmale, Linda
    Anarkulova, Linda
    Kononova, Elina
    Vilkoite, Ilona
    Veliks, Viktors
    Lescinska, Anna Marija
    Stonans, Ilmars
    Pcolkins, Andrejs
    Tolmanis, Ivars
    Shani, Gidi
    Haick, Hossam
    Mitrovics, Jan
    Gloeckler, Johannes
    Mizaikoff, Boris
    Leja, Marcis
    DIAGNOSTICS, 2023, 13 (21)
  • [20] Identification of key genes for predicting colorectal cancer prognosis by integrated bioinformatics analysis
    Dai, Gong-Peng
    Wang, Li-Ping
    Wen, Yu-Qing
    Ren, Xue-Qun
    Zuo, Shu-Guang
    ONCOLOGY LETTERS, 2020, 19 (01) : 388 - 398