Identification of Colon Immune Cell Marker Genes Using Machine Learning Methods

被引:0
|
作者
Yang, Yong [1 ]
Zhang, Yuhang [2 ]
Ren, Jingxin [3 ]
Feng, Kaiyan [4 ]
Li, Zhandong [5 ]
Huang, Tao [6 ,7 ]
Cai, Yudong [3 ]
机构
[1] Qianwei Hosp Jilin Prov, Changchun 130012, Peoples R China
[2] Harvard Med Sch, Brigham & Womens Hosp, Channing Div Network Med, Boston, MA 02115 USA
[3] Shanghai Univ, Sch Life Sci, Shanghai 200444, Peoples R China
[4] Guangdong AIB Polytech Coll, Dept Comp Sci, Guangzhou 510507, Peoples R China
[5] Jilin Engn Normal Univ, Coll Biol & Food Engn, Changchun 130052, Peoples R China
[6] Chinese Acad Sci, Univ Chinese Acad Sci, Shanghai Inst Nutr & Hlth, Biomed Big Data Ctr,CAS Key Lab Computat Biol, Shanghai 200031, Peoples R China
[7] Chinese Acad Sci, Univ Chinese Acad Sci, Shanghai Inst Nutr & Hlth, CAS Key Lab Tissue Microenvironm & Tumor, Shanghai 200031, Peoples R China
来源
LIFE-BASEL | 2023年 / 13卷 / 09期
关键词
colon immune cell; marker gene; machine learning; feature selection; NF-KAPPA-B; INFLAMMATORY FACTOR-I; J-CHAIN; FEATURE-SELECTION; T-CELLS; CANCER; ACTIVATION; EXPRESSION; PROTEIN; DRUG;
D O I
10.3390/life13091876
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Immune cell infiltration that occurs at the site of colon tumors influences the course of cancer. Different immune cell compositions in the microenvironment lead to different immune responses and different therapeutic effects. This study analyzed single-cell RNA sequencing data in a normal colon with the aim of screening genetic markers of 25 candidate immune cell types and revealing quantitative differences between them. The dataset contains 25 classes of immune cells, 41,650 cells in total, and each cell is expressed by 22,164 genes at the expression level. They were fed into a machine learning-based stream. The five feature ranking algorithms (last absolute shrinkage and selection operator, light gradient boosting machine, Monte Carlo feature selection, minimum redundancy maximum relevance, and random forest) were first used to analyze the importance of gene features, yielding five feature lists. Then, incremental feature selection and two classification algorithms (decision tree and random forest) were combined to filter the most important genetic markers from each list. For different immune cell subtypes, their marker genes, such as KLRB1 in CD4 T cells, RPL30 in B cell IGA plasma cells, and JCHAIN in IgG producing B cells, were identified. They were confirmed to be differentially expressed in different immune cells and involved in immune processes. In addition, quantitative rules were summarized by using the decision tree algorithm to distinguish candidate immune cell types. These results provide a reference for exploring the cell composition of the colon cancer microenvironment and for clinical immunotherapy.
引用
收藏
页数:20
相关论文
共 50 条
  • [41] Identification of immune-related mitochondrial metabolic disorder genes in septic shock using bioinformatics and machine learning
    Cui, Yu-Hui
    Wu, Chun-Rong
    Huang, Li-Ou
    Xu, Dan
    Tang, Jian-Guo
    HEREDITAS, 2024, 161 (01):
  • [42] Seed identification using machine vision: Machine learning features and model performance
    Himmelboe, Martin
    Jorgensen, Johannes Ravn
    Gislum, Rene
    Boelt, Birte
    COMPUTERS AND ELECTRONICS IN AGRICULTURE, 2025, 231
  • [43] Identification and Verification of Authorship Using Machine and Deep Learning Methods
    Ovasapyan, T. D.
    Tolgorenko, E. M.
    Zubkov, E. A.
    AUTOMATIC CONTROL AND COMPUTER SCIENCES, 2024, 58 (08) : 1271 - 1282
  • [44] Chemical identification of metamorphic protoliths using machine learning methods
    Hasterok, D.
    Gard, M.
    Bishop, C. M. B.
    Kelsey, D.
    COMPUTERS & GEOSCIENCES, 2019, 132 : 56 - 68
  • [45] Identification of Marker Genes for Cancer Based on Microarrays Using a Computational Biology Approach
    Wang, Xiaosheng
    CURRENT BIOINFORMATICS, 2014, 9 (02) : 140 - 146
  • [46] Identification of Signature Genes and Characterizations of Tumor Immune Microenvironment and Tumor Purity in Lung Adenocarcinoma Based on Machine Learning
    Feng, Haiming
    Zhao, Ye
    Yan, Weijian
    Wei, Xiaoping
    Lin, Junping
    Jiang, Peng
    Wang, Cheng
    Li, Bin
    FRONTIERS IN MEDICINE, 2022, 9
  • [47] Identification of immune-associated genes in diagnosing osteoarthritis with metabolic syndrome by integrated bioinformatics analysis and machine learning
    Li, Junchen
    Wang, Genghong
    Xv, Xilin
    Li, Zhigang
    Shen, Yiwei
    Zhang, Cheng
    Zhang, Xiaofeng
    FRONTIERS IN IMMUNOLOGY, 2023, 14
  • [48] Identification and validation of immune and cuproptosis - related genes for diabetic nephropathy by WGCNA and machine learning
    Chen, Yubing
    Liao, Lijuan
    Wang, Baoju
    Wu, Zhan
    FRONTIERS IN IMMUNOLOGY, 2024, 15
  • [49] Identification and isolation of BZR transcription factor and screening of cell wall degradation marker genes based on machine learning in ripening kiwifruit
    Yang, Yaming
    Ren, Shichang
    Chen, Ming
    Li, Zhaoting
    Ma, Qian
    Zhang, Miaochan
    Li, Hualin
    Ren, Xiaolin
    Ding, Yuduan
    POSTHARVEST BIOLOGY AND TECHNOLOGY, 2024, 211
  • [50] Identification of Age-Related Characteristic Genes Involved in Severe COVID-19 Infection Among Elderly Patients Using Machine Learning and Immune Cell Infiltration Analysis
    Li, Huan
    Zhao, Jin
    Xing, Yan
    Chen, Jia
    Wen, Ziying
    Ma, Rui
    Han, Fengxia
    Huang, Boyong
    Wang, Hao
    Li, Cui
    Chen, Yang
    Ning, Xiaoxuan
    BIOCHEMICAL GENETICS, 2024, 63 (3) : 2040 - 2060