Identification of Colon Immune Cell Marker Genes Using Machine Learning Methods

被引:0
|
作者
Yang, Yong [1 ]
Zhang, Yuhang [2 ]
Ren, Jingxin [3 ]
Feng, Kaiyan [4 ]
Li, Zhandong [5 ]
Huang, Tao [6 ,7 ]
Cai, Yudong [3 ]
机构
[1] Qianwei Hosp Jilin Prov, Changchun 130012, Peoples R China
[2] Harvard Med Sch, Brigham & Womens Hosp, Channing Div Network Med, Boston, MA 02115 USA
[3] Shanghai Univ, Sch Life Sci, Shanghai 200444, Peoples R China
[4] Guangdong AIB Polytech Coll, Dept Comp Sci, Guangzhou 510507, Peoples R China
[5] Jilin Engn Normal Univ, Coll Biol & Food Engn, Changchun 130052, Peoples R China
[6] Chinese Acad Sci, Univ Chinese Acad Sci, Shanghai Inst Nutr & Hlth, Biomed Big Data Ctr,CAS Key Lab Computat Biol, Shanghai 200031, Peoples R China
[7] Chinese Acad Sci, Univ Chinese Acad Sci, Shanghai Inst Nutr & Hlth, CAS Key Lab Tissue Microenvironm & Tumor, Shanghai 200031, Peoples R China
来源
LIFE-BASEL | 2023年 / 13卷 / 09期
关键词
colon immune cell; marker gene; machine learning; feature selection; NF-KAPPA-B; INFLAMMATORY FACTOR-I; J-CHAIN; FEATURE-SELECTION; T-CELLS; CANCER; ACTIVATION; EXPRESSION; PROTEIN; DRUG;
D O I
10.3390/life13091876
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Immune cell infiltration that occurs at the site of colon tumors influences the course of cancer. Different immune cell compositions in the microenvironment lead to different immune responses and different therapeutic effects. This study analyzed single-cell RNA sequencing data in a normal colon with the aim of screening genetic markers of 25 candidate immune cell types and revealing quantitative differences between them. The dataset contains 25 classes of immune cells, 41,650 cells in total, and each cell is expressed by 22,164 genes at the expression level. They were fed into a machine learning-based stream. The five feature ranking algorithms (last absolute shrinkage and selection operator, light gradient boosting machine, Monte Carlo feature selection, minimum redundancy maximum relevance, and random forest) were first used to analyze the importance of gene features, yielding five feature lists. Then, incremental feature selection and two classification algorithms (decision tree and random forest) were combined to filter the most important genetic markers from each list. For different immune cell subtypes, their marker genes, such as KLRB1 in CD4 T cells, RPL30 in B cell IGA plasma cells, and JCHAIN in IgG producing B cells, were identified. They were confirmed to be differentially expressed in different immune cells and involved in immune processes. In addition, quantitative rules were summarized by using the decision tree algorithm to distinguish candidate immune cell types. These results provide a reference for exploring the cell composition of the colon cancer microenvironment and for clinical immunotherapy.
引用
收藏
页数:20
相关论文
共 50 条
  • [1] Identification of Immune-Related Genes in Patients with Acute Myocardial Infarction Using Machine Learning Methods
    Zhu, Xu
    Yin, Ting
    Zhang, Ting
    Zhu, Qingqing
    Lu, Xinyi
    Wang, Luyang
    Liao, Shengen
    Yao, Wenming
    Zhou, Yanli
    Zhang, Haifeng
    Li, Xinli
    JOURNAL OF INFLAMMATION RESEARCH, 2022, 15 : 3305 - 3321
  • [2] Identification of COVID-19-Specific Immune Markers Using a Machine Learning Method
    Li, Hao
    Huang, Feiming
    Liao, Huiping
    Li, Zhandong
    Feng, Kaiyan
    Huang, Tao
    Cai, Yu-Dong
    FRONTIERS IN MOLECULAR BIOSCIENCES, 2022, 9
  • [3] Identification of new marker genes from plant single-cell RNA-seq data using interpretable machine learning methods
    Yan, Haidong
    Lee, Jiyoung
    Song, Qi
    Li, Qi
    Schiefelbein, John
    Zhao, Bingyu
    Li, Song
    NEW PHYTOLOGIST, 2022, 234 (04) : 1507 - 1520
  • [4] A Brief Survey for MicroRNA Precursor Identification Using Machine Learning Methods
    Guan, Zheng-Xing
    Li, Shi-Hao
    Zhang, Zi-Mei
    Zhang, Dan
    Yang, Hui
    Ding, Hui
    CURRENT GENOMICS, 2020, 21 (01) : 11 - 25
  • [5] Identification of the diagnostic genes and immune cell infiltration characteristics of gastric cancer using bioinformatics analysis and machine learning
    Xie, Rongjun
    Liu, Longfei
    Lu, Xianzhou
    He, Chengjian
    Li, Guoxin
    FRONTIERS IN GENETICS, 2023, 13
  • [6] Identification of genes related to immune enhancement caused by heterologous ChAdOx1-BNT162b2 vaccines in lymphocytes at single-cell resolution with machine learning methods
    Li, Jing
    Huang, FeiMing
    Ma, QingLan
    Guo, Wei
    Feng, KaiYan
    Huang, Tao
    Cai, Yu-Dong
    FRONTIERS IN IMMUNOLOGY, 2023, 14
  • [7] Prediction of core cancer genes using a hybrid of feature selection and machine learning methods
    Liu, Y. X.
    Zhang, N. N.
    He, Y.
    Lun, L. J.
    GENETICS AND MOLECULAR RESEARCH, 2015, 14 (03): : 8871 - 8882
  • [8] Identifying genes associated with resistance to KRAS G12C inhibitors via machine learning methods
    Lin, Xiandong
    Ma, QingLan
    Chen, Lei
    Guo, Wei
    Huang, Zhiyi
    Huang, Tao
    Cai, Yu-Dong
    BIOCHIMICA ET BIOPHYSICA ACTA-GENERAL SUBJECTS, 2023, 1867 (12):
  • [9] Identification of immune-related genes in diagnosing retinopathy of prematurity with sepsis through bioinformatics analysis and machine learning
    Chen, Han
    Chen, Enguang
    Lu, Yao
    Xu, Yu
    FRONTIERS IN GENETICS, 2023, 14
  • [10] Identification of Human Cell Cycle Phase Markers Based on Single-Cell RNA-Seq Data by Using Machine Learning Methods
    Huang, FeiMing
    Chen, Lei
    Guo, Wei
    Huang, Tao
    Cai, Yu-dong
    BIOMED RESEARCH INTERNATIONAL, 2022, 2022