Identification of Colon Immune Cell Marker Genes Using Machine Learning Methods

被引:0
|
作者
Yang, Yong [1 ]
Zhang, Yuhang [2 ]
Ren, Jingxin [3 ]
Feng, Kaiyan [4 ]
Li, Zhandong [5 ]
Huang, Tao [6 ,7 ]
Cai, Yudong [3 ]
机构
[1] Qianwei Hosp Jilin Prov, Changchun 130012, Peoples R China
[2] Harvard Med Sch, Brigham & Womens Hosp, Channing Div Network Med, Boston, MA 02115 USA
[3] Shanghai Univ, Sch Life Sci, Shanghai 200444, Peoples R China
[4] Guangdong AIB Polytech Coll, Dept Comp Sci, Guangzhou 510507, Peoples R China
[5] Jilin Engn Normal Univ, Coll Biol & Food Engn, Changchun 130052, Peoples R China
[6] Chinese Acad Sci, Univ Chinese Acad Sci, Shanghai Inst Nutr & Hlth, Biomed Big Data Ctr,CAS Key Lab Computat Biol, Shanghai 200031, Peoples R China
[7] Chinese Acad Sci, Univ Chinese Acad Sci, Shanghai Inst Nutr & Hlth, CAS Key Lab Tissue Microenvironm & Tumor, Shanghai 200031, Peoples R China
来源
LIFE-BASEL | 2023年 / 13卷 / 09期
关键词
colon immune cell; marker gene; machine learning; feature selection; NF-KAPPA-B; INFLAMMATORY FACTOR-I; J-CHAIN; FEATURE-SELECTION; T-CELLS; CANCER; ACTIVATION; EXPRESSION; PROTEIN; DRUG;
D O I
10.3390/life13091876
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Immune cell infiltration that occurs at the site of colon tumors influences the course of cancer. Different immune cell compositions in the microenvironment lead to different immune responses and different therapeutic effects. This study analyzed single-cell RNA sequencing data in a normal colon with the aim of screening genetic markers of 25 candidate immune cell types and revealing quantitative differences between them. The dataset contains 25 classes of immune cells, 41,650 cells in total, and each cell is expressed by 22,164 genes at the expression level. They were fed into a machine learning-based stream. The five feature ranking algorithms (last absolute shrinkage and selection operator, light gradient boosting machine, Monte Carlo feature selection, minimum redundancy maximum relevance, and random forest) were first used to analyze the importance of gene features, yielding five feature lists. Then, incremental feature selection and two classification algorithms (decision tree and random forest) were combined to filter the most important genetic markers from each list. For different immune cell subtypes, their marker genes, such as KLRB1 in CD4 T cells, RPL30 in B cell IGA plasma cells, and JCHAIN in IgG producing B cells, were identified. They were confirmed to be differentially expressed in different immune cells and involved in immune processes. In addition, quantitative rules were summarized by using the decision tree algorithm to distinguish candidate immune cell types. These results provide a reference for exploring the cell composition of the colon cancer microenvironment and for clinical immunotherapy.
引用
收藏
页数:20
相关论文
共 50 条
  • [21] Identification of hub genes and their correlation with immune infiltration in coronary artery disease through bioinformatics and machine learning methods
    Huang, Ke-Ke
    Zheng, Hui-Lei
    Li, Shuo
    Zeng, Zhi-Yu
    JOURNAL OF THORACIC DISEASE, 2022, 14 (07) : 2621 - 2634
  • [22] Sensitivity analysis based on the random forest machine learning algorithm identifies candidate genes for regulation of innate and adaptive immune response of chicken
    Polewko-Klim, Aneta
    Lesinski, Wojciech
    Golinska, Agnieszka Kitlas
    Mnich, Krzysztof
    Siwek, Maria
    Rudnicki, Witold R.
    POULTRY SCIENCE, 2020, 99 (12) : 6341 - 6354
  • [23] Identification of the most important features of knee osteoarthritis structural progressors using machine learning methods
    Jamshidi, Afshin
    Leclercq, Mickael
    Labbe, Aurelie
    Pelletier, Jean-Pierre
    Abram, Francois
    Droit, Arnaud
    Martel-Pelletier, Johanne
    THERAPEUTIC ADVANCES IN MUSCULOSKELETAL DISEASE, 2020, 12
  • [24] Identification and validation of key biomarkers associated with immune and oxidative stress for preeclampsia by WGCNA and machine learning
    Yu, Tiantian
    Wang, Guiying
    Xu, Xia
    Yan, Jianying
    FRONTIERS IN GENETICS, 2025, 16
  • [25] Identification the immune related marker genes and transcription-factor network in ruptured cerebral aneurysms using bioinformatics analysis and machine-learning strategies
    Zhao, Xiang
    Fu, Jinxing
    Lei, Chao
    Wang, Zhaochen
    Jing, Zhitao
    BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2024, 88
  • [26] Machine Learning in Identifying Marker Genes for Congenital Heart Diseases of Different Cardiac Cell Types
    Ma, Qinglan
    Zhang, Yu-Hang
    Guo, Wei
    Feng, Kaiyan
    Huang, Tao
    Cai, Yu-Dong
    LIFE-BASEL, 2024, 14 (08):
  • [27] Recent Advances on Antioxidant Identification Based on Machine Learning Methods
    Feng, Pengmian
    Feng, Lijing
    CURRENT DRUG METABOLISM, 2020, 21 (10) : 804 - 809
  • [28] Machine learning methods revealed the roles of immune-metabolism related genes in immune infiltration, stemness, and prognosis of neuroblastoma
    Mu, Jianhua
    Gong, Jianan
    Lin, Peng
    Zhang, Mengzhen
    Wu, Kai
    CANCER BIOMARKERS, 2023, 38 (02) : 241 - 259
  • [29] Recognition of Immune Cell Markers of COVID-19 Severity with Machine Learning Methods
    Chen, Lei
    Mei, Zi
    Guo, Wei
    Ding, ShiJian
    Huang, Tao
    Cai, Yu-Dong
    BIOMED RESEARCH INTERNATIONAL, 2022, 2022
  • [30] Identification of synthetic lethality based on a functional network by using machine learning algorithms
    Li, JiaRui
    Lu, Lin
    Zhang, Yu-Hang
    Liu, Min
    Chen, Lei
    Huang, Tao
    Cai, Yu-Dong
    JOURNAL OF CELLULAR BIOCHEMISTRY, 2019, 120 (01) : 405 - 416