Identification of Colon Immune Cell Marker Genes Using Machine Learning Methods

被引:0
|
作者
Yang, Yong [1 ]
Zhang, Yuhang [2 ]
Ren, Jingxin [3 ]
Feng, Kaiyan [4 ]
Li, Zhandong [5 ]
Huang, Tao [6 ,7 ]
Cai, Yudong [3 ]
机构
[1] Qianwei Hosp Jilin Prov, Changchun 130012, Peoples R China
[2] Harvard Med Sch, Brigham & Womens Hosp, Channing Div Network Med, Boston, MA 02115 USA
[3] Shanghai Univ, Sch Life Sci, Shanghai 200444, Peoples R China
[4] Guangdong AIB Polytech Coll, Dept Comp Sci, Guangzhou 510507, Peoples R China
[5] Jilin Engn Normal Univ, Coll Biol & Food Engn, Changchun 130052, Peoples R China
[6] Chinese Acad Sci, Univ Chinese Acad Sci, Shanghai Inst Nutr & Hlth, Biomed Big Data Ctr,CAS Key Lab Computat Biol, Shanghai 200031, Peoples R China
[7] Chinese Acad Sci, Univ Chinese Acad Sci, Shanghai Inst Nutr & Hlth, CAS Key Lab Tissue Microenvironm & Tumor, Shanghai 200031, Peoples R China
来源
LIFE-BASEL | 2023年 / 13卷 / 09期
关键词
colon immune cell; marker gene; machine learning; feature selection; NF-KAPPA-B; INFLAMMATORY FACTOR-I; J-CHAIN; FEATURE-SELECTION; T-CELLS; CANCER; ACTIVATION; EXPRESSION; PROTEIN; DRUG;
D O I
10.3390/life13091876
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Immune cell infiltration that occurs at the site of colon tumors influences the course of cancer. Different immune cell compositions in the microenvironment lead to different immune responses and different therapeutic effects. This study analyzed single-cell RNA sequencing data in a normal colon with the aim of screening genetic markers of 25 candidate immune cell types and revealing quantitative differences between them. The dataset contains 25 classes of immune cells, 41,650 cells in total, and each cell is expressed by 22,164 genes at the expression level. They were fed into a machine learning-based stream. The five feature ranking algorithms (last absolute shrinkage and selection operator, light gradient boosting machine, Monte Carlo feature selection, minimum redundancy maximum relevance, and random forest) were first used to analyze the importance of gene features, yielding five feature lists. Then, incremental feature selection and two classification algorithms (decision tree and random forest) were combined to filter the most important genetic markers from each list. For different immune cell subtypes, their marker genes, such as KLRB1 in CD4 T cells, RPL30 in B cell IGA plasma cells, and JCHAIN in IgG producing B cells, were identified. They were confirmed to be differentially expressed in different immune cells and involved in immune processes. In addition, quantitative rules were summarized by using the decision tree algorithm to distinguish candidate immune cell types. These results provide a reference for exploring the cell composition of the colon cancer microenvironment and for clinical immunotherapy.
引用
收藏
页数:20
相关论文
共 50 条
  • [31] Discovering the molecular differences between right- and left-sided colon cancer using machine learning methods
    Jiang, Yimei
    Yan, Xiaowei
    Liu, Kun
    Shi, Yiqing
    Wang, Changgang
    Hu, Jiele
    Li, You
    Wu, Qinghua
    Xiang, Ming
    Zhao, Ren
    BMC CANCER, 2020, 20 (01)
  • [32] Identification of an immune-related gene panel for the diagnosis of pulmonary arterial hypertension using bioinformatics and machine learning
    Xiong, Pan
    Huang, Qiuhong
    Mao, Yang
    Qian, Hang
    Yang, Yi
    Mou, Ziye
    Deng, Xiaohui
    Wang, Guansong
    He, Binfeng
    You, Zaichun
    INTERNATIONAL IMMUNOPHARMACOLOGY, 2025, 144
  • [33] Prediction of Essential Genes in Comparison States Using Machine Learning
    Xie, Jiang
    Zhao, Chang
    Sun, Jiamin
    Li, Jiaxin
    Yang, Fuzhang
    Wang, Jiao
    Nie, Qing
    IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2021, 18 (05) : 1784 - 1792
  • [34] Establishment of a nomogram model based on immune-related genes using machine learning for aortic dissection and immunomodulation assessment
    Hou, Yanjun
    Zhao, Yangyang
    Shi, Zhensu
    Pan, Yipeng
    Shi, Kaijia
    Zhao, Chaoyang
    Liu, Shengnan
    Chen, Yongkun
    Zhao, Lini
    Wu, Jizhen
    Ge, Guangquan
    Jie, Wei
    INTERNATIONAL JOURNAL OF MEDICAL SCIENCES, 2025, 22 (04): : 873 - 886
  • [35] Machine Learning-Based identification of resistance genes associated with sunflower broomrape
    Yingxue Che
    Congzi Zhang
    Jixiang Xing
    Qilemuge Xi
    Ying Shao
    Lingmin Zhao
    Shuchun Guo
    Yongchun Zuo
    Plant Methods, 21 (1)
  • [36] Identification of driver genes in lupus nephritis based on comprehensive bioinformatics and machine learning
    Wang, Zheng
    Hu, Danni
    Pei, Guangchang
    Zeng, Rui
    Yao, Ying
    FRONTIERS IN IMMUNOLOGY, 2023, 14
  • [37] Integrating network, sequence and functional features using machine learning approaches towards identification of novel Alzheimer genes
    Jamal, Salma
    Goyal, Sukriti
    Shanker, Asheesh
    Grover, Abhinav
    BMC GENOMICS, 2016, 17
  • [38] Identification of miRNA Biomarkers for Diverse Cancer Types Using Statistical Learning Methods at the Whole-Genome Scale
    Sarkar, Jnanendra Prasad
    Saha, Indrajit
    Lancucki, Adrian
    Ghosh, Nimisha
    Wlasnowolski, Michal
    Bokota, Grzegorz
    Dey, Ashmita
    Lipinski, Piotr
    Plewczynski, Dariusz
    FRONTIERS IN GENETICS, 2020, 11
  • [39] Feature genes identification and immune infiltration assessment in abdominal aortic aneurysm using WGCNA and machine learning algorithms
    Xie, Ming
    Li, Xiandeng
    Qi, Congwei
    Zhang, Yufeng
    Li, Gang
    Xue, Yong
    Chen, Guobao
    FRONTIERS IN CARDIOVASCULAR MEDICINE, 2024, 11
  • [40] Integrative identification of immune-related key genes in atrial fibrillation using weighted gene coexpression network analysis and machine learning
    Zheng, Peng-Fei
    Chen, Lu-Zhu
    Liu, Peng
    Liu, Zheng-Yu
    Pan, Hong Wei
    FRONTIERS IN CARDIOVASCULAR MEDICINE, 2022, 9