Characterization of spleen and lymph node cell types via CITE-seq and machine learning methods

被引:4
作者
Li, Hao [1 ]
Wang, Deling [2 ]
Zhou, Xianchao [3 ]
Ding, Shijian [4 ]
Guo, Wei [5 ]
Zhang, Shiqi [6 ]
Li, Zhandong [1 ]
Huang, Tao [7 ,8 ]
Cai, Yu-Dong [4 ]
机构
[1] Jilin Engn Normal Univ, Coll Biol & Food Engn, Changchun, Peoples R China
[2] Sun Yat sen Univ, Collaborat Innovat Ctr Canc Med, State Key Lab Oncol South China, Dept Radiol,Canc Ctr, Guangzhou, Peoples R China
[3] Shanghai Jiao Tong Univ, Ctr Single Cell Om, Sch Publ Hlth, Sch Med, Shanghai, Peoples R China
[4] Shanghai Univ, Sch Life Sci, Shanghai, Peoples R China
[5] Shanghai Jiao Tong Univ, Shanghai Inst Biol Sci SIBS, Chinese Acad Sci CAS, Key Lab Stem Cell Biol,Sch Med SJTUSM, Shanghai, Peoples R China
[6] Univ Copenhagen, Dept Biostat, Copenhagen, Denmark
[7] Univ Chinese Acad Sci, Shanghai Inst Nutr & Hlth, Chinese Acad Sci, Biomed Big Data Ctr,CAS Key Lab Computational Biol, Shanghai, Peoples R China
[8] Univ Chinese Acad Sci, Shanghai Inst Nutr & Hlth, Chinese Acad Sci, CAS Key Lab Tissue Microenvironm & Tumor, Shanghai, Peoples R China
来源
FRONTIERS IN MOLECULAR NEUROSCIENCE | 2022年 / 15卷
基金
国家重点研发计划;
关键词
spleen and lymph; machine learning algorithm; feature analysis; deep forest; decision rule; NATURAL-KILLER-CELLS; FEATURE-SELECTION; B-CELLS; CD24; CD43; DIFFERENTIATION; EXPRESSION; RELEVANCE; BIOMARKER; RESPONSES;
D O I
10.3389/fnmol.2022.1033159
中图分类号
Q189 [神经科学];
学科分类号
071006 ;
摘要
The spleen and lymph nodes are important functional organs for human immune system. The identification of cell types for spleen and lymph nodes is helpful for understanding the mechanism of immune system. However, the cell types of spleen and lymph are highly diverse in the human body. Therefore, in this study, we employed a series of machine learning algorithms to computationally analyze the cell types of spleen and lymph based on single-cell CITE-seq sequencing data. A total of 28,211 cell data (training vs. test = 14,435 vs. 13,776) involving 24 cell types were collected for this study. For the training dataset, it was analyzed by Boruta and minimum redundancy maximum relevance (mRMR) one by one, resulting in an mRMR feature list. This list was fed into the incremental feature selection (IFS) method, incorporating four classification algorithms (deep forest, random forest, K-nearest neighbor, and decision tree). Some essential features were discovered and the deep forest with its optimal features achieved the best performance. A group of related proteins (CD4, TCRb, CD103, CD43, and CD23) and genes (Nkg7 and Thy1) contributing to the classification of spleen and lymph nodes cell types were analyzed. Furthermore, the classification rules yielded by decision tree were also provided and analyzed. Above findings may provide helpful information for deepening our understanding on the diversity of cell types.
引用
收藏
页数:13
相关论文
共 76 条
[1]   Functional expression of CD43 on human natural killer cells [J].
Aguado, E ;
Santamaría, M ;
Gallego, MD ;
Peña, J ;
Molina, IJ .
JOURNAL OF LEUKOCYTE BIOLOGY, 1999, 66 (06) :923-929
[2]   Essential role for CD103 in the T cell-mediated regulation of experimental colitis [J].
Annacker, O ;
Coombes, JL ;
Malmstrom, V ;
Uhlig, HH ;
Bourne, T ;
Johansson-Lindbom, B ;
Agace, WW ;
Parker, CM ;
Powrie, F .
JOURNAL OF EXPERIMENTAL MEDICINE, 2005, 202 (08) :1051-1061
[3]   A deep learning architecture for metabolic pathway prediction [J].
Baranwal, Mayank ;
Magner, Abram ;
Elvati, Paolo ;
Saldinger, Jacob ;
Violi, Angela ;
Hero, Alfred O. .
BIOINFORMATICS, 2020, 36 (08) :2547-2553
[4]   The who, how and where of antigen presentation to B cells [J].
Batista, Facundo D. ;
Harwood, Naomi E. .
NATURE REVIEWS IMMUNOLOGY, 2009, 9 (01) :15-27
[5]   The double life of a B-1 cell: self-reactivity selects for protective effector functions [J].
Baumgarth, Nicole .
NATURE REVIEWS IMMUNOLOGY, 2011, 11 (01) :34-46
[6]  
Baxter A., 2016, EUR J IMMUNOL, V46
[7]   Random forests [J].
Breiman, L .
MACHINE LEARNING, 2001, 45 (01) :5-32
[8]   Application of Random Forests Methods to Diabetic Retinopathy Classification Analyses [J].
Casanova, Ramon ;
Saldana, Santiago ;
Chew, Emily Y. ;
Danis, Ronald P. ;
Greven, Craig M. ;
Ambrosius, Walter T. .
PLOS ONE, 2014, 9 (06)
[9]   Marginal zone B cells: virtues of innate-like antibody-producing lymphocytes [J].
Cerutti, Andrea ;
Cols, Montserrat ;
Puga, Irene .
NATURE REVIEWS IMMUNOLOGY, 2013, 13 (02) :118-132
[10]   Stromal infrastructure of the lymph node and coordination of immunity [J].
Chang, Jonathan E. ;
Turley, Shannon J. .
TRENDS IN IMMUNOLOGY, 2015, 36 (01) :30-39