Identification of COVID-19 severity biomarkers based on feature selection on single-cell RNA-Seq data of CD8+ T cells

被引:6
|
作者
Lu, Jian [1 ,2 ]
Meng, Mei [3 ]
Zhou, XianChao [3 ]
Ding, Shijian [4 ]
Feng, KaiYan [5 ]
Zeng, Zhenbing [1 ]
Huang, Tao [2 ,6 ]
Cai, Yu-Dong [4 ]
机构
[1] Shanghai Univ, Sch Sci, Dept Math, Shanghai, Peoples R China
[2] Univ Chinese Acad Sci, Chinese Acad Sci, Shanghai Inst Nutr & Hlth, Biomed Big Data Ctr,CAS Key Lab Computat Biol, Shanghai, Peoples R China
[3] Shanghai Jiao Tong Univ, Ctr Single Cell Om, Sch Publ Hlth, Sch Med,State Key Lab Oncogenes & Related Genes, Shanghai, Peoples R China
[4] Shanghai Univ, Sch Life Sci, Shanghai, Peoples R China
[5] Guangdong AIB Polytech Coll, Dept Comp Sci, Guangzhou, Peoples R China
[6] Univ Chinese Acad Sci, Shanghai Inst Nutr & Hlth, Chinese Acad Sci, CAS Key Lab Tissue Microenvironm & Tumor, Shanghai, Peoples R China
关键词
COVID-19; severity; CD8(+) T cell; single-cell; feature selection; ANTIGEN;
D O I
10.3389/fgene.2022.1053772
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
The global outbreak of the COVID-19 epidemic has become a major public health problem. COVID-19 virus infection triggers a complex immune response. CD8(+) T cells, in particular, play an essential role in controlling the severity of the disease. However, the mechanism of the regulatory role of CD8(+) T cells on COVID-19 remains poorly investigated. In this study, single-cell gene expression profiles from three CD8(+) T cell subtypes (effector, memory, and naive T cells) were downloaded. Each cell subtype included three disease states, namely, acute COVID-19, convalescent COVID-19, and unexposed individuals. The profiles on each cell subtype were individually analyzed in the same way. Irrelevant features in the profiles were first excluded by the Boruta method. The remaining features for each CD8(+) T cells subtype were further analyzed by Max-Relevance and Min-Redundancy, Monte Carlo feature selection, and light gradient boosting machine methods to obtain three feature lists. These lists were then brought into the incremental feature selection method to determine the optimal features for each cell subtype. Their corresponding genes may be latent biomarkers to determine COVID-19 severity. Genes, such as ZFP36, DUSP1, TCR, and IL7R, can be confirmed to play an immune regulatory role in COVID-19 infection and recovery. The results of functional enrichment analysis revealed that these important genes may be associated with immune functions, such as response to cAMP, response to virus, T cell receptor complex, T cell activation, and T cell differentiation. This study further set up different gene expression pattens, represented by classification rules, on three states of COVID-19 and constructed several efficient classifiers to distinguish COVID-19 severity. The findings of this study provided new insights into the biological processes of CD8(+) T cells in regulating the immune response.
引用
收藏
页数:15
相关论文
共 50 条
  • [1] FEATS: feature selection-based clustering of single-cell RNA-seq data
    Vans, Edwin
    Patil, Ashwini
    Sharma, Alok
    BRIEFINGS IN BIOINFORMATICS, 2021, 22 (04)
  • [2] Feature Selection in Single-Cell RNA-seq Data via a Genetic Algorithm
    Chatzilygeroudis, Konstantinos I.
    Vrahatis, Aristidis G.
    Tasoulis, Sotiris K.
    Vrahatis, Michael N.
    LEARNING AND INTELLIGENT OPTIMIZATION, LION 15, 2021, 12931 : 66 - 79
  • [3] scFSNN: a feature selection method based on neural network for single-cell RNA-seq data
    Minjiao Peng
    Baoqin Lin
    Jun Zhang
    Yan Zhou
    Bingqing Lin
    BMC Genomics, 25
  • [4] scFSNN: a feature selection method based on neural network for single-cell RNA-seq data
    Peng, Minjiao
    Lin, Baoqin
    Zhang, Jun
    Zhou, Yan
    Lin, Bingqing
    BMC GENOMICS, 2024, 25 (01)
  • [5] Accurate feature selection improves single-cell RNA-seq cell clustering
    Su, Kenong
    Yu, Tianwei
    Wu, Hao
    BRIEFINGS IN BIOINFORMATICS, 2021, 22 (05)
  • [6] Feature selection and dimension reduction for single-cell RNA-Seq based on a multinomial model
    Townes, F. William
    Hicks, Stephanie C.
    Aryee, Martin J.
    Irizarry, Rafael A.
    GENOME BIOLOGY, 2019, 20 (01)
  • [7] Feature selection and dimension reduction for single-cell RNA-Seq based on a multinomial model
    F. William Townes
    Stephanie C. Hicks
    Martin J. Aryee
    Rafael A. Irizarry
    Genome Biology, 20
  • [8] Predicting COVID-19 Severity Integrating RNA-Seq Data Using Machine Learning Techniques
    Bajo-Morales, Javier
    Castillo-Secilla, Daniel
    Herrera, Luis Javier
    Caba, Octavio
    Prados, Jose Carlos
    Rojas, Ignacio
    CURRENT BIOINFORMATICS, 2023, 18 (03) : 221 - 231
  • [9] COMSE: analysis of single-cell RNA-seq data using community detection-based feature selection
    Luo, Qinhuan
    Chen, Yaozhu
    Lan, Xun
    BMC BIOLOGY, 2024, 22 (01)
  • [10] CellHeap: A Workflow for Optimizing COVID-19 Single-Cell RNA-Seq Data Processing in the Santos Dumont Supercomputer
    Silva, Vanessa S.
    Costa, Maiana O. C.
    Castro, Maria Clicia S.
    Silva, Helena S.
    Walter, Maria Emilia M. T.
    Melo, Alba C. M. A.
    Ocana, Kary A. C.
    dos Santos, Marcelo T.
    Nicolas, Marisa F.
    Carvalho, Anna Cristina C.
    Henriques-Pons, Andrea
    Silva, Fabricio A. B.
    ADVANCES IN BIOINFORMATICS AND COMPUTATIONAL BIOLOGY, BSB 2021, 2021, 13063 : 41 - 52