Identification of COVID-19 severity biomarkers based on feature selection on single-cell RNA-Seq data of CD8+ T cells

被引:6
|
作者
Lu, Jian [1 ,2 ]
Meng, Mei [3 ]
Zhou, XianChao [3 ]
Ding, Shijian [4 ]
Feng, KaiYan [5 ]
Zeng, Zhenbing [1 ]
Huang, Tao [2 ,6 ]
Cai, Yu-Dong [4 ]
机构
[1] Shanghai Univ, Sch Sci, Dept Math, Shanghai, Peoples R China
[2] Univ Chinese Acad Sci, Chinese Acad Sci, Shanghai Inst Nutr & Hlth, Biomed Big Data Ctr,CAS Key Lab Computat Biol, Shanghai, Peoples R China
[3] Shanghai Jiao Tong Univ, Ctr Single Cell Om, Sch Publ Hlth, Sch Med,State Key Lab Oncogenes & Related Genes, Shanghai, Peoples R China
[4] Shanghai Univ, Sch Life Sci, Shanghai, Peoples R China
[5] Guangdong AIB Polytech Coll, Dept Comp Sci, Guangzhou, Peoples R China
[6] Univ Chinese Acad Sci, Shanghai Inst Nutr & Hlth, Chinese Acad Sci, CAS Key Lab Tissue Microenvironm & Tumor, Shanghai, Peoples R China
关键词
COVID-19; severity; CD8(+) T cell; single-cell; feature selection; ANTIGEN;
D O I
10.3389/fgene.2022.1053772
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
The global outbreak of the COVID-19 epidemic has become a major public health problem. COVID-19 virus infection triggers a complex immune response. CD8(+) T cells, in particular, play an essential role in controlling the severity of the disease. However, the mechanism of the regulatory role of CD8(+) T cells on COVID-19 remains poorly investigated. In this study, single-cell gene expression profiles from three CD8(+) T cell subtypes (effector, memory, and naive T cells) were downloaded. Each cell subtype included three disease states, namely, acute COVID-19, convalescent COVID-19, and unexposed individuals. The profiles on each cell subtype were individually analyzed in the same way. Irrelevant features in the profiles were first excluded by the Boruta method. The remaining features for each CD8(+) T cells subtype were further analyzed by Max-Relevance and Min-Redundancy, Monte Carlo feature selection, and light gradient boosting machine methods to obtain three feature lists. These lists were then brought into the incremental feature selection method to determine the optimal features for each cell subtype. Their corresponding genes may be latent biomarkers to determine COVID-19 severity. Genes, such as ZFP36, DUSP1, TCR, and IL7R, can be confirmed to play an immune regulatory role in COVID-19 infection and recovery. The results of functional enrichment analysis revealed that these important genes may be associated with immune functions, such as response to cAMP, response to virus, T cell receptor complex, T cell activation, and T cell differentiation. This study further set up different gene expression pattens, represented by classification rules, on three states of COVID-19 and constructed several efficient classifiers to distinguish COVID-19 severity. The findings of this study provided new insights into the biological processes of CD8(+) T cells in regulating the immune response.
引用
收藏
页数:15
相关论文
共 50 条
  • [21] Single-cell RNA-seq public data reveal the gene regulatory network landscape of respiratory epithelial and peripheral immune cells in COVID-19 patients
    Zhang, Lin
    Nishi, Hafumi
    Kinoshita, Kengo
    FRONTIERS IN IMMUNOLOGY, 2023, 14
  • [22] Data-driven selection of analysis decisions in single-cell RNA-seq trajectory inference
    Dong, Xiaoru
    Leary, Jack R.
    Yang, Chuanhao
    Brusko, Maigan A.
    Brusko, Todd M.
    Bacher, Rhonda
    BRIEFINGS IN BIOINFORMATICS, 2024, 25 (03)
  • [23] Activation or exhaustion of CD8+ T cells in patients with COVID-19
    Min-Seok Rha
    Eui-Cheol Shin
    Cellular & Molecular Immunology, 2021, 18 : 2325 - 2333
  • [24] Early Response of CD8+ T Cells in COVID-19 Patients
    Ramljak, Deni
    Vukoja, Martina
    Curlin, Marina
    Vukojevic, Katarina
    Barbaric, Maja
    Glamoclija, Una
    Purisevic, Bejana
    Peric, Olivera
    Soljic, Violeta
    JOURNAL OF PERSONALIZED MEDICINE, 2021, 11 (12):
  • [25] The characterization of CD8+ T-cell responses in COVID-19
    Yang, Yuanting
    Miller, Heather
    Byazrova, Maria G.
    Cndotti, Fabio
    Benlagha, Kamel
    Camara, Niels Olsen Saraiva
    Shi, Junming
    Forsman, Huamei
    Lee, Pamela
    Yang, Lu
    Filatov, Alexander
    Zhai, Zhimin
    Liu, Chaohong
    EMERGING MICROBES & INFECTIONS, 2024, 13 (01)
  • [26] Category encoding method to select feature genes for the classification of bulk and single-cell RNA-seq data
    Zhou, Yan
    Zhang, Li
    Xu, Jinfeng
    Zhang, Jun
    Yan, Xiaodong
    STATISTICS IN MEDICINE, 2021, 40 (18) : 4077 - 4089
  • [27] Single_cell_GRN: gene regulatory network identification based on supervised learning method and Single-cell RNA-seq data
    Bin Yang
    Wenzheng Bao
    Baitong Chen
    Dan Song
    BioData Mining, 15
  • [28] Single_cell_GRN: gene regulatory network identification based on supervised learning method and Single-cell RNA-seq data
    Bin Yang
    Bao, Wenzheng
    Chen, Baitong
    Song, Dan
    BIODATA MINING, 2022, 15 (01)
  • [29] A Machine Learning Model for the Prediction of COVID-19 Severity Using RNA-Seq, Clinical, and Co-Morbidity Data
    Sethi, Sahil
    Shakyawar, Sushil
    Reddy, Athreya S.
    Patel, Jai Chand
    Guda, Chittibabu
    DIAGNOSTICS, 2024, 14 (12)
  • [30] Gene Expression-Based Identification of Antigen-Responsive CD8+ T Cells on a Single-Cell Level
    Fuchs, Yannick F.
    Sharma, Virag
    Eugster, Anne
    Kraus, Gloria
    Morgenstern, Robert
    Dahl, Andreas
    Reinhardt, Susanne
    Petzold, Andreas
    Lindner, Annett
    Loebel, Doreen
    Bonifacio, Ezio
    FRONTIERS IN IMMUNOLOGY, 2019, 10