HArmonized single-cell RNA-seq Cell type Assisted Deconvolution (HASCAD)

Cited by: 0
Authors
Chiu, Yen-Jung [1, 2]
Ni, Chung-En [1]
Huang, Yen-Hua [1, 3]
Affiliations
[1] Natl Yang Ming Chiao Tung Univ, Inst Biomed Informat, Taipei 112, Taiwan
[2] Ming Chuan Univ, Dept Biomed Engn, Taoyuan 333, Taiwan
[3] Natl Yang Ming Chiao Tung Univ, Ctr Syst & Synthet Biol, Taipei 112, Taiwan
Keywords
Harmonization; Cell composition deconvolution; RNA-seq; Deep learning; Cancer
DOI
10.1186/s12920-023-01674-w
Chinese Library Classification
Q3 [Genetics]
Discipline classification codes
071007; 090102
Abstract
Background: Cell composition deconvolution (CCD) is a bioinformatic task that estimates cell fractions from bulk gene expression profiles, such as RNA-seq. Many CCD models perform linear regression using reference gene expression signatures of distinct cell types. These signatures can be generated from cell-specific gene expression profiles, such as scRNA-seq. However, the batch effects and dropout events frequently observed across scRNA-seq datasets have limited the performance of CCD methods.

Methods: We developed a deep neural network (DNN) model, HASCAD, to predict the cell fractions of up to 15 immune cell types. HASCAD was trained on bulk RNA-seq simulated from three scRNA-seq datasets that had been normalized using a Harmony-Symphony based strategy. Mean square error and Pearson correlation coefficient were used to compare the performance of HASCAD with that of other widely used CCD methods. Two types of datasets, a set of simulated bulk RNA-seq and three human PBMC RNA-seq datasets, were used for the benchmarks.

Results: HASCAD is useful for investigating how immune cell heterogeneity affects the therapeutic effects of immune checkpoint inhibitors, since its target cell types include those known to play a role in anti-tumor immunity, such as three subtypes of CD8 T cells and three subtypes of CD4 T cells. We found that removing batch effects from the reference scRNA-seq datasets could benefit the CCD task. Our benchmarks showed that HASCAD is more suitable for analyzing bulk RNA-seq data than two widely used CCD methods, CIBERSORTx and quanTIseq. We applied HASCAD to the liver cancer samples of TCGA-LIHC and found that the predicted abundances of Treg and effector CD8 T cells were significantly associated with patients' overall survival.

Conclusion: HASCAD could predict the cell composition of PBMC bulk RNA-seq and classify the cell type from pure bulk RNA-seq. The HASCAD model is available at https://github.com/holiday01/HASCAD.
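
The Methods paragraph mentions training on bulk RNA-seq simulated from scRNA-seq and scoring predictions with mean square error and Pearson correlation. The following is a minimal sketch of that general idea, not the authors' pipeline: the function names, the Dirichlet sampling of target fractions, the toy Poisson data, and all sizes are assumptions made only for illustration.

```python
import numpy as np


def simulate_pseudo_bulk(sc_expr, cell_labels, n_types, n_samples,
                         cells_per_sample, rng):
    """Build pseudo-bulk profiles by summing randomly sampled single cells.

    sc_expr:     (n_cells, n_genes) expression matrix (hypothetical input).
    cell_labels: (n_cells,) integer cell-type labels in [0, n_types).
    Returns (bulk, fractions) with shapes (n_samples, n_genes) and
    (n_samples, n_types). Every cell type must have at least one cell.
    """
    n_genes = sc_expr.shape[1]
    bulk = np.zeros((n_samples, n_genes))
    fractions = np.zeros((n_samples, n_types))
    for i in range(n_samples):
        # Assumption: target proportions are drawn from a flat Dirichlet.
        target = rng.dirichlet(np.ones(n_types))
        counts = rng.multinomial(cells_per_sample, target)
        for t in range(n_types):
            if counts[t] == 0:
                continue
            pool = np.flatnonzero(cell_labels == t)
            picked = rng.choice(pool, size=counts[t], replace=True)
            bulk[i] += sc_expr[picked].sum(axis=0)
        # Ground-truth fractions are the realized cell counts per type.
        fractions[i] = counts / cells_per_sample
    return bulk, fractions


def mean_square_error(true, pred):
    """Mean square error over all samples and cell types."""
    return float(np.mean((true - pred) ** 2))


def pearson_correlation(true, pred):
    """Pearson correlation between flattened true and predicted fractions."""
    return float(np.corrcoef(true.ravel(), pred.ravel())[0, 1])


# Toy data standing in for a labelled scRNA-seq reference (not real data).
rng = np.random.default_rng(0)
n_cells, n_genes, n_types = 300, 50, 3
labels = np.arange(n_cells) % n_types
type_means = rng.gamma(2.0, 2.0, size=(n_types, n_genes))
sc_expr = rng.poisson(type_means[labels]).astype(float)

bulk, fractions = simulate_pseudo_bulk(
    sc_expr, labels, n_types, n_samples=20, cells_per_sample=100, rng=rng
)

# A uniform guess serves as a trivial baseline to score against the truth.
baseline = np.full_like(fractions, 1.0 / n_types)
baseline_mse = mean_square_error(fractions, baseline)
```

In a setup like this, the `(bulk, fractions)` pairs would serve as training inputs and targets for a regression model, and the two metric functions would compare any method's predicted fractions against the known ones.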
Pages: 16