Imputing single-cell RNA-seq data by considering cell heterogeneity and prior expression of dropouts

被引:21
|
作者
Zhang, Lihua [1 ,2 ]
Zhang, Shihua [1 ,2 ,3 ,4 ]
机构
[1] Chinese Acad Sci, Acad Math & Syst Sci, RCSDS, NCMIS,CEMS, Beijing 100190, Peoples R China
[2] Univ Chinese Acad Sci, Sch Math Sci, Beijing 100049, Peoples R China
[3] Chinese Acad Sci, Ctr Excellence Anim Evolut & Genet, Kunming 650223, Yunnan, Peoples R China
[4] Chinese Acad Sci, Univ Chinese Acad Sci, Hangzhou Inst Adv Study, Key Lab Syst Biol, Hangzhou 310024, Peoples R China
基金
中国国家自然科学基金; 国家重点研发计划;
关键词
single-cell RNA-seq; dropout; imputation; low-rank; systems biology;
D O I
10.1093/jmcb/mjaa052
中图分类号
Q2 [细胞生物学];
学科分类号
071009 ; 090102 ;
摘要
Single-cell RNA sequencing (scRNA-seq) provides a powerful tool to determine expression patterns of thousands of individual cells. However, the analysis of scRNA-seq data remains a computational challenge due to the high technical noise such as the presence of dropout events that lead to a large proportion of zeros for expressed genes. Taking into account the cell heterogeneity and the relationship between dropout rate and expected expression level, we present a cell sub-population based bounded low-rank (PBLR) method to impute the dropouts of scRNA-seq data. Through application to both simulated and real scRNA-seq datasets, PBLR is shown to be effective in recovering dropout events, and it can dramatically improve the low-dimensional representation and the recovery of genegene relationships masked by dropout events compared to several state-of-the-art methods. Moreover, PBLR also detects accurate and robust cell sub-populations automatically, shedding light on its flexibility and generality for scRNA-seq data analysis.
引用
收藏
页码:29 / 40
页数:12
相关论文
共 50 条
  • [41] SCell: integrated analysis of single-cell RNA-seq data
    Diaz, Aaron
    Liu, Siyuan J.
    Sandoval, Carmen
    Pollen, Alex
    Nowakowski, Tom J.
    Lim, Daniel A.
    Kriegstein, Arnold
    BIOINFORMATICS, 2016, 32 (14) : 2219 - 2220
  • [42] SCnorm: robust normalization of single-cell RNA-seq data
    Rhonda Bacher
    Li-Fang Chu
    Ning Leng
    Audrey P Gasch
    James A Thomson
    Ron M Stewart
    Michael Newton
    Christina Kendziorski
    Nature Methods, 2017, 14 : 584 - 586
  • [43] Tracking intratumoral heterogeneity in glioblastoma via regularized classification of single-cell RNA-Seq data
    Marta B. Lopes
    Susana Vinga
    BMC Bioinformatics, 21
  • [44] Tumor Heterogeneity and Molecular Characteristics of Glioblastoma Revealed by Single-Cell RNA-Seq Data Analysis
    Yesudhas, Dhanusha
    Dharshini, S. Akila Parvathy
    Taguchi, Y-H
    Gromiha, M. Michael
    GENES, 2022, 13 (03)
  • [45] Generalized Cell Type Annotation and Discovery for Single-Cell RNA-Seq Data
    Zhai, Yuyao
    Chen, Liang
    Deng, Minghua
    THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 4, 2023, : 5402 - 5410
  • [46] Realistic Cell Type Annotation and Discovery for Single-cell RNA-seq Data
    Zhai, Yuyao
    Chen, Liang
    Deng, Minghua
    PROCEEDINGS OF THE THIRTY-SECOND INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2023, 2023, : 4967 - 4974
  • [47] SCSA: A Cell Type Annotation Tool for Single-Cell RNA-seq Data
    Cao, Yinghao
    Wang, Xiaoyue
    Peng, Gongxin
    FRONTIERS IN GENETICS, 2020, 11
  • [48] Detecting heterogeneity in single-cell RNA-Seq data by non-negative matrix factorization
    Zhu, Xun
    Ching, Travers
    Pan, Xinghua
    Weissman, Sherman M.
    Garmire, Lana
    PEERJ, 2017, 5
  • [49] Robust identification of perturbed cell types in single-cell RNA-seq data
    Nicol, Phillip B.
    Paulson, Danielle
    Qian, Gege
    Liu, X. Shirley
    Irizarry, Rafael
    Sahu, Avinash D.
    NATURE COMMUNICATIONS, 2024, 15 (01)
  • [50] Tracking intratumoral heterogeneity in glioblastoma via regularized classification of single-cell RNA-Seq data
    Lopes, Marta B.
    Vinga, Susana
    BMC BIOINFORMATICS, 2020, 21 (01)