TsImpute: an accurate two-step imputation method for single-cell RNA-seq data

被引:7
作者
Zheng, Weihua [1 ]
Min, Wenwen [1 ,2 ]
Wang, Shunfang [1 ,2 ]
机构
[1] Yunnan Univ, Sch Informat Sci & Engn, Dept Comp Sci & Engn, Kunming 650504, Peoples R China
[2] Yunnan Univ, Yunnan Key Lab Intelligent Syst & Comp, Kunming 650504, Peoples R China
基金
中国国家自然科学基金;
关键词
GENE-EXPRESSION;
D O I
10.1093/bioinformatics/btad731
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation Single-cell RNA sequencing (scRNA-seq) technology has enabled discovering gene expression patterns at single cell resolution. However, due to technical limitations, there are usually excessive zeros, called "dropouts," in scRNA-seq data, which may mislead the downstream analysis. Therefore, it is crucial to impute these dropouts to recover the biological information.Results We propose a two-step imputation method called tsImpute to impute scRNA-seq data. At the first step, tsImpute adopts zero-inflated negative binomial distribution to discriminate dropouts from true zeros and performs initial imputation by calculating the expected expression level. At the second step, it conducts clustering with this modified expression matrix, based on which the final distance weighted imputation is performed. Numerical results based on both simulated and real data show that tsImpute achieves favorable performance in terms of gene expression recovery, cell clustering, and differential expression analysis.Availability and implementation The R package of tsImpute is available at https://github.com/ZhengWeihuaYNU/tsImpute.
引用
收藏
页数:8
相关论文
共 47 条
[1]   Tutorial: guidelines for the computational analysis of single-cell RNA sequencing data [J].
Andrews, Tallulah S. ;
Kiselev, Vladimir Yu ;
McCarthy, Davis ;
Hemberg, Martin .
NATURE PROTOCOLS, 2021, 16 (01) :1-9
[2]   A Single-Cell Transcriptomic Map of the Human and Mouse Pancreas Reveals Inter- and Intra-cell Population Structure [J].
Baron, Maayan ;
Veres, Adrian ;
Wolock, Samuel L. ;
Faust, Aubrey L. ;
Gaujoux, Renaud ;
Vetere, Amedeo ;
Ryu, Jennifer Hyoje ;
Wagner, Bridget K. ;
Shen-Orr, Shai S. ;
Klein, Allon M. ;
Melton, Douglas A. ;
Yanai, Itai .
CELL SYSTEMS, 2016, 3 (04) :346-+
[3]   Integrating single-cell transcriptomic data across different conditions, technologies, and species [J].
Butler, Andrew ;
Hoffman, Paul ;
Smibert, Peter ;
Papalexi, Efthymia ;
Satija, Rahul .
NATURE BIOTECHNOLOGY, 2018, 36 (05) :411-+
[4]   scRMD: imputation for single cell RNA-seq data via robust matrix decomposition [J].
Chen, Chong ;
Wu, Changjing ;
Wu, Linjie ;
Wang, Xiaochen ;
Deng, Minghua ;
Xi, Ruibin .
BIOINFORMATICS, 2020, 36 (10) :3156-3161
[5]   Single-cell RNA-seq reveals novel regulators of human embryonic stem cell differentiation to definitive endoderm [J].
Chu, Li-Fang ;
Leng, Ning ;
Zhang, Jue ;
Hou, Zhonggang ;
Mamott, Daniel ;
Vereide, David T. ;
Choi, Jeea ;
Kendziorski, Christina ;
Stewart, Ron ;
Thomson, James A. .
GENOME BIOLOGY, 2016, 17
[6]   Best practices on the differential expression analysis of multi-species RNA-seq [J].
Chung, Matthew ;
Bruno, Vincent M. ;
Rasko, David A. ;
Cuomo, Christina A. ;
Munoz, Jose F. ;
Livny, Jonathan ;
Shetty, Amol C. ;
Mahurkar, Anup ;
Dunning Hotopp, Julie C. .
GENOME BIOLOGY, 2021, 22 (01)
[7]   A survey of human brain transcriptome diversity at the single cell level [J].
Darmanis, Spyros ;
Sloan, Steven A. ;
Zhang, Ye ;
Enge, Martin ;
Caneda, Christine ;
Shuer, Lawrence M. ;
Gephart, Melanie G. Hayden ;
Barres, Ben A. ;
Quake, Stephen R. .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2015, 112 (23) :7285-7290
[8]   MAXIMUM LIKELIHOOD FROM INCOMPLETE DATA VIA EM ALGORITHM [J].
DEMPSTER, AP ;
LAIRD, NM ;
RUBIN, DB .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-METHODOLOGICAL, 1977, 39 (01) :1-38
[9]   Diverse homeostatic and immunomodulatory roles of immune cells in the developing mouse lung at single cell resolution [J].
Domingo-Gonzalez, Racquel ;
Zanini, Fabio ;
Che, Xibing ;
Liu, Min ;
Jones, Robert C. ;
Swift, Michael A. ;
Quake, Stephen R. ;
Cornfield, David N. ;
Alvira, Cristina M. .
ELIFE, 2020, 9 :1-39
[10]   DrImpute: imputing dropout events in single cell RNA sequencing data [J].
Gong, Wuming ;
Kwak, Il-Youp ;
Pota, Pruthvi ;
Koyano-Nakagawa, Naoko ;
Garry, Daniel J. .
BMC BIOINFORMATICS, 2018, 19