Identifying strengths and weaknesses of methods for computational network inference from single-cell RNA-seq data

被引:17
作者
McCalla, Sunnie Grace [1 ,3 ]
Siahpirani, Alireza Fotuhi [1 ]
Li, Jiaxin [1 ,3 ]
Pyne, Saptarshi [1 ]
Stone, Matthew [1 ,2 ]
Periyasamy, Viswesh [1 ,4 ]
Shin, Junha [1 ]
Roy, Sushmita [1 ,2 ,4 ,5 ]
机构
[1] Univ Wisconsin Madison, Wisconsin Inst Discovery, Madison, WI 53715 USA
[2] Univ Wisconsin Madison, Dept Biostat & Med Informat, Madison, WI 53792 USA
[3] Univ Wisconsin Madison, Lab Genet, Madison, WI 53706 USA
[4] Univ Wisconsin Madison, Dept Comp Sci, Madison, WI 53706 USA
[5] Univ Wisconsin Madison, Dept Biostat & Med Informat, 600 Highland Ave, Madison, WI 53792 USA
关键词
gene regulatory networks; single-cell RNA-seq; benchmarking; network inference algorithms; prior knowledge; GENE REGULATORY NETWORK; DIFFERENTIATION; PLURIPOTENCY; DISCOVERY;
D O I
10.1093/g3journal/jkad004
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
Single-cell RNA-sequencing (scRNA-seq) offers unparalleled insight into the transcriptional programs of different cellular states by measuring the transcriptome of thousands of individual cells. An emerging problem in the analysis of scRNA-seq is the inference of transcriptional gene regulatory networks and a number of methods with different learning frameworks have been developed to address this problem. Here, we present an expanded benchmarking study of eleven recent network inference methods on seven published scRNA-seq datasets in human, mouse, and yeast considering different types of gold standard networks and evaluation metrics. We evaluate methods based on their computing requirements as well as on their ability to recover the network structure. We find that, while most methods have a modest recovery of experimentally derived interactions based on global metrics such as Area Under the Precision Recall curve, methods are able to capture targets of regulators that are relevant to the system under study. Among the top performing methods that use only expression were SCENIC, PIDC, MERLIN or Correlation. Addition of prior biological knowledge and the estimation of transcription factor activities resulted in the best overall performance with the Inferelator and MERLIN methods that use prior knowledge outperforming methods that use expression alone. We found that imputation for network inference did not improve network inference accuracy and could be detrimental. Comparisons of inferred networks for comparable bulk conditions showed that the networks inferred from scRNA-seq datasets are often better or at par with the networks inferred from bulk datasets. Our analysis should be beneficial in selecting methods for network inference. At the same time, this highlights the need for improved methods and better gold standards for regulatory network inference from scRNAseq datasets.
引用
收藏
页数:18
相关论文
共 58 条
[1]  
Aibar S, 2017, NAT METHODS, V14, P1083, DOI [10.1038/nmeth.4463, 10.1038/NMETH.4463]
[2]   MetaCell: analysis of single-cell RNA-seq data using K-nn graph partitions [J].
Baran, Yael ;
Bercovich, Akhiad ;
Sebe-Pedros, Arnau ;
Lubling, Yaniv ;
Giladi, Amir ;
Chomsky, Elad ;
Meir, Zohar ;
Hoichman, Michael ;
Lifshitz, Aviezer ;
Tanay, Amos .
GENOME BIOLOGY, 2019, 20 (01)
[3]   CIS:: compound importance sampling method for protein-DNA binding site p-value estimation [J].
Barash, Y ;
Elidan, G ;
Kaplan, T ;
Friedman, N .
BIOINFORMATICS, 2005, 21 (05) :596-600
[4]   Metacells untangle large and complex single-cell transcriptome networks [J].
Bilous, Mariia ;
Tran, Loc ;
Cianciaruso, Chiara ;
Gabriel, Aurelie ;
Michel, Hugo ;
Carmona, Santiago J. ;
Pittet, Mikael J. ;
Gfeller, David .
BMC BIOINFORMATICS, 2022, 23 (01)
[5]   Core transcriptional regulatory circuitry in human embryonic stem cells [J].
Boyer, LA ;
Lee, TI ;
Cole, MF ;
Johnstone, SE ;
Levine, SS ;
Zucker, JR ;
Guenther, MG ;
Kumar, RM ;
Murray, HL ;
Jenner, RG ;
Gifford, DK ;
Melton, DA ;
Jaenisch, R ;
Young, RA .
CELL, 2005, 122 (06) :947-956
[6]   Single-Cell Expression Analyses during Cellular Reprogramming Reveal an Early Stochastic and a Late Hierarchic Phase [J].
Buganim, Yosef ;
Faddah, Dina A. ;
Cheng, Albert W. ;
Itskovich, Elena ;
Markoulaki, Styliani ;
Ganz, Kibibi ;
Klemm, Sandy L. ;
van Oudenaarden, Alexander ;
Jaenisch, Rudolf .
CELL, 2012, 150 (06) :1209-1222
[7]   Multi-study inference of regulatory networks for more accurate models of gene regulation [J].
Castro, Dayanne M. ;
de Veaux, Nicholas R. ;
Miraldi, Emily R. ;
Bonneau, Richard .
PLOS COMPUTATIONAL BIOLOGY, 2019, 15 (01)
[8]   Gene Regulatory Network Inference from Single-Cell Data Using Multivariate Information Measures [J].
Chan, Thalia E. ;
Stumpf, Michael P. H. ;
Babtie, Ann C. .
CELL SYSTEMS, 2017, 5 (03) :251-+
[9]   Evaluating methods of inferring gene regulatory networks highlights their lack of performance for single cell gene expression data [J].
Chen, Shuonan ;
Mar, Jessica C. .
BMC BIOINFORMATICS, 2018, 19
[10]  
Davis J., 2006, Proceedings of the 23rd International Conference on Machine Learning, P233, DOI DOI 10.1145/1143844.1143874