Model-based analysis of sample index hopping reveals its widespread artifacts in multiplexed single-cell RNA-sequencing

被引:23
作者
Farouni, Rick [1 ,2 ]
Djambazian, Haig [1 ,2 ]
Ferri, Lorenzo E. [3 ]
Ragoussis, Jiannis [1 ,2 ]
Najafabadi, Hamed S. [1 ,2 ]
机构
[1] McGill Univ, Dept Human Genet, Montreal, PQ H3A 0C7, Canada
[2] McGill Univ, Genome Ctr, Montreal, PQ H3A 0G1, Canada
[3] McGill Univ, Dept Surg, Montreal, PQ H3G 1A4, Canada
基金
加拿大健康研究院; 加拿大创新基金会;
关键词
D O I
10.1038/s41467-020-16522-z
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Index hopping is the main cause of incorrect sample assignment of sequencing reads in multiplexed pooled libraries. We introduce a statistical model for estimating the sample index-hopping rate in multiplexed droplet-based single-cell RNA-seq data and for probabilistic inference of the true sample of origin of hopped reads. We analyze several datasets and estimate the sample index hopping probability to range between 0.003-0.009, a small number that counter-intuitively gives rise to a large fraction of phantom molecules - the fraction of phantom molecules exceeds 8% in more than 25% of samples and reaches as high as 85% in low-complexity samples. Phantom molecules lead to widespread complications in downstream analyses, including transcriptome mixing across cells, emergence of phantom copies of cells from other samples, and misclassification of empty droplets as cells. We demonstrate that our approach can correct for these artifacts by accurately purging the majority of phantom molecules from the data.
引用
收藏
页数:8
相关论文
共 9 条
[1]  
[Anonymous], 2017, Effects of index misassignment on multiplexing and downstream analysis
[2]   Differentiation dynamics of mammary epithelial cells revealed by single-cell RNA sequencing [J].
Bach, Karsten ;
Pensa, Sara ;
Grzelak, Marta ;
Hadfield, James ;
Adams, David J. ;
Marioni, John C. ;
Khaled, Walid T. .
NATURE COMMUNICATIONS, 2017, 8
[3]   Characterization and remediation of sample index swaps by non-redundant dual indexing on massively parallel sequencing platforms [J].
Costello, Maura ;
Fleharty, Mark ;
Abreu, Justin ;
Farjoun, Yossi ;
Ferriera, Steven ;
Holmes, Laurie ;
Granger, Brian ;
Green, Lisa ;
Howd, Tom ;
Mason, Tamara ;
Vicente, Gina ;
Dasilva, Michael ;
Brodeur, Wendy ;
DeSmet, Timothy ;
Dodge, Sheila ;
Lennon, Niall J. ;
Gabriel, Stacey .
BMC GENOMICS, 2018, 19
[4]   Detection and removal of barcode swapping in single-cell RNA-seq data [J].
Griffiths, Jonathan A. ;
Richard, Arianne C. ;
Bach, Karsten ;
Lun, Aaron T. L. ;
Marioni, John C. .
NATURE COMMUNICATIONS, 2018, 9
[5]   Computational correction of index switching in multiplexed sequencing libraries [J].
Larsson, Anton J. M. ;
Stanley, Geoff ;
Sinha, Rahul ;
Weissman, Irving L. ;
Sandberg, Rickard .
NATURE METHODS, 2018, 15 (05) :305-307
[6]   EmptyDrops: distinguishing cells from empty droplets in droplet-based single-cell RNA sequencing data [J].
Lun, Aaron T. L. ;
Riesenfeld, Samantha ;
Andrews, Tallulah ;
The Phuong Dao ;
Gomes, Tomas ;
Marioni, John C. .
GENOME BIOLOGY, 2019, 20
[7]   Unique, dual-indexed sequencing adapters with UMIs effectively eliminate index crosstalk and significantly improve sensitivity of massively parallel sequencing [J].
MacConaill, Laura E. ;
Burns, Robert T. ;
Nag, Anwesha ;
Coleman, Haley A. ;
Slevin, Michael K. ;
Giorda, Kristina ;
Light, Madelyn ;
Lai, Kevin ;
Jarosz, Mirna ;
McNeill, Matthew S. ;
Ducar, Matthew D. ;
Meyerson, Matthew ;
Thorner, Aaron R. .
BMC GENOMICS, 2018, 19
[8]   Single-cell transcriptomics of 20 mouse organs creates a Tabula Muris [J].
Schaum, Nicholas ;
Karkanias, Jim ;
Neff, Norma F. ;
May, Andrew P. ;
Quake, Stephen R. ;
Wyss-Coray, Tony ;
Darmanis, Spyros ;
Batson, Joshua ;
Botvinnik, Olga ;
Chen, Michelle B. ;
Chen, Steven ;
Green, Foad ;
Jones, Robert C. ;
Maynard, Ashley ;
Penland, Lolita ;
Pisco, Angela Oliveira ;
Sit, Rene V. ;
Stanley, Geoffrey M. ;
Webber, James T. ;
Zanini, Fabio ;
Baghel, Ankit S. ;
Bakerman, Isaac ;
Bansal, Ishita ;
Berdnik, Daniela ;
Bilen, Biter ;
Brownfield, Douglas ;
Cain, Corey ;
Cho, Min ;
Cirolia, Giana ;
Conley, Stephanie D. ;
Demers, Aaron ;
Demir, Kubilay ;
de Morree, Antoine ;
Divita, Tessa ;
du Bois, Haley ;
Dulgeroff, Laughing Bear Torrez ;
Ebadi, Hamid ;
Espinoza, F. Hernan ;
Fish, Matt ;
Gan, Qiang ;
George, Benson M. ;
Gillich, Astrid ;
Genetiano, Geraldine ;
Gu, Xueying ;
Gulati, Gunsagar S. ;
Hang, Yan ;
Hosseinzadeh, Shayan ;
Huang, Albin ;
Iram, Tal ;
Isobe, Taichi .
NATURE, 2018, 562 (7727) :367-+
[9]  
Sinha R, 2017, INDEX SWITCHING CAUS, DOI [10.1101/125724v1, DOI 10.1101/125724V1]