A real-world multi-center RNA-seq benchmarking study using the Quartet and MAQC reference materials

被引:6
作者
Wang, Duo [1 ,2 ,3 ]
Liu, Yaqing [4 ,5 ]
Zhang, Yuanfeng [1 ,2 ,3 ]
Chen, Qingwang [4 ,5 ]
Han, Yanxi [1 ,2 ,3 ]
Hou, Wanwan [4 ,5 ]
Liu, Cong [1 ,2 ,3 ]
Yu, Ying [4 ,5 ]
Li, Ziyang [6 ]
Li, Ziqiang [1 ,2 ,3 ]
Zhao, Jiaxin [1 ,2 ,3 ]
Shi, Leming [4 ,5 ,7 ]
Zheng, Yuanting [4 ,5 ,7 ]
Li, Jinming [1 ,2 ,3 ]
Zhang, Rui [1 ,2 ,3 ]
机构
[1] Chinese Acad Med Sci, Beijing Hosp, Inst Geriatr Med, Natl Ctr Gerontol,Natl Ctr Clin Labs, Beijing, Peoples R China
[2] Chinese Acad Med Sci & Peking Union Med Coll, Natl Ctr Clin Labs, Beijing, Peoples R China
[3] Beijing Hosp, Beijing Engn Res Ctr Lab Med, Beijing, Peoples R China
[4] Fudan Univ, Human Phenome Inst, Sch Life Sci, State Key Lab Genet Engn, Shanghai, Peoples R China
[5] Fudan Univ, Shanghai Canc Ctr, Shanghai, Peoples R China
[6] Cent South Univ, Xiangya Hosp 2, Dept Lab Med, Changsha, Hunan, Peoples R China
[7] Int Human Phenome Inst, Shanghai, Peoples R China
关键词
QUALITY-CONTROL; REPRODUCIBILITY; ALIGNMENT; QUANTIFICATION; TRANSCRIPTOME; PACKAGE; ERRORS;
D O I
10.1038/s41467-024-50420-y
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Translating RNA-seq into clinical diagnostics requires ensuring the reliability and cross-laboratory consistency of detecting clinically relevant subtle differential expressions, such as those between different disease subtypes or stages. As part of the Quartet project, we present an RNA-seq benchmarking study across 45 laboratories using the Quartet and MAQC reference samples spiked with ERCC controls. Based on multiple types of 'ground truth', we systematically assess the real-world RNA-seq performance and investigate the influencing factors involved in 26 experimental processes and 140 bioinformatics pipelines. Here we show greater inter-laboratory variations in detecting subtle differential expressions among the Quartet samples. Experimental factors including mRNA enrichment and strandedness, and each bioinformatics step, emerge as primary sources of variations in gene expression. We underscore the profound influence of experimental execution, and provide best practice recommendations for experimental designs, strategies for filtering low-expression genes, and the optimal gene annotation and analysis pipelines. In summary, this study lays the foundation for developing and quality control of RNA-seq for clinical diagnostic purposes. Here the authors report on an RNA-seq benchmarking study that demonstrates greater inter-lab variations in detecting subtle differential expression. The study reveals the impact of experimental execution, experimental designs, low-expression gene filtering, and analysis tool selection.
引用
收藏
页数:21
相关论文
共 97 条
[1]   Comparison of library preparation methods reveals their impact on interpretation of metatranscriptomic data [J].
Alberti, Adriana ;
Belser, Caroline ;
Engelen, Stefan ;
Bertrand, Laurie ;
Orvain, Celine ;
Brinas, Laura ;
Cruaud, Corinne ;
Giraut, Laurene ;
Da Silva, Corinne ;
Firmo, Cyril ;
Aury, Jean-Marc ;
Wincker, Patrick .
BMC GENOMICS, 2014, 15
[2]   Differential expression analysis for sequence count data [J].
Anders, Simon ;
Huber, Wolfgang .
GENOME BIOLOGY, 2010, 11 (10)
[3]   HTSeq-a Python']Python framework to work with high-throughput sequencing data [J].
Anders, Simon ;
Pyl, Paul Theodor ;
Huber, Wolfgang .
BIOINFORMATICS, 2015, 31 (02) :166-169
[4]  
[Anonymous], 2023, Ensembl. GRCh38 genome assembly Ensembl genome browser
[5]  
[Anonymous], 2023, RefSeq. /refseq/Hsapiens/annotation/GRCh38latest/refseqidentifiers/GRCh38latestgenomic.gtf.gz
[6]  
[Anonymous], 2023, Ensembl. /pub/release-109/gtf/homosapiens/Homosapiens.GRCh38.109.gtf.gz
[7]   A computational method for estimating the PCR duplication rate in DNA and RNA-seq experiments [J].
Bansal, Vikas .
BMC BIOINFORMATICS, 2017, 18
[8]  
Baruzzo G, 2017, NAT METHODS, V14, P135, DOI [10.1038/NMETH.4106, 10.1038/nmeth.4106]
[9]   SOX2 controls tumour initiation and cancer stem-cell functions in squamous-cell carcinoma [J].
Boumahdi, Soufiane ;
Driessens, Gregory ;
Lapouge, Gaelle ;
Rorive, Sandrine ;
Nassar, Dany ;
Le Mercier, Marie ;
Delatte, Benjamin ;
Caauwe, Amelie ;
Lenglez, Sandrine ;
Nkusi, Erwin ;
Brohee, Sylvain ;
Salmon, Isabelle ;
Dubois, Christine ;
del Marmol, Veronique ;
Fuks, Francois ;
Beck, Benjamin ;
Blanpain, Cedric .
NATURE, 2014, 511 (7508) :246-+
[10]   Near-optimal probabilistic RNA-seq quantification (vol 34, pg 525, 2016) [J].
Bray, Nicolas L. ;
Pimentel, Harold ;
Melsted, Pall ;
Pachter, Lior .
NATURE BIOTECHNOLOGY, 2016, 34 (08) :888-888