System analysis of the sequencing quality of human whole exome samples on BGI NGS platform

Cited by: 20
Authors
Belova, Vera [1]
Pavlova, Anna [1]
Afasizhev, Robert [1]
Moskalenko, Viktoriya [1]
Korzhanova, Margarita [1]
Krivoy, Andrey [1]
Cheranev, Valery [1]
Nikashin, Boris [1]
Bulusheva, Irina [1]
Rebrikov, Denis [1]
Korostin, Dmitriy [1]
Affiliation
[1] Pirogov Med Univ, Ctr Precis Genome Editing & Genet Technol Biomed, Ostovityanova Str 1, Moscow 117997, Russia
Keywords
CAPTURE; PERFORMANCE
DOI
10.1038/s41598-021-04526-8
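Chinese Library Classification (CLC) number
O [Mathematical sciences and chemistry]; P [Astronomy and earth sciences]; Q [Biological sciences]; N [General natural sciences]
Discipline classification code
07; 0710; 09
Abstract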
Human exome sequencing is a classical method used in most medical genetic applications. The leaders in the field are the manufacturers of enrichment kits based on hybridization of biotinylated cRNA or cDNA probes specific for a genomic region of interest. Recently, the platforms manufactured by the Chinese company MGI Tech have become widespread in Europe and Asia. The reliability and quality of the data they produce are no longer in doubt. However, only a few kits compatible with these sequencers can be used for such specific tasks as exome sequencing. We developed our own solution for pre-capture library pooling and exome enrichment with Agilent probes. In this work, using a set of standard benchmark samples from the Platinum Genome collection, we demonstrate that the qualitative and quantitative parameters of our protocol, which we call "RSMU_exome", exceed those of the MGI Tech kit. Our protocol identifies more SNVs and indels, generates fewer PCR duplicates, allows more samples to be pooled in a single enrichment procedure, and requires less raw data to obtain results comparable with those of the MGI Tech protocol. The cost of our protocol is also lower than that of the MGI Tech solution.
Pages: 15