Artificial duplicate reads in sequencing data of 454 Genome Sequencer FLX System

被引:16
作者
Dong, Hui [1 ]
Chen, Yangyi [2 ]
Shen, Yan [2 ]
Wang, Shengyue [1 ]
Zhao, Guoping [1 ,2 ]
Jin, Weirong [1 ,2 ]
机构
[1] Chinese Natl Human Genome Ctr Shanghai, Shanghai 201203, Peoples R China
[2] Natl Engn Ctr Biochip Shanghai, Shanghai 201203, Peoples R China
关键词
duplicate reads; 454 GS FLX System; emulsion PCR;
D O I
10.1093/abbs/gmr030
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
The 454 Genome Sequencer (GS) FLX System is one of the next-generation sequencing systems featured by long reads, high accuracy, and ultra-high throughput. Based on the mechanism of emulsion PCR, a unique DNA template would only generate a unique sequence read after being amplified and sequenced on GS FLX. However, biased amplification of DNA templates might occur in the process of emulsion PCR, which results in production of artificial duplicate reads. Under the condition that each DNA template is unique to another, 3.49%-18.14% of total reads in GS FLX-sequencing data were found to be artificial duplicate reads. These duplicate reads may lead to misunderstanding of sequencing data and special attention should be paid to the potential biases they introduced to the data.
引用
收藏
页码:496 / 500
页数:5
相关论文
共 14 条
[1]   Diversity of microRNAs in human and chimpanzee brain [J].
Berezikov, Eugene ;
Thuemmler, Fritz ;
van Laake, Linda W. ;
Kondova, Ivanela ;
Bontrop, Ronald ;
Cuppen, Edwin ;
Plasterk, Ronald H. A. .
NATURE GENETICS, 2006, 38 (12) :1375-1377
[2]   Functional metagenomic profiling of nine biomes [J].
Dinsdale, Elizabeth A. ;
Edwards, Robert A. ;
Hall, Dana ;
Angly, Florent ;
Breitbart, Mya ;
Brulc, Jennifer M. ;
Furlan, Mike ;
Desnues, Christelle ;
Haynes, Matthew ;
Li, Linlin ;
McDaniel, Lauren ;
Moran, Mary Ann ;
Nelson, Karen E. ;
Nilsson, Christina ;
Olson, Robert ;
Paul, John ;
Brito, Beltran Rodriguez ;
Ruan, Yijun ;
Swan, Brandon K. ;
Stevens, Rick ;
Valentine, David L. ;
Thurber, Rebecca Vega ;
Wegley, Linda ;
White, Bryan A. ;
Rohwer, Forest .
NATURE, 2008, 452 (7187) :629-U8
[3]   The genome sequencer FLX™ system-longer reads, more applications, straight forward bioinformatics and more complete data sets [J].
Droege, Marcus ;
Hill, Brendon .
JOURNAL OF BIOTECHNOLOGY, 2008, 136 (1-2) :3-10
[4]   Systematic artifacts in metagenomes from complex microbial communities [J].
Gomez-Alvarez, Vicente ;
Teal, Tracy K. ;
Schmidt, Thomas M. .
ISME JOURNAL, 2009, 3 (11) :1314-1317
[5]  
Korbel JO, 2007, SCIENCE, V318, P420, DOI 10.1126/science.1149504
[6]   Genome sequencing in microfabricated high-density picolitre reactors [J].
Margulies, M ;
Egholm, M ;
Altman, WE ;
Attiya, S ;
Bader, JS ;
Bemben, LA ;
Berka, J ;
Braverman, MS ;
Chen, YJ ;
Chen, ZT ;
Dewell, SB ;
Du, L ;
Fierro, JM ;
Gomes, XV ;
Godwin, BC ;
He, W ;
Helgesen, S ;
Ho, CH ;
Irzyk, GP ;
Jando, SC ;
Alenquer, MLI ;
Jarvie, TP ;
Jirage, KB ;
Kim, JB ;
Knight, JR ;
Lanza, JR ;
Leamon, JH ;
Lefkowitz, SM ;
Lei, M ;
Li, J ;
Lohman, KL ;
Lu, H ;
Makhijani, VB ;
McDade, KE ;
McKenna, MP ;
Myers, EW ;
Nickerson, E ;
Nobile, JR ;
Plant, R ;
Puc, BP ;
Ronan, MT ;
Roth, GT ;
Sarkis, GJ ;
Simons, JF ;
Simpson, JW ;
Srinivasan, M ;
Tartaro, KR ;
Tomasz, A ;
Vogt, KA ;
Volkmer, GA .
NATURE, 2005, 437 (7057) :376-380
[7]   SuperSAGE: the drought stress-responsive transcriptome of chickpea roots [J].
Molina, Carlos ;
Rotter, Bjoern ;
Horres, Ralf ;
Udupa, Sripada M. ;
Besser, Bert ;
Bellarmino, Luis ;
Baum, Michael ;
Matsumura, Hideo ;
Terauchi, Ryohei ;
Kahl, Guenter ;
Winter, Peter .
BMC GENOMICS, 2008, 9 (1)
[8]   DeepSAGE - digital transcriptomics with high sensitivity, simple experimental protocol and multiplexing of samples [J].
Nielsen, Kare L. ;
Hogh, Annabeth Laursen ;
Emmersen, Jeppe .
NUCLEIC ACIDS RESEARCH, 2006, 34 (19)
[9]   Artificial and natural duplicates in pyrosequencing reads of metagenomic data [J].
Niu, Beifang ;
Fu, Limin ;
Sun, Shulei ;
Li, Weizhong .
BMC BIOINFORMATICS, 2010, 11
[10]   The development and impact of 454 sequencing [J].
Rothberg, Jonathan M. ;
Leamon, John H. .
NATURE BIOTECHNOLOGY, 2008, 26 (10) :1117-1124