Analysing 454 amplicon resequencing experiments using the modular and database oriented Variant Identification Pipeline

被引:13
作者
De Schrijver, Joachim M. [1 ]
De Leeneer, Kim [2 ]
Lefever, Steve [2 ]
Sabbe, Nick [3 ]
Pattyn, Filip [2 ]
Van Nieuwerburgh, Filip [4 ,5 ]
Coucke, Paul [2 ,5 ]
Deforce, Dieter [4 ,5 ]
Vandesompele, Jo [2 ,5 ]
Bekaert, Sofie [1 ,5 ]
Hellemans, Jan [2 ,5 ]
Van Criekinge, Wim [1 ,5 ]
机构
[1] Univ Ghent, Lab Bioinformat & Computat Genom, Dept Mol Biotechnol, B-9000 Ghent, Belgium
[2] Ghent Univ Hosp, Ctr Med Genet, B-9000 Ghent, Belgium
[3] Univ Ghent, Dept Appl Math Biometr & Proc Control, B-9000 Ghent, Belgium
[4] Univ Ghent, Lab Pharmaceut Biotechnol, Fac Pharmaceut Sci, B-9000 Ghent, Belgium
[5] Univ Ghent, NXTGNT Collaborators, B-9000 Ghent, Belgium
关键词
ALIGNMENT; GENOME;
D O I
10.1186/1471-2105-11-269
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Background: Next-generation amplicon sequencing enables high-throughput genetic diagnostics, sequencing multiple genes in several patients together in one sequencing run. Currently, no open-source out-of-the-box software solution exists that reliably reports detected genetic variations and that can be used to improve future sequencing effectiveness by analyzing the PCR reactions. Results: We developed an integrated database oriented software pipeline for analysis of 454/Roche GS-FLX amplicon resequencing experiments using Perl and a relational database. The pipeline enables variation detection, variation detection validation, and advanced data analysis, which provides information that can be used to optimize PCR efficiency using traditional means. The modular approach enables customization of the pipeline where needed and allows researchers to adopt their analysis pipeline to their experiments. Clear documentation and training data is available to test and validate the pipeline prior to using it on real sequencing data. Conclusions: We designed an open-source database oriented pipeline that enables advanced analysis of 454/ Roche GS-FLX amplicon resequencing experiments using SQL-statements. This modular database approach allows easy coupling with other pipeline modules such as variant interpretation or a LIMS system. There is also a set of standard reporting scripts available.
引用
收藏
页数:12
相关论文
共 15 条
[1]   The need for speed [J].
Flicek, Paul .
GENOME BIOLOGY, 2009, 10 (03)
[2]  
H, 2009, BIOINFORMATICS, V25, P1754
[3]   Accuracy and quality of massively parallel DNA pyrosequencing [J].
Huse, Susan M. ;
Huber, Julie A. ;
Morrison, Hilary G. ;
Sogin, Mitchell L. ;
Mark Welch, David .
GENOME BIOLOGY, 2007, 8 (07)
[4]  
Kent WJ, 2002, GENOME RES, V12, P656, DOI [10.1101/gr.229202, 10.1101/gr.229202. Article published online before March 2002]
[5]   Ultrafast and memory-efficient alignment of short DNA sequences to the human genome [J].
Langmead, Ben ;
Trapnell, Cole ;
Pop, Mihai ;
Salzberg, Steven L. .
GENOME BIOLOGY, 2009, 10 (03)
[6]  
Li H., 2009, SAMTOOLS BIOINFORMAT, V25, DOI [10.1093/bioinformatics/btp352, DOI 10.1093/BIOINFORMATICS/BTP352]
[7]   Fast and accurate long-read alignment with Burrows-Wheeler transform [J].
Li, Heng ;
Durbin, Richard .
BIOINFORMATICS, 2010, 26 (05) :589-595
[8]   SOAP: short oligonucleotide alignment program [J].
Li, Ruiqiang ;
Li, Yingrui ;
Kristiansen, Karsten ;
Wang, Jun .
BIOINFORMATICS, 2008, 24 (05) :713-714
[9]   Genome sequencing in microfabricated high-density picolitre reactors [J].
Margulies, M ;
Egholm, M ;
Altman, WE ;
Attiya, S ;
Bader, JS ;
Bemben, LA ;
Berka, J ;
Braverman, MS ;
Chen, YJ ;
Chen, ZT ;
Dewell, SB ;
Du, L ;
Fierro, JM ;
Gomes, XV ;
Godwin, BC ;
He, W ;
Helgesen, S ;
Ho, CH ;
Irzyk, GP ;
Jando, SC ;
Alenquer, MLI ;
Jarvie, TP ;
Jirage, KB ;
Kim, JB ;
Knight, JR ;
Lanza, JR ;
Leamon, JH ;
Lefkowitz, SM ;
Lei, M ;
Li, J ;
Lohman, KL ;
Lu, H ;
Makhijani, VB ;
McDade, KE ;
McKenna, MP ;
Myers, EW ;
Nickerson, E ;
Nobile, JR ;
Plant, R ;
Puc, BP ;
Ronan, MT ;
Roth, GT ;
Sarkis, GJ ;
Simons, JF ;
Simpson, JW ;
Srinivasan, M ;
Tartaro, KR ;
Tomasz, A ;
Vogt, KA ;
Volkmer, GA .
NATURE, 2005, 437 (7057) :376-380
[10]   Bioinformatics challenges of new sequencing technology [J].
Pop, Mihai ;
Salzberg, Steven L. .
TRENDS IN GENETICS, 2008, 24 (03) :142-149