Genome assembly reborn: recent computational challenges

Citations: 202
Author
Pop, Mihai [1]
Affiliation
[1] Univ Maryland, Dept Comp Sci, Ctr Bioinformat & Computat Biol, College Pk, MD 20742 USA
Funding
National Science Foundation (United States)
Keywords
genome assembly; genome sequencing; next generation sequencing technologies; SEQUENCING TECHNOLOGY; MICROBIAL GENOMES; REPEAT REGIONS; DNA-SEQUENCES; SHOTGUN; TOOL; VALIDATION; ALGORITHMS; GENERATION; ACCURACY;
DOI
10.1093/bib/bbp026
Chinese Library Classification number
Q5 [Biochemistry]
Discipline classification codes
071010; 081704
Abstract
Research into genome assembly algorithms has experienced a resurgence due to new challenges created by the development of next generation sequencing technologies. Several genome assemblers have been published in recent years specifically targeted at the new sequence data; however, the ever-changing technological landscape leads to the need for continued research. In addition, the low cost of next generation sequencing data has led to an increased use of sequencing in new settings. For example, the new field of metagenomics relies on large-scale sequencing of entire microbial communities instead of isolate genomes, leading to new computational challenges. In this article, we outline the major algorithmic approaches for genome assembly and describe recent developments in this domain.
Pages: 354-366
Page count: 13