Genome assembly reborn: recent computational challenges

被引:202
作者
Pop, Mihai [1 ]
机构
[1] Univ Maryland, Dept Comp Sci, Ctr Bioinformat & Computat Biol, College Pk, MD 20742 USA
基金
美国国家科学基金会;
关键词
genome assembly; genome sequencing; next generation sequencing technologies; SEQUENCING TECHNOLOGY; MICROBIAL GENOMES; REPEAT REGIONS; DNA-SEQUENCES; SHOTGUN; TOOL; VALIDATION; ALGORITHMS; GENERATION; ACCURACY;
D O I
10.1093/bib/bbp026
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Research into genome assembly algorithms has experienced a resurgence due to new challenges created by the development of next generation sequencing technologies. Several genome assemblers have been published in recent years specifically targeted at the new sequence data; however, the ever-changing technological landscape leads to the need for continued research. In addition, the low cost of next generation sequencing data has led to an increased use of sequencing in new settings. For example, the new field of metagenomics relies on large-scale sequencing of entire microbial communities instead of isolate genomes, leading to new computational challenges. In this article, we outline the major algorithmic approaches for genome assembly and describe recent developments in this domain.
引用
收藏
页码:354 / 366
页数:13
相关论文
共 73 条
[61]   STRATEGY OF DNA SEQUENCING EMPLOYING COMPUTER-PROGRAMS [J].
STADEN, R .
NUCLEIC ACIDS RESEARCH, 1979, 6 (07) :2601-2610
[62]   Whole-Genome Sequencing and Assembly with High-Throughput, Short-Read Technologies [J].
Sundquist, Andreas ;
Ronaghi, Mostafa ;
Tang, Haixu ;
Pevzner, Pavel ;
Batzoglou, Serafim .
PLOS ONE, 2007, 2 (05)
[63]   Separation of nearly identical repeats in shotgun assemblies using defined nucleotide positions, DNPs [J].
Tammi, MT ;
Arner, E ;
Britton, T ;
Andersson, B .
BIOINFORMATICS, 2002, 18 (03) :379-388
[64]   Community structure and metabolism through reconstruction of microbial genomes from the environment [J].
Tyson, GW ;
Chapman, J ;
Hugenholtz, P ;
Allen, EE ;
Ram, RJ ;
Richardson, PM ;
Solovyev, VV ;
Rubin, EM ;
Rokhsar, DS ;
Banfield, JF .
NATURE, 2004, 428 (6978) :37-43
[65]   Environmental genome shotgun sequencing of the Sargasso Sea [J].
Venter, JC ;
Remington, K ;
Heidelberg, JF ;
Halpern, AL ;
Rusch, D ;
Eisen, JA ;
Wu, DY ;
Paulsen, I ;
Nelson, KE ;
Nelson, W ;
Fouts, DE ;
Levy, S ;
Knap, AH ;
Lomas, MW ;
Nealson, K ;
White, O ;
Peterson, J ;
Hoffman, J ;
Parsons, R ;
Baden-Tillson, H ;
Pfannkoch, C ;
Rogers, YH ;
Smith, HO .
SCIENCE, 2004, 304 (5667) :66-74
[66]   The sequence of the human genome [J].
Venter, JC ;
Adams, MD ;
Myers, EW ;
Li, PW ;
Mural, RJ ;
Sutton, GG ;
Smith, HO ;
Yandell, M ;
Evans, CA ;
Holt, RA ;
Gocayne, JD ;
Amanatides, P ;
Ballew, RM ;
Huson, DH ;
Wortman, JR ;
Zhang, Q ;
Kodira, CD ;
Zheng, XQH ;
Chen, L ;
Skupski, M ;
Subramanian, G ;
Thomas, PD ;
Zhang, JH ;
Miklos, GLG ;
Nelson, C ;
Broder, S ;
Clark, AG ;
Nadeau, C ;
McKusick, VA ;
Zinder, N ;
Levine, AJ ;
Roberts, RJ ;
Simon, M ;
Slayman, C ;
Hunkapiller, M ;
Bolanos, R ;
Delcher, A ;
Dew, I ;
Fasulo, D ;
Flanigan, M ;
Florea, L ;
Halpern, A ;
Hannenhalli, S ;
Kravitz, S ;
Levy, S ;
Mobarry, C ;
Reinert, K ;
Remington, K ;
Abu-Threideh, J ;
Beasley, E .
SCIENCE, 2001, 291 (5507) :1304-+
[67]   Assembly of polymorphic genomes:: Algorithms and application to Ciona savignyi [J].
Vinson, JP ;
Jaffe, DB ;
O'Neill, K ;
Karlsson, EK ;
Stange-Thomann, N ;
Anderson, S ;
Mesirov, JP ;
Satoh, N ;
Satou, Y ;
Nusbaum, C ;
Birren, B ;
Galagan, JE ;
Lander, ES .
GENOME RESEARCH, 2005, 15 (08) :1127-1135
[68]   The diploid genome sequence of an Asian individual [J].
Wang, Jun ;
Wang, Wei ;
Li, Ruiqiang ;
Li, Yingrui ;
Tian, Geng ;
Goodman, Laurie ;
Fan, Wei ;
Zhang, Junqing ;
Li, Jun ;
Zhang, Juanbin ;
Guo, Yiran ;
Feng, Binxiao ;
Li, Heng ;
Lu, Yao ;
Fang, Xiaodong ;
Liang, Huiqing ;
Du, Zhenglin ;
Li, Dong ;
Zhao, Yiqing ;
Hu, Yujie ;
Yang, Zhenzhen ;
Zheng, Hancheng ;
Hellmann, Ines ;
Inouye, Michael ;
Pool, John ;
Yi, Xin ;
Zhao, Jing ;
Duan, Jinjie ;
Zhou, Yan ;
Qin, Junjie ;
Ma, Lijia ;
Li, Guoqing ;
Yang, Zhentao ;
Zhang, Guojie ;
Yang, Bin ;
Yu, Chang ;
Liang, Fang ;
Li, Wenjie ;
Li, Shaochuan ;
Li, Dawei ;
Ni, Peixiang ;
Ruan, Jue ;
Li, Qibin ;
Zhu, Hongmei ;
Liu, Dongyuan ;
Lu, Zhike ;
Li, Ning ;
Guo, Guangwu ;
Zhang, Jianguo ;
Ye, Jia .
NATURE, 2008, 456 (7218) :60-U1
[69]   The complete genome of an individual by massively parallel DNA sequencing [J].
Wheeler, David A. ;
Srinivasan, Maithreyan ;
Egholm, Michael ;
Shen, Yufeng ;
Chen, Lei ;
McGuire, Amy ;
He, Wen ;
Chen, Yi-Ju ;
Makhijani, Vinod ;
Roth, G. Thomas ;
Gomes, Xavier ;
Tartaro, Karrie ;
Niazi, Faheem ;
Turcotte, Cynthia L. ;
Irzyk, Gerard P. ;
Lupski, James R. ;
Chinault, Craig ;
Song, Xing-zhi ;
Liu, Yue ;
Yuan, Ye ;
Nazareth, Lynne ;
Qin, Xiang ;
Muzny, Donna M. ;
Margulies, Marcel ;
Weinstock, George M. ;
Gibbs, Richard A. ;
Rothberg, Jonathan M. .
NATURE, 2008, 452 (7189) :872-U5
[70]   Globally Distributed Uncultivated Oceanic N2-Fixing Cyanobacteria Lack Oxygenic Photosystem II [J].
Zehr, Jonathan P. ;
Bench, Shellie R. ;
Carter, Brandon J. ;
Hewson, Ian ;
Niazi, Faheem ;
Shi, Tuo ;
Tripp, H. James ;
Affourtit, Jason P. .
SCIENCE, 2008, 322 (5904) :1110-1112