Whole-genome sequencing and variant discovery in C-elegans

被引:278
作者
Hillier, LaDeana W. [1 ]
Marth, Gabor T. [2 ]
Quinlan, Aaron R. [2 ]
Dooling, David [1 ]
Fewell, Ginger [1 ]
Barnett, Derek [2 ]
Fox, Paul [1 ]
Glasscock, Jarret I. [1 ]
Hickenbotham, Matthew [1 ]
Huang, Weichun [2 ]
Magrini, Vincent J. [1 ]
Richt, Ryan J. [1 ]
Sander, Sacha N. [1 ]
Stewart, Donald A. [2 ]
Stromberg, Michael [2 ]
Tsung, Eric F. [2 ]
Wylie, Todd [1 ]
Schedl, Tim [1 ]
Wilson, Richard K. [1 ]
Mardis, Elaine R. [1 ]
机构
[1] Washington Univ, Sch Med, Dept Genet, St Louis, MO 63108 USA
[2] Boston Coll, Dept Biol, Chestnut Hill, MA 02467 USA
关键词
D O I
10.1038/NMETH.1179
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Massively parallel sequencing instruments enable rapid and inexpensive DNA sequence data production. Because these instruments are new, their data require characterization with respect to accuracy and utility. To address this, we sequenced a Caernohabditis elegans N2 Bristol strain isolate using the Solexa Sequence Analyzer, and compared the reads to the reference genome to characterize the data and to evaluate coverage and representation. Massively parallel sequencing facilitates strain-to-reference comparison for genome-wide sequence variant discovery. Owing to the short-read-length sequences produced, we developed a revised approach to determine the regions of the genome to which short reads could be uniquely mapped. We then aligned Solexa reads from C. elegans strain CB4858 to the reference, and screened for single-nucleotide polymorphisms (SNPs) and small indels. This study demonstrates the utility of massively parallel short read sequencing for whole genome resequencing and for accurate discovery of genome-wide polymorphisms.
引用
收藏
页码:183 / 188
页数:6
相关论文
共 17 条
[1]   Automating resequencing-based detection of insertion-deletion polymorphisms [J].
Bhangale, Tushar R. ;
Stephens, Matthew ;
Nickerson, Deborah A. .
NATURE GENETICS, 2006, 38 (12) :1457-1462
[2]   WormBase:: new content and better access [J].
Bieri, Tamberlyn ;
Blasiar, Darin ;
Ozersky, Philip ;
Antoshechkin, Igor ;
Bastiani, Carol ;
Canaran, Payan ;
Chan, Juancarlos ;
Chen, Nansheng ;
Chen, Wen J. ;
Davis, Paul ;
Fiedler, Tristan J. ;
Girard, Lisa ;
Han, Michael ;
Harris, Todd W. ;
Kishore, Ranjana ;
Lee, Raymond ;
McKay, Sheldon ;
Muller, Hans-Michael ;
Nakamura, Cecilia ;
Petcherski, Andrei ;
Rangarajan, Arun ;
Rogers, Anthony ;
Schindelman, Gary ;
Schwarz, Erich M. ;
Spooner, Will ;
Tuli, Mary Ann ;
Van Auken, Kimberly ;
Wang, Daniel ;
Wang, Xiaodong ;
Williams, Gary ;
Durbin, Richard ;
Stein, Lincoln D. ;
Sternberg, Paul W. ;
Spieth, John .
NUCLEIC ACIDS RESEARCH, 2007, 35 :D506-D510
[3]   Genome sequence of the nematode C-elegans:: A platform for investigating biology [J].
不详 .
SCIENCE, 1998, 282 (5396) :2012-2018
[4]   Finishing the euchromatic sequence of the human genome [J].
Collins, FS ;
Lander, ES ;
Rogers, J ;
Waterston, RH .
NATURE, 2004, 431 (7011) :931-945
[5]   Phylogenetics in Caenorhabditis elegans:: An analysis of divergence and outcrossing [J].
Denver, DR ;
Morris, K ;
Thomas, WK .
MOLECULAR BIOLOGY AND EVOLUTION, 2003, 20 (03) :393-400
[6]   Consed: A graphical tool for sequence finishing [J].
Gordon, D ;
Abajian, C ;
Green, P .
GENOME RESEARCH, 1998, 8 (03) :195-202
[7]   WormBase: a multi-species resource for nematode biology and genomics [J].
Harris, TW ;
Chen, NS ;
Cunningham, F ;
Tello-Ruiz, M ;
Antoshechkin, I ;
Bastiani, C ;
Bieri, T ;
Blasiar, D ;
Bradnam, K ;
Chan, J ;
Chen, CK ;
Chen, WJ ;
Davis, P ;
Kenny, E ;
Kishore, R ;
Lawson, D ;
Lee, R ;
Muller, HM ;
Nakamura, C ;
Ozersky, P ;
Petcherski, A ;
Rogers, A ;
Sabo, A ;
Schwarz, EM ;
Van Auken, K ;
Wang, QH ;
Durbin, R ;
Spieth, J ;
Sternberg, PW ;
Stein, LD .
NUCLEIC ACIDS RESEARCH, 2004, 32 :D411-D417
[8]  
Hodgkin J, 1997, GENETICS, V146, P149
[9]   Single nucleotide polymorphisms in wild isolates of Caenorhabditis elegans [J].
Koch, R ;
van Luenen, HGAM ;
van der Horst, M ;
Thijssen, KL ;
Plasterk, RHA .
GENOME RESEARCH, 2000, 10 (11) :1690-1696
[10]   Initial sequencing and analysis of the human genome [J].
Lander, ES ;
Int Human Genome Sequencing Consortium ;
Linton, LM ;
Birren, B ;
Nusbaum, C ;
Zody, MC ;
Baldwin, J ;
Devon, K ;
Dewar, K ;
Doyle, M ;
FitzHugh, W ;
Funke, R ;
Gage, D ;
Harris, K ;
Heaford, A ;
Howland, J ;
Kann, L ;
Lehoczky, J ;
LeVine, R ;
McEwan, P ;
McKernan, K ;
Meldrim, J ;
Mesirov, JP ;
Miranda, C ;
Morris, W ;
Naylor, J ;
Raymond, C ;
Rosetti, M ;
Santos, R ;
Sheridan, A ;
Sougnez, C ;
Stange-Thomann, N ;
Stojanovic, N ;
Subramanian, A ;
Wyman, D ;
Rogers, J ;
Sulston, J ;
Ainscough, R ;
Beck, S ;
Bentley, D ;
Burton, J ;
Clee, C ;
Carter, N ;
Coulson, A ;
Deadman, R ;
Deloukas, P ;
Dunham, A ;
Dunham, I ;
Durbin, R ;
French, L .
NATURE, 2001, 409 (6822) :860-921