Decoding the oak genome: public release of sequence data, assembly, annotation and publication strategies

Cited by: 85
Authors
Plomion, Christophe [1 ,2 ]
Aury, Jean-Marc [3 ]
Amselem, Joelle [4 ]
Alaeitabar, Tina [4 ]
Barbe, Valerie [3 ]
Belser, Caroline [3 ]
Berges, Helene [5 ]
Bodenes, Catherine [1 ,2 ]
Boudet, Nathalie [6 ]
Boury, Christophe [1 ,2 ]
Canaguier, Aurelie [6 ]
Couloux, Arnaud [3 ]
Da Silva, Corinne [3 ]
Duplessis, Sebastien [7 ]
Ehrenmann, Francois [1 ,2 ]
Estrada-Mairey, Barbara [3 ]
Fouteau, Stephanie [3 ]
Francillonne, Nicolas
Gaspin, Christine [8 ]
Guichard, Cecile [6 ]
Klopp, Christophe [8 ]
Labadie, Karine [3 ]
Lalanne, Celine [1 ,2 ]
Le Clainche, Isabelle [6 ]
Leple, Jean-Charles [9 ]
Le Provost, Gregoire [1 ,2 ]
Leroy, Thibault [1 ,2 ]
Lesur, Isabelle [1 ,2 ]
Martin, Francis [7 ]
Mercier, Jonathan [3 ]
Michotey, Celia
Murat, Florent [10 ]
Salin, Franck [1 ,2 ]
Steinbach, Delphine [4 ]
Faivre-Rampant, Patricia [6 ]
Wincker, Patrick [3 ,11 ,12 ]
Salse, Jerome [10 ]
Quesneville, Hadi
Kremer, Antoine [1 ,2 ]
Affiliations
[1] INRA, UMR1202, BIOGECO, F-33610 Cestas, France
[2] Univ Bordeaux, BIOGECO, UMR1202, F-33170 Talence, France
[3] CEA, IG, F-91057 Evry, France
[4] INRA, URGI, F-78026 Versailles, France
[5] INRA, CNRGV, F-31326 Castanet Tolosan, France
[6] INRA, URGV, Plant Genom Res, F-91057 Evry, France
[7] Univ Lorraine, INRA, UMR1136,Interact Arbres Microorganismes, Lab Excellence ARBRE, F-54280 Champenoux, France
[8] INRA, UBIA, Plateforme Bioinformat Toulouse Midi Pyrenees, F-31326 Castanet Tolosan, France
[9] INRA, Ameliorat Genet & Physiol Forestieres UR0588, Orleans, France
[10] INRA, UBP UMR 1095, Lab Genet Diversite & Ecophysiol Cereales, F-63039 Clermont Ferrand, France
[11] Univ Evry Val d'Essonne, UMR 8030, CP5706, Evry, France
[12] CNRS, UMR 8030, Evry, France
Funding
European Research Council;
Keywords
genome sequence; genomic resources; Quercus robur; QUANTITATIVE TRAIT LOCI; QUERCUS-ROBUR; PEDUNCULATE OAK; BUD BURST; GENE-EXPRESSION; EUROPEAN OAK; SESSILE OAK; DIFFERENTIATION; POPULATIONS; ADAPTATION;
DOI
10.1111/1755-0998.12425
Chinese Library Classification
Q5 [Biochemistry]; Q7 [Molecular Biology];
Discipline classification codes
071010 ; 081704 ;
Abstract
The 1.5 Gbp/2C genome of pedunculate oak (Quercus robur) has been sequenced. A strategy was established for dealing with the challenges imposed by the sequencing of such a large, complex and highly heterozygous genome by a whole-genome shotgun (WGS) approach, without the use of costly and time-consuming methods, such as fosmid or BAC clone-based hierarchical sequencing methods. The sequencing strategy combined short and long reads. Over 49 million reads provided by Roche 454 GS-FLX technology were assembled into contigs and combined with shorter Illumina sequence reads from paired-end and mate-pair libraries of different insert sizes, to build scaffolds. Errors were corrected and gaps filled with Illumina paired-end reads and contaminants detected, resulting in a total of 17 910 scaffolds (> 2 kb) corresponding to 1.34 Gb. Fifty per cent of the assembly was accounted for by 1468 scaffolds (N50 of 260 kb). Initial comparison with the phylogenetically related Prunus persica gene model indicated that genes for 84.6% of the proteins present in peach (mean protein coverage of 90.5%) were present in our assembly. The second and third steps in this project are genome annotation and the assignment of scaffolds to the oak genetic linkage map. In accordance with the Bermuda and Fort Lauderdale agreements and the more recent Toronto Statement, the oak genome data have been released into public sequence repositories in advance of publication. In this presubmission paper, the oak genome consortium describes its principal lines of work and future directions for analyses of the nature, function and evolution of the oak genome.
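The abstract summarizes assembly contiguity with an N50 of 260 kb, reached by 1468 scaffolds. As a minimal sketch of how such figures are conventionally computed (not the consortium's own pipeline), the function below sorts scaffold lengths from longest to shortest and returns the length at which the running total first reaches half of the assembly, together with the number of scaffolds needed to get there. The function name `n50_l50` and the toy lengths are illustrative assumptions, not values from the paper.

```python
def n50_l50(lengths):
    """Return (N50, L50) for an iterable of scaffold lengths in bp.

    N50 is the length of the scaffold at which the cumulative sum of
    lengths, taken from longest to shortest, first reaches half of the
    total assembly size. L50 is the number of scaffolds counted so far.
    """
    ordered = sorted(lengths, reverse=True)
    half = sum(ordered) / 2
    running = 0
    for count, length in enumerate(ordered, start=1):
        running += length
        if running >= half:
            return length, count
    raise ValueError("lengths must be non-empty")


# Hypothetical toy assembly (not data from the oak genome).
toy_lengths = [100, 80, 60, 40, 20]
n50, l50 = n50_l50(toy_lengths)
print(f"N50 = {n50} bp, reached with {l50} scaffolds")
```

Read this way, the reported figures mean that the 1468 longest scaffolds together cover half of the 1.34 Gb assembly, and the shortest of those is about 260 kb.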
Pages: 254-265
Page count: 12