Genome sequence of the olive tree, Olea europaea

Cited by: 165
Authors
Cruz, Fernando [1, 2]
Julca, Irene [2, 3, 4]
Gomez-Garrido, Jessica [1, 2]
Loska, Damian [2, 3]
Marcet-Houben, Marina [2, 3]
Cano, Emilio [5]
Galan, Beatriz [6]
Frias, Leonor [1, 2]
Ribeca, Paolo [1, 2]
Derdak, Sophia [1, 2]
Gut, Marta [1, 2]
Sanchez-Fernandez, Manuel [7]
Luis Garcia, Jose
Gut, Ivo G. [1, 2]
Vargas, Pablo [5, 11]
Alioto, Tyler S. [1, 2, 10]
Gabaldon, Toni [2, 3, 8, 9]
Affiliations
[1] Barcelona Inst Sci & Technol, Ctr Genom Regulat, CNAG CRG, Baldiri i Reixac 4, Barcelona 08028, Spain
[2] Univ Pompeu Fabra, Barcelona 08003, Spain
[3] Barcelona Inst Sci & Technol, Bioinformat & Genom Dept, Ctr Genom Regulat, Dr Aiguader 88, Barcelona 08003, Spain
[4] Univ Autonoma Barcelona, E-08193 Barcelona, Spain
[5] CSIC, Royal Bot Garden Madrid, Plaza Murillo 2, E-28014 Madrid, Spain
[6] CSIC, Ctr Invest Biol, Dept Biol Ambiental, Madrid 28040, Spain
[7] Paisajismo Area Corporat Inmuebles, Grp Santander, Madrid, Spain
[8] ICREA, Pg Lluis Companys 23, Barcelona 08010, Spain
[9] Ctr Genom Regulat, Doctor Aiguader 88, Barcelona 08003, Spain
[10] Ctr Nacl Anal Genom CNAG CRG, Baldiri Reixac 4, Barcelona 08028, Spain
[11] Pablo Vargas Royal Bot Garden Madrid, Plaza Murillo 2, Madrid 28014, Spain
Funding
Biotechnology and Biological Sciences Research Council (UK)
Keywords
Olive tree genome; Genomics; Assembly; Annotation; Alignment; RNA; Accurate; Program; Genes
DOI
10.1186/s13742-016-0134-5
Chinese Library Classification number
Q [Biological Sciences]
Discipline classification code
07; 0710; 09
Abstract
Background: The Mediterranean olive tree (Olea europaea subsp. europaea) was one of the first trees to be domesticated and is currently of major agricultural importance in the Mediterranean region as the source of olive oil. The molecular bases underlying the phenotypic differences among domesticated cultivars, or between domesticated olive trees and their wild relatives, remain poorly understood. Both wild and cultivated olive trees have 46 chromosomes (2n).
Findings: A total of 543 Gb of raw DNA sequence from whole-genome shotgun sequencing and a fosmid library containing 155,000 clones, from an olive tree more than 1,000 years old (cv. Farga), was generated by Illumina sequencing using different combinations of mate-pair and paired-end libraries. Assembly gave a final genome with a scaffold N50 of 443 kb and a total length of 1.31 Gb, which represents 95% of the estimated genome length (1.38 Gb). In addition, the associated fungus Aureobasidium pullulans was partially sequenced. Genome annotation, assisted by RNA sequencing from leaf, root, and fruit tissues at various stages, resulted in 56,349 unique protein-coding genes, suggesting recent genomic expansion. Genome completeness, as estimated using the CEGMA pipeline, reached 98.79%.
Conclusions: The assembled draft genome of O. europaea will provide a valuable resource for the study of the evolution and domestication processes of this important tree, and allow determination of the genetic bases of key phenotypic traits. Moreover, it will enhance breeding programs and the formation of new varieties.
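The two assembly statistics quoted in the abstract, the scaffold N50 and the share of the estimated genome length covered by the assembly, can be illustrated with a minimal Python sketch. The function names and the toy scaffold lengths are hypothetical and chosen only for illustration; this is not the authors' pipeline, and only the 1.31 Gb and 1.38 Gb figures come from the abstract.

```python
def n50(lengths):
    """Return the N50 of a set of sequence lengths.

    N50 is the length L such that sequences of length >= L
    together cover at least half of the total assembled length.
    """
    ordered = sorted(lengths, reverse=True)
    half_total = sum(ordered) / 2
    running = 0
    for length in ordered:
        running += length
        if running >= half_total:
            return length
    raise ValueError("lengths must be non-empty")


def percent_of_estimate(assembled_gb, estimated_gb):
    """Assembled length as a percentage of the estimated genome length."""
    return 100 * assembled_gb / estimated_gb


# Hypothetical toy scaffold lengths (not data from the paper).
toy_scaffolds = [100, 80, 60, 40, 20]
print(n50(toy_scaffolds))

# Figures from the abstract: 1.31 Gb assembled, 1.38 Gb estimated.
print(round(percent_of_estimate(1.31, 1.38)))
```

With the abstract's figures, the ratio 1.31 / 1.38 is about 0.949, which rounds to the reported 95%.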
Pages: 12