Assembly and comparison of two closely related Brassica napus genomes

被引:118
作者
Bayer, Philipp E. [1 ]
Hurgobin, Bhavna [1 ,2 ]
Golicz, Agnieszka A. [3 ]
Chan, Chon-Kit Kenneth [1 ]
Yuan, Yuxuan [1 ]
Lee, HueyTyng [1 ,2 ]
Renton, Michael [1 ,4 ]
Meng, Jinling [5 ]
Li, Ruiyuan [5 ]
Long, Yan [5 ]
Zou, Jun [5 ]
Bancroft, Ian [6 ]
Chalhoub, Boulos [7 ,8 ]
King, Graham J. [5 ,9 ]
Batley, Jacqueline [1 ]
Edwards, David [1 ]
机构
[1] Univ Western Australia, Sch Biol Sci, Crawley, WA, Australia
[2] Univ Queensland, Sch Agr & Food Sci, St Lucia, Qld, Australia
[3] Univ Melbourne, Fac Vet & Agr Sci, Plant Mol Biol & Biotechnol Lab, Melbourne, Vic, Australia
[4] Univ Western Australia, Sch Agr & Environm, Crawley, WA, Australia
[5] Huazhong Agr Univ, Minist Agr PR China, Key Lab Rapeseed Genet Improvement, Natl Key Lab Crop Genet Improvement, Wuhan, Hubei, Peoples R China
[6] Univ York, Dept Biol, York, N Yorkshire, England
[7] UEVE, INRA, OECG, Evry, France
[8] Univ Paris Saclay, Univ Evry Val Essonne, CNRS, Inst Syst & Synthet Biol,Genopole, Evry, France
[9] Southern Cross Univ, Southern Cross Plant Sci, Lismore, NSW, Australia
基金
英国生物技术与生命科学研究理事会; 澳大利亚研究理事会;
关键词
genome assembly; whole genome comparison; genotyping by sequencing; genome assembly improvement; Brassica napus; Tapidor; contigPlacer; HOMEOLOGOUS RECOMBINATION; MAPPING POPULATIONS; SEQUENCING REVEALS; CICER-ARIETINUM; GENE; EVOLUTION; ALIGNMENT; TRANSCRIPTOME; ANNOTATION; REARRANGEMENTS;
D O I
10.1111/pbi.12742
中图分类号
Q81 [生物工程学(生物技术)]; Q93 [微生物学];
学科分类号
071005 ; 0836 ; 090102 ; 100705 ;
摘要
As an increasing number of plant genome sequences become available, it is clear that gene content varies between individuals, and the challenge arises to predict the gene content of a species. However, genome comparison is often confounded by variation in assembly and annotation. Differentiating between true gene absence and variation in assembly or annotation is essential for the accurate identification of conserved and variable genes in a species. Here, we present the de novo assembly of the B.napus cultivar Tapidor and comparison with an improved assembly of the Brassicanapus cultivar Darmor-bzh. Both cultivars were annotated using the same method to allow comparison of gene content. We identified genes unique to each cultivar and differentiate these from artefacts due to variation in the assembly and annotation. We demonstrate that using a common annotation pipeline can result in different gene predictions, even for closely related cultivars, and repeat regions which collapse during assembly impact whole genome comparison. After accounting for differences in assembly and annotation, we demonstrate that the genome of Darmor-bzh contains a greater number of genes than the genome of Tapidor. Our results are the first step towards comparison of the true differences between B.napus genomes and highlight the potential sources of error in future production of a B.napus pangenome.
引用
收藏
页码:1602 / 1610
页数:9
相关论文
共 58 条
[1]  
Agarwala R, 2015, NUCLEIC ACIDS RES, V43, pD6, DOI [10.1093/nar/gku1130, 10.1093/nar/gkv1290]
[2]  
Alexa A., 2010, topgo: Enrichment analysis for gene ontology. R package version 2.38.1, V2
[3]   BASIC LOCAL ALIGNMENT SEARCH TOOL [J].
ALTSCHUL, SF ;
GISH, W ;
MILLER, W ;
MYERS, EW ;
LIPMAN, DJ .
JOURNAL OF MOLECULAR BIOLOGY, 1990, 215 (03) :403-410
[4]  
[Anonymous], NAT GENET
[5]  
[Anonymous], 2012, ARXIV12034802QBICOGN
[6]  
[Anonymous], PLANT MOL BIOL REP
[7]  
[Anonymous], NAT GENET
[8]  
[Anonymous], 1935, J JPN BOT
[9]  
[Anonymous], VELVETOPTIMISER VERS
[10]   High-resolution skim genotyping by sequencing reveals the distribution of crossovers and gene conversions in Cicer arietinum and Brassica napus [J].
Bayer, Philipp E. ;
Ruperao, Pradeep ;
Mason, Annaliese S. ;
Stiller, Jiri ;
Chan, Chon-Kit Kenneth ;
Hayashi, Satomi ;
Long, Yan ;
Meng, Jinling ;
Sutton, Tim ;
Visendi, Paul ;
Varshney, Rajeev K. ;
Batley, Jacqueline ;
Edwards, David .
THEORETICAL AND APPLIED GENETICS, 2015, 128 (06) :1039-1047