Assembly and comparison of two closely related Brassica napus genomes

被引:118
作者
Bayer, Philipp E. [1 ]
Hurgobin, Bhavna [1 ,2 ]
Golicz, Agnieszka A. [3 ]
Chan, Chon-Kit Kenneth [1 ]
Yuan, Yuxuan [1 ]
Lee, HueyTyng [1 ,2 ]
Renton, Michael [1 ,4 ]
Meng, Jinling [5 ]
Li, Ruiyuan [5 ]
Long, Yan [5 ]
Zou, Jun [5 ]
Bancroft, Ian [6 ]
Chalhoub, Boulos [7 ,8 ]
King, Graham J. [5 ,9 ]
Batley, Jacqueline [1 ]
Edwards, David [1 ]
机构
[1] Univ Western Australia, Sch Biol Sci, Crawley, WA, Australia
[2] Univ Queensland, Sch Agr & Food Sci, St Lucia, Qld, Australia
[3] Univ Melbourne, Fac Vet & Agr Sci, Plant Mol Biol & Biotechnol Lab, Melbourne, Vic, Australia
[4] Univ Western Australia, Sch Agr & Environm, Crawley, WA, Australia
[5] Huazhong Agr Univ, Minist Agr PR China, Key Lab Rapeseed Genet Improvement, Natl Key Lab Crop Genet Improvement, Wuhan, Hubei, Peoples R China
[6] Univ York, Dept Biol, York, N Yorkshire, England
[7] UEVE, INRA, OECG, Evry, France
[8] Univ Paris Saclay, Univ Evry Val Essonne, CNRS, Inst Syst & Synthet Biol,Genopole, Evry, France
[9] Southern Cross Univ, Southern Cross Plant Sci, Lismore, NSW, Australia
基金
英国生物技术与生命科学研究理事会; 澳大利亚研究理事会;
关键词
genome assembly; whole genome comparison; genotyping by sequencing; genome assembly improvement; Brassica napus; Tapidor; contigPlacer; HOMEOLOGOUS RECOMBINATION; MAPPING POPULATIONS; SEQUENCING REVEALS; CICER-ARIETINUM; GENE; EVOLUTION; ALIGNMENT; TRANSCRIPTOME; ANNOTATION; REARRANGEMENTS;
D O I
10.1111/pbi.12742
中图分类号
Q81 [生物工程学(生物技术)]; Q93 [微生物学];
学科分类号
071005 ; 0836 ; 090102 ; 100705 ;
摘要
As an increasing number of plant genome sequences become available, it is clear that gene content varies between individuals, and the challenge arises to predict the gene content of a species. However, genome comparison is often confounded by variation in assembly and annotation. Differentiating between true gene absence and variation in assembly or annotation is essential for the accurate identification of conserved and variable genes in a species. Here, we present the de novo assembly of the B.napus cultivar Tapidor and comparison with an improved assembly of the Brassicanapus cultivar Darmor-bzh. Both cultivars were annotated using the same method to allow comparison of gene content. We identified genes unique to each cultivar and differentiate these from artefacts due to variation in the assembly and annotation. We demonstrate that using a common annotation pipeline can result in different gene predictions, even for closely related cultivars, and repeat regions which collapse during assembly impact whole genome comparison. After accounting for differences in assembly and annotation, we demonstrate that the genome of Darmor-bzh contains a greater number of genes than the genome of Tapidor. Our results are the first step towards comparison of the true differences between B.napus genomes and highlight the potential sources of error in future production of a B.napus pangenome.
引用
收藏
页码:1602 / 1610
页数:9
相关论文
共 58 条
[21]  
Hayward Alice, 2012, Journal of Plant Biotechnology, V39, P49, DOI 10.5010/JPB.2012.39.1.049
[22]   The Arabidopsis SOMATIC EMBRYOGENESIS RECEPTOR KINASE 1 gene is expressed in developing ovules and embryos and enhances embryogenic competence in culture [J].
Hecht, V ;
Vielle-Calzada, JP ;
Hartog, MV ;
Schmidt, EDL ;
Boutilier, K ;
Grossniklaus, U ;
de Vries, SC .
PLANT PHYSIOLOGY, 2001, 127 (03) :803-816
[23]   Use of mRNA-seq to discriminate contributions to the transcriptome from the constituent genomes of the polyploid crop species Brassica napus [J].
Higgins, Janet ;
Magusin, Andreas ;
Trick, Martin ;
Fraser, Fiona ;
Bancroft, Ian .
BMC GENOMICS, 2012, 13
[24]   Insights into the Maize Pan-Genome and Pan-Transcriptome [J].
Hirsch, Candice N. ;
Foerster, Jillian M. ;
Johnson, James M. ;
Sekhon, Rajandeep S. ;
Muttoni, German ;
Vaillancourt, Brieanne ;
Penagaricano, Francisco ;
Lindquist, Erika ;
Pedraza, Mary Ann ;
Barry, Kerrie ;
de Leon, Natalia ;
Kaeppler, Shawn M. ;
Buell, C. Robin .
PLANT CELL, 2014, 26 (01) :121-135
[25]  
Joshi N., 2011, Sickle: A sliding-window, adaptive, quality-based trimming tool for FastQ files Version 1.33
[26]   A novel hybrid gene prediction method employing protein multiple sequence alignments [J].
Keller, Oliver ;
Kollmar, Martin ;
Stanke, Mario ;
Waack, Stephan .
BIOINFORMATICS, 2011, 27 (06) :757-763
[27]   TopHat2: accurate alignment of transcriptomes in the presence of insertions, deletions and gene fusions [J].
Kim, Daehwan ;
Pertea, Geo ;
Trapnell, Cole ;
Pimentel, Harold ;
Kelley, Ryan ;
Salzberg, Steven L. .
GENOME BIOLOGY, 2013, 14 (04)
[28]  
Langmead B, 2012, NAT METHODS, V9, P357, DOI [10.1038/NMETH.1923, 10.1038/nmeth.1923]
[29]   Ultrafast and memory-efficient alignment of short DNA sequences to the human genome [J].
Langmead, Ben ;
Trapnell, Cole ;
Pop, Mihai ;
Salzberg, Steven L. .
GENOME BIOLOGY, 2009, 10 (03)
[30]   Fast and accurate short read alignment with Burrows-Wheeler transform [J].
Li, Heng ;
Durbin, Richard .
BIOINFORMATICS, 2009, 25 (14) :1754-1760