High-quality haplotype-resolved genome assembly of cultivated octoploid strawberry

被引:38
作者
Mao, Jianxin [1 ]
Wang, Yan [1 ]
Wang, Baotian [1 ]
Li, Jiqi [1 ]
Zhang, Chao [1 ]
Zhang, Wenshuo [3 ]
Li, Xue [1 ]
Li, Jie [1 ]
Zhang, Junxiang [1 ,2 ]
Li, He [1 ,2 ]
Zhang, Zhihong [1 ,2 ]
机构
[1] Shenyang Agr Univ, Coll Hort, Liaoning Key Lab Strawberry Breeding & Cultivat, 120 Dongling Rd, Shenyang 110866, Peoples R China
[2] Shenyang Agr Univ, Lab Protected Hort, Minist Educ, Shenyang 110866, Peoples R China
[3] ShanghaiTech Univ, Sch Informat Sci & Technol, 393 Middle Huaxia Rd, Shanghai 201210, Peoples R China
基金
中国国家自然科学基金;
关键词
COMMON WHEAT; CHROMOSOME; IDENTIFICATION; GENES; REARRANGEMENTS; PREDICTION; ANCESTRY; ACCURATE; PROVIDES; PROGRAM;
D O I
10.1093/hr/uhad002
中图分类号
Q94 [植物学];
学科分类号
071001 ;
摘要
Cultivated strawberry (Fragaria x ananassa), a perennial herb belonging to the family Rosaceae, is a complex octoploid with high heterozygosity at most loci. However, there is no research on the haplotype of the octoploid strawberry genome. Here we aimed to obtain a high-quality genome of the cultivated strawberry cultivar, "Yanli", using single molecule real-time sequencing and high-throughput chromosome conformation capture technology. The "Yanli" genome was 823 Mb in size, with a long terminal repeat assembly index of 14.99. The genome was phased into two haplotypes, Hap1 (825 Mb with contig N50 of 26.70 Mb) and Hap2 (808 Mb with contig N50 of 27.51 Mb). Using the combination of Hap1 and Hap2, we obtained for the first time a haplotype-resolved genome with 56 chromosomes for the cultivated octoploid strawberry. We identified a similar to 10 Mb inversion and translocation on chromosome 2-1. 104 957 and 102 356 protein-coding genes were annotated in Hap1 and Hap2, respectively. Analysis of the genes related to the anthocyanin biosynthesis pathway revealed the structural diversity and complexity in the expression of the alleles in the octoploid F. x ananassa genome. In summary, we obtained a high-quality haplotype-resolved genome assembly of F. x ananassa, which will provide the foundation for investigating gene function and evolution of the genome of cultivated octoploid strawberry.
引用
收藏
页数:12
相关论文
共 90 条
[51]   Minimap2: pairwise alignment for nucleotide sequences [J].
Li, Heng .
BIOINFORMATICS, 2018, 34 (18) :3094-3100
[52]  
Li HL, 2021, HORTIC RES-ENGLAND, V8, DOI 10.1038/s41438-021-00627-7
[53]   Revisiting the origin of octoploid strawberry [J].
Liston, Aaron ;
Wei, Na ;
Tennessen, Jacob A. ;
Li, Junmin ;
Dong, Ming ;
Ashman, Tia-Lynn .
NATURE GENETICS, 2020, 52 (01) :2-+
[54]   tRNAscan-SE: A program for improved detection of transfer RNA genes in genomic sequence [J].
Lowe, TM ;
Eddy, SR .
NUCLEIC ACIDS RESEARCH, 1997, 25 (05) :955-964
[55]   TigrScan and GlimmerHMM:: two open source ab initio eukaryotic gene-finders [J].
Majoros, WH ;
Pertea, M ;
Salzberg, SL .
BIOINFORMATICS, 2004, 20 (16) :2878-2879
[56]   Genomics of Evolutionary Novelty in Hybrids and Polyploids [J].
Nieto Feliner, Gonzalo ;
Casacuberta, Josep ;
Wendel, Jonathan F. .
FRONTIERS IN GENETICS, 2020, 11
[57]   Compact and evenly distributed k-mer binning for genomic sequences [J].
Nystrom-Persson, Johan ;
Keeble-Gagnere, Gabriel ;
Zawad, Niamat .
BIOINFORMATICS, 2021, 37 (17) :2563-2569
[58]   LTR_retriever: A Highly Accurate and Sensitive Program for Identification of Long Terminal Repeat Retrotransposons [J].
Ou, Shujun ;
Jiang, Ning .
PLANT PHYSIOLOGY, 2018, 176 (02) :1410-1422
[59]   CEGMA: a pipeline to accurately annotate core genes in eukaryotic genornes [J].
Parra, Genis ;
Bradnam, Keith ;
Korf, Ian .
BIOINFORMATICS, 2007, 23 (09) :1061-1067
[60]   StringTie enables improved reconstruction of a transcriptome from RNA-seq reads [J].
Pertea, Mihaela ;
Pertea, Geo M. ;
Antonescu, Corina M. ;
Chang, Tsung-Cheng ;
Mendell, Joshua T. ;
Salzberg, Steven L. .
NATURE BIOTECHNOLOGY, 2015, 33 (03) :290-+