Two long read-based genome assembly and annotation of polyploidy woody plants, Hibiscus syriacus L. using PacBio and Nanopore platforms

被引:0
作者
Hyunjin Koo
Gir-Won Lee
Seo-Rin Ko
Sangjin Go
Suk-Yoon Kwon
Yong-Min Kim
Ah-Young Shin
机构
[1] Korea Research Institute of Bioscience and Biotechnology (KRIBB),Plant Systems Engineering Research Center
[2] SML Genetree Co. Ltd.,Biosystems and Bioengineering Program
[3] University of Science and Technology,Department of Bioinformatics, KRIBB School of Bioscience
[4] Korea University of Science and Technology (UST),Digital Biotech Innovation Center
[5] Korea Research Institute of Bioscience and Biotechnology (KRIBB),undefined
来源
Scientific Data | / 10卷
关键词
D O I
暂无
中图分类号
学科分类号
摘要
Improvements in long read DNA sequencing and related techniques facilitated the generation of complex eukaryotic genomes. Despite these advances, the quality of constructed plant reference genomes remains relatively poor due to the large size of genomes, high content of repetitive sequences, and wide variety of ploidy. Here, we developed the de novo sequencing and assembly of high polyploid plant genome, Hibiscus syriacus, a flowering plant species of the Malvaceae family, using the Oxford Nanopore Technologies and Pacific Biosciences Sequel sequencing platforms. We investigated an efficient combination of high-quality and high-molecular-weight DNA isolation procedure and suitable assembler to achieve optimal results using long read sequencing data. We found that abundant ultra-long reads allow for large and complex polyploid plant genome assemblies with great recovery of repetitive sequences and error correction even at relatively low depth Nanopore sequencing data and polishing compared to previous studies. Collectively, our combination provides cost effective methods to improve genome continuity and quality compared to the previously reported reference genome by accessing highly repetitive regions. The application of this combination may enable genetic research and breeding of polyploid crops, thus leading to improvements in crop production.
引用
收藏
相关论文
共 96 条
  • [1] Aury J-M(2022)Long-read and chromosome-scale assembly of the hexaploid wheat genome achieves high resolution for research and breeding GigaScience 11 17-28
  • [2] Faulk C(2023)De novo sequencing, diploid assembly, and annotation of the black carpenter ant, Camponotus pennsylvanicus, and its symbionts by one person for $1000, using nanopore sequencing Nucleic acids research 51 e2115640118-696
  • [3] Kress WJ(2022)Green plant genomes: What we know in an era of rapidly expanding opportunities Proceedings of the National Academy of Sciences 119 e5-33
  • [4] Pucker B(2022)Plant genome sequence assembly in the era of long reads: Progress, challenges and future directions Quantitative Plant Biology 3 688-1578
  • [5] Irisarri I(2014)Reconstructing complex regions of genomes using long-read sequencing technology Genome research 24 26-2348
  • [6] de Vries J(2020)Building near-complete plant genomes Current Opinion in Plant Biology 54 1571-3085
  • [7] Xu B(2021)Representation and participation across 20 years of plant genome sequencing Nature plants 7 2336-1004
  • [8] Huddleston J(2017)De novo assembly of a new Solanum pennellii accession using nanopore sequencing The Plant Cell 29 3079-8
  • [9] Michael TP(2020)The draft nuclear genome assembly of Eucalyptus pauciflora: a pipeline for comparing de novo assemblies Gigascience 9 990-70
  • [10] VanBuren R(2019)De novo genome sequence assemblies of Gossypium raimondii and Gossypium turneri G3: Genes, Genomes, Genetics 9 1-15