De novo chromosome level assembly of a plant genome from long read sequence data

被引:17
作者
Sharma, Priyanka [1 ]
Masouleh, Ardashir Kharabian [1 ]
Topp, Bruce [1 ]
Furtado, Agnelo [1 ]
Henry, Robert J. [1 ,2 ]
机构
[1] Univ Queensland, Queensland Alliance Agr & Food Innovat, Brisbane, Qld 4072, Australia
[2] Univ Queensland, ARC Ctr Excellence Plant Success Nat & Agr, Brisbane, Qld 4072, Australia
关键词
de novo genome assembly; HiFiasm; HiFi reads; nuclear genome; plastid genome; mitochondrial genome; nuclear ribosomal RNA; technical advance; Macadamia jansenii; MACADAMIA; ANNOTATION;
D O I
10.1111/tpj.15583
中图分类号
Q94 [植物学];
学科分类号
071001 ;
摘要
Recent advances in the sequencing and assembly of plant genomes have allowed the generation of genomes with increasing contiguity and sequence accuracy. Chromosome level genome assemblies using sequence contigs generated from long read sequencing have involved the use of proximity analysis (Hi-C) or traditional genetic maps to guide the placement of sequence contigs within chromosomes. The development of highly accurate long reads by repeated sequencing of circularized DNA (HiFi; PacBio) has greatly increased the size of contigs. We now report the use of HiFiasm to assemble the genome of Macadamia jansenii, a genome that has been used as a model to test sequencing and assembly. This achieved almost complete chromosome level assembly from the sequence data alone without the need for higher level chromosome map information. Eight of the 14 chromosomes were represented by a single large contig (six with telomere repeats at both ends) and the other six assembled from two to four main contigs. The small number of chromosome breaks appears to be the result of highly repetitive regions including ribosomal genes that cannot be assembled by these approaches. De novo assembly of near complete chromosome level plant genomes now appears possible using these sequencing and assembly tools. Further targeted strategies might allow these remaining gaps to be closed.
引用
收藏
页码:727 / 736
页数:10
相关论文
共 31 条
[1]   Early growth and graft success in macadamia seedling and cutting rootstocks [J].
Alam, M. M. ;
Wilkie, J. ;
Topp, B. L. .
INTERNATIONAL SYMPOSIA ON TROPICAL AND TEMPERATE HORTICULTURE - ISTTH2016, 2018, 1205 :637-643
[2]   Opportunities and challenges in long-read sequencing data analysis [J].
Amarasinghe, Shanika L. ;
Su, Shian ;
Dong, Xueyi ;
Zappia, Luke ;
Ritchie, Matthew E. ;
Gouil, Quentin .
GENOME BIOLOGY, 2020, 21 (01)
[3]   D-GENIES: dot plot large genomes in an interactive, efficient and simple way [J].
Cabanettes, Floreal ;
Klopp, Christophe .
PEERJ, 2018, 6
[4]   Recent advances in sequence assembly: principles and applications [J].
Chen, Qingfeng ;
Lan, Chaowang ;
Zhao, Liang ;
Wang, Jianxin ;
Chen, Baoshan ;
Chen, Yi-Ping Phoebe .
BRIEFINGS IN FUNCTIONAL GENOMICS, 2017, 16 (06) :361-378
[5]   Haplotype-resolved de novo assembly using phased assembly graphs with hifiasm [J].
Cheng, Haoyu ;
Concepcion, Gregory T. ;
Feng, Xiaowen ;
Zhang, Haowen ;
Li, Heng .
NATURE METHODS, 2021, 18 (02) :170-+
[6]   LeafGo: Leaf to Genome, a quick workflow to produce high-quality de novo plant genomes using long-read sequencing technology [J].
Driguez, Patrick ;
Bougouffa, Salim ;
Carty, Karen ;
Putra, Alexander ;
Jabbari, Kamel ;
Reddy, Muppala ;
Soppe, Richard ;
Cheung, Ming Sin ;
Fukasawa, Yoshinori ;
Ermini, Luca .
GENOME BIOLOGY, 2021, 22 (01)
[7]   De novo assembly of the Aedes aegypti genome using Hi-C yields chromosome-length scaffolds [J].
Dudchenko, Olga ;
Batra, Sanjit S. ;
Omer, Arina D. ;
Nyquist, Sarah K. ;
Hoeger, Marie ;
Durand, Neva C. ;
Shamim, Muhammad S. ;
Machol, Ido ;
Lander, Eric S. ;
Aiden, Aviva Presser ;
Aiden, Erez Lieberman .
SCIENCE, 2017, 356 (6333) :92-95
[8]   Using linkage maps to correct and scaffold de novo genome assemblies: methods, challenges, and computational tools [J].
Fierst, Janna L. .
FRONTIERS IN GENETICS, 2015, 6
[9]  
Gross C. L., 1992, Australian Systematic Botany, V5, P725, DOI 10.1071/SB9920725
[10]   QUAST: quality assessment tool for genome assemblies [J].
Gurevich, Alexey ;
Saveliev, Vladislav ;
Vyahhi, Nikolay ;
Tesler, Glenn .
BIOINFORMATICS, 2013, 29 (08) :1072-1075