A high-quality de novo genome assembly based on nanopore sequencing of a wild-caught coconut rhinoceros beetle (Oryctes rhinoceros)

被引:10
作者
Filipovic, Igor [1 ,2 ]
Rasic, Gordana [2 ]
Hereward, James [1 ]
Gharuka, Maria [3 ]
Devine, Gregor J. [2 ]
Furlong, Michael J. [1 ]
Etebari, Kayvan [1 ]
机构
[1] Univ Queensland, Sch Biol Sci, St Lucia, Qld, Australia
[2] QIMR Berghofer Med Res Inst, Mosquito Control Lab, Brisbane, Qld, Australia
[3] Minist Agr & Livestock, Div Res, Honiara, Solomon Islands
关键词
Genome assembly; Genome annotation; Single insect nanopore sequencing; Oryctes rhinoceros; Coleoptera; ANNOTATION; GENERATION; ALIGNMENT; MOSQUITO; GENES; BUSCO;
D O I
10.1186/s12864-022-08628-z
中图分类号
Q81 [生物工程学(生物技术)]; Q93 [微生物学];
学科分类号
071005 ; 0836 ; 090102 ; 100705 ;
摘要
Background An optimal starting point for relating genome function to organismal biology is a high-quality nuclear genome assembly, and long-read sequencing is revolutionizing the production of this genomic resource in insects. Despite this, nuclear genome assemblies have been under-represented for agricultural insect pests, particularly from the order Coleoptera. Here we present a de novo genome assembly and structural annotation for the coconut rhinoceros beetle, Oryctes rhinoceros (Coleoptera: Scarabaeidae), based on Oxford Nanopore Technologies (ONT) long-read data generated from a wild-caught female, as well as the assembly process that also led to the recovery of the complete circular genome assemblies of the beetle's mitochondrial genome and that of the biocontrol agent, Oryctes rhinoceros nudivirus (OrNV). As an invasive pest of palm trees, O. rhinoceros is undergoing an expansion in its range across the Pacific Islands, requiring new approaches to management that may include strategies facilitated by genome assembly and annotation. Results High-quality DNA isolated from an adult female was used to create four ONT libraries that were sequenced using four MinION flow cells, producing a total of 27.2 Gb of high-quality long-read sequences. We employed an iterative assembly process and polishing with one lane of high-accuracy Illumina reads, obtaining a final size of the assembly of 377.36 Mb that had high contiguity (fragment N50 length = 12 Mb) and accuracy, as evidenced by the exceptionally high completeness of the benchmarked set of conserved single-copy orthologous genes (BUSCO completeness = 99.1%). These quality metrics place our assembly ahead of the published Coleopteran genomes, including that of an insect model, the red flour beetle (Tribolium castaneum). The structural annotation of the nuclear genome assembly contained a highly-accurate set of 16,371 protein-coding genes, with only 2.8% missing BUSCOs, and the expected number of non-coding RNAs. The number and structure of paralogous genes in a gene family like Sigma GST is lower than in another scarab beetle (Onthophagus taurus), but higher than in the red flour beetle (Tribolium castaneum), which suggests expansion of this GST class in Scarabaeidae. The quality of our gene models was also confirmed with the correct placement of O. rhinoceros among other members of the rhinoceros beetles (subfamily Dynastinae) in a phylogeny based on the sequences of 95 protein-coding genes in 373 beetle species from all major lineages of Coleoptera. Finally, we provide a list of 30 candidate dsRNA targets whose orthologs have been experimentally validated as highly effective targets for RNAi-based control of several beetles. Conclusions The genomic resources produced in this study form a foundation for further functional genetic research and management programs that may inform the control and surveillance of O. rhinoceros populations, and we demonstrate the efficacy of de novo genome assembly using long-read ONT data from a single field-caught insect.
引用
收藏
页数:15
相关论文
共 73 条
  • [1] One fly-one genome: chromosome-scale genome assembly of a single outbred Drosophila melanogaster
    Adams, Matthew
    McBroome, Jakob
    Maurer, Nicholas
    Pepper-Tunick, Evan
    Saremi, Nedda
    Green, Richard E.
    Vollmers, Christopher
    Corbett-Detig, Russell B.
    [J]. NUCLEIC ACIDS RESEARCH, 2020, 48 (13)
  • [2] Dataset of de novo assembly and functional annotation of the transcriptome of certain developmental stages of coconut rhinoceros beetle, Oryctes rhinoceros L.
    Arvind, Kumar
    Rajesh, M. K.
    Josephrajkumar, A.
    Grace, Tony
    [J]. DATA IN BRIEF, 2020, 28
  • [3] Regional sequence expansion or collapse in heterozygous genome assemblies
    Asalone, Kathryn C.
    Ryan, Kara M.
    Yamadi, Maryam
    Cohen, Annastelle L.
    Farmer, William G.
    George, Deborah J.
    Joppert, Claudia
    Kim, Kaitlyn
    Mughal, Madeeha Froze
    Said, Rana
    Toksoz-Exley, Metin
    Bisk, Evgeny
    Bracht, John R.
    [J]. PLOS COMPUTATIONAL BIOLOGY, 2020, 16 (07)
  • [4] BamTools: a C++ API and toolkit for analyzing and managing BAM files
    Barnett, Derek W.
    Garrison, Erik K.
    Quinlan, Aaron R.
    Stroemberg, Michael P.
    Marth, Gabor T.
    [J]. BIOINFORMATICS, 2011, 27 (12) : 1691 - 1692
  • [5] Bedford GO, 2018, ADV ENTOMOL, V06, P27, DOI DOI 10.4236/AE.2018.61004
  • [6] Trimmomatic: a flexible trimmer for Illumina sequence data
    Bolger, Anthony M.
    Lohse, Marc
    Usadel, Bjoern
    [J]. BIOINFORMATICS, 2014, 30 (15) : 2114 - 2120
  • [7] Fast and sensitive protein alignment using DIAMOND
    Buchfink, Benjamin
    Xie, Chao
    Huson, Daniel H.
    [J]. NATURE METHODS, 2015, 12 (01) : 59 - 60
  • [8] Synthetically engineered Medea gene drive system in the worldwide crop pest Drosophila suzukii
    Buchman, Anna
    Marshall, John M.
    Ostrovski, Dennis
    Yang, Ting
    Akbari, Omar S.
    [J]. PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2018, 115 (18) : 4725 - 4730
  • [9] How to sequence and annotate insect mitochondrial genomes for systematic and comparative genomics research
    Cameron, Stephen L.
    [J]. SYSTEMATIC ENTOMOLOGY, 2014, 39 (03) : 400 - 411
  • [10] Next-generation gene drive for population modification of the malaria vector mosquito, Anopheles gambiae
    Carballar-Lejarazu, Rebeca
    Ogaugwu, Christian
    Tushar, Taylor
    Kelsey, Adam
    Thai Binh Pham
    Murphy, Jazmin
    Schmidt, Hanno
    Lee, Yoosook
    Lanzaro, Gregory C.
    James, Anthony A.
    [J]. PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2020, 117 (37) : 22805 - 22814