Whole-genome sequence of the oriental lung fluke Paragonimus westermani

被引:31
作者
Oey, Harald [1 ]
Zakrzewski, Martha [2 ]
Narain, Kanwar [6 ]
Devi, K. Rekha [6 ]
Agatsuma, Takeshi [7 ]
Nawaratna, Sujeevi [2 ,8 ]
Gobert, Geoffrey N. [2 ,4 ]
Jones, Malcolm K. [3 ]
Ragan, Mark A. [5 ]
McManus, Donald P. [2 ]
Krause, Lutz [1 ,2 ]
机构
[1] Univ Queensland, Fac Med, Diamantina Inst, Translat Res Inst, 37 Kent St, Woolloongabba, Qld 4102, Australia
[2] QIMR Berghofer Med Res Inst, Dept Immunol, Mol Parasitol Lab, 300 Herston Rd, Brisbane, Qld 4006, Australia
[3] Univ Queensland, Sch Vet Sci, Gatton, Qld 4343, Australia
[4] Queens Univ Belfast, Sch Biol Sci, 19 Chlorine Gardens, Belfast BT9 5DL, Antrim, North Ireland
[5] Univ Queensland, Inst Mol Biosci, 306 Carmody Rd, St Lucia, Qld 4072, Australia
[6] ICMR Reg Med Res Ctr, Dibrugarh 786010, Assam, India
[7] Kochi Univ, Dept Environm Med, Nankoku, Kochi 7838505, Japan
[8] Griffith Univ, Sch Med, Gold Coast Campus, Southport, Qld 4222, Australia
关键词
Paragonimus westermani; whole-genome sequence; genome assembly; paragonimiasis; food-borne disease; oriental lung fluke; parasitic infection; comparative genomics; neglected tropical disease; flatworm; HIGH-ACCURACY; RNA; PHYLOGENIES; ANNOTATION; INFECTION; ASSEMBLER; ALIGNMENT; TOOL;
D O I
10.1093/gigascience/giy146
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Background: Foodborne infections caused by lung flukes of the genus Paragonimus are a significant and widespread public health problem in tropical areas. Approximately 50 Paragonimus species have been reported to infect animals and humans, but Paragonimus westermani is responsible for the bulk of human disease. Despite their medical and economic importance, no genome sequence for any Paragonimus species is available. Results: We sequenced and assembled the genome of P. westermani, which is among the largest of the known pathogen genomes with an estimated size of 1.1 Gb. A 922.8 Mb genome assembly was generated from Illumina and Pacific Biosciences (PacBio) sequence data, covering 84% of the estimated genome size. The genome has a high proportion (45%) of repeat-derived DNA, particularly of the long interspersed element and long terminal repeat subtypes, and the expansion of these elements may explain some of the large size. We predicted 12,852 protein coding genes, showing a high level of conservation with related trematode species. The majority of proteins (80%) had homologs in the human liver fluke Opisthorchis viverrini, with an average sequence identity of 64.1%. Assembly of the P. westermani mitochondrial genome from long PacBio reads resulted in a single high-quality circularized 20.6 kb contig. The contig harbored a 6.9 kb region of non-coding repetitive DNA comprised of three distinct repeat units. Our results suggest that the region is highly polymorphic in P. westermani, possibly even within single worm isolates. Conclusions: The generated assembly represents the first Paragonimus genome sequence and will facilitate future molecular studies of this important, but neglected, parasite group.
引用
收藏
页数:8
相关论文
共 54 条
[1]  
BaetsDe K, 2015, ADV PARASIT, V90, P93
[2]   MITOS: Improved de novo metazoan mitochondrial genome annotation [J].
Bernt, Matthias ;
Donath, Alexander ;
Juehling, Frank ;
Externbrink, Fabian ;
Florentz, Catherine ;
Fritzsch, Guido ;
Puetz, Joern ;
Middendorf, Martin ;
Stadler, Peter F. .
MOLECULAR PHYLOGENETICS AND EVOLUTION, 2013, 69 (02) :313-319
[3]   The genome of the blood fluke Schistosoma mansoni [J].
Berriman, Matthew ;
Haas, Brian J. ;
LoVerde, Philip T. ;
Wilson, R. Alan ;
Dillon, Gary P. ;
Cerqueira, Gustavo C. ;
Mashiyama, Susan T. ;
Al-Lazikani, Bissan ;
Andrade, Luiza F. ;
Ashton, Peter D. ;
Aslett, Martin A. ;
Bartholomeu, Daniella C. ;
Blandin, Gaelle ;
Caffrey, Conor R. ;
Coghlan, Avril ;
Coulson, Richard ;
Day, Tim A. ;
Delcher, Art ;
DeMarco, Ricardo ;
Djikeng, Appolinaire ;
Eyre, Tina ;
Gamble, John A. ;
Ghedin, Elodie ;
Gu, Yong ;
Hertz-Fowler, Christiane ;
Hirai, Hirohisha ;
Hirai, Yuriko ;
Houston, Robin ;
Ivens, Alasdair ;
Johnston, David A. ;
Lacerda, Daniela ;
Macedo, Camila D. ;
McVeigh, Paul ;
Ning, Zemin ;
Oliveira, Guilherme ;
Overington, John P. ;
Parkhill, Julian ;
Pertea, Mihaela ;
Pierce, Raymond J. ;
Protasio, Anna V. ;
Quail, Michael A. ;
Rajandream, Marie-Adele ;
Rogers, Jane ;
Sajid, Mohammed ;
Salzberg, Steven L. ;
Stanke, Mario ;
Tivey, Adrian R. ;
White, Owen ;
Williams, David L. ;
Wortman, Jennifer .
NATURE, 2009, 460 (7253) :352-U65
[4]   Paragonimiasis [J].
Blair, David .
DIGENETIC TREMATODES, 2014, 766 :115-152
[5]   Scaffolding pre-assembled contigs using SSPACE [J].
Boetzer, Marten ;
Henkel, Christiaan V. ;
Jansen, Hans J. ;
Butler, Derek ;
Pirovano, Walter .
BIOINFORMATICS, 2011, 27 (04) :578-579
[6]   BEAST 2: A Software Platform for Bayesian Evolutionary Analysis [J].
Bouckaert, Remco ;
Heled, Joseph ;
Kuehnert, Denise ;
Vaughan, Tim ;
Wu, Chieh-Hsi ;
Xie, Dong ;
Suchard, Marc A. ;
Rambaut, Andrew ;
Drummond, Alexei J. .
PLOS COMPUTATIONAL BIOLOGY, 2014, 10 (04)
[7]   trimAl: a tool for automated alignment trimming in large-scale phylogenetic analyses [J].
Capella-Gutierrez, Salvador ;
Silla-Martinez, Jose M. ;
Gabaldon, Toni .
BIOINFORMATICS, 2009, 25 (15) :1972-1973
[8]   Using the miraEST assembler for reliable and automated mRNA transcript assembly and SNP detection in sequenced ESTs [J].
Chevreux, B ;
Pfisterer, T ;
Drescher, B ;
Driesel, AJ ;
Müller, WEG ;
Wetter, T ;
Suhai, S .
GENOME RESEARCH, 2004, 14 (06) :1147-1159
[9]   The Fasciola hepatica genome: gene duplication and polymorphism reveals adaptation to the host environment and the capacity for rapid evolution [J].
Cwiklinski, Krystyna ;
Dalton, John Pius ;
Dufresne, Philippe J. ;
La Course, James ;
Williams, Diana J. L. ;
Hodgkinson, Jane ;
Paterson, Steve .
GENOME BIOLOGY, 2015, 16
[10]   STAR: ultrafast universal RNA-seq aligner [J].
Dobin, Alexander ;
Davis, Carrie A. ;
Schlesinger, Felix ;
Drenkow, Jorg ;
Zaleski, Chris ;
Jha, Sonali ;
Batut, Philippe ;
Chaisson, Mark ;
Gingeras, Thomas R. .
BIOINFORMATICS, 2013, 29 (01) :15-21