Resequencing and assembly of seven complex loci to improve the Leishmania major (Friedlin strain) reference genome

被引:12
作者
Alonso, Graciela [1 ]
Rastrojo, Alberto [1 ]
Lopez-Perez, Sara [1 ]
Requena, Jose M. [1 ]
Aguado, Begona [1 ]
机构
[1] Univ Autonoma Madrid, Ctr Biol Mol Severo Ochoa CSIC UAM, C Nicolas Cabrera 1, E-28049 Madrid, Spain
关键词
GENE-EXPRESSION; SEQUENCE; PROMASTIGOTE; ORGANIZATION; SUBFAMILIES; ALIGNMENT; BLAST;
D O I
10.1186/s13071-016-1329-4
中图分类号
R38 [医学寄生虫学]; Q [生物科学];
学科分类号
07 ; 0710 ; 09 ; 100103 ;
摘要
Background: Leishmania parasites cause severe human diseases known as leishmaniasis. These eukaryotic microorganisms possess an atypical chromosomal architecture and the regulation of gene expression occurs almost exclusively at post-transcriptional levels. Accordingly, sequencing of the genome of Leishmania major, and subsequently the genome of other related species, was paramount for highlighting these peculiar molecular aspects. Recently, we carried out an analysis of gene expression by massive sequencing of RNA in the L. major promastigote, and data derived from that analysis were suggestive of possible errors in the current genome assembly for this Leishmania species. Results: During the analysis by RNA-Seq of the transcriptome for L. major Friedlin strain, 163,714 reads could not be aligned with the reference genome. Thus, de novo assembly with these reads was carried out and the resulting contigs were further analyzed. After detailed homology searches using available databases, it was postulated that 15 contigs might correspond to genomic sequences lost during the initial genome assembly of the L. major Friedlin strain. This was experimentally confirmed by PCR amplification, cloning and sequencing of the new genomic regions. As a result, we have identified seven regions of the L. major (Friedlin) genome that were lost during the sequence assembly. This led to the uncovering of six new genes (LmjF.15.1475, LmjF.15.0285, LmjF.24.0765, LmjF.14.0860, LmjF.19.0305, and LmjF.27.2035), and correction of the annotation for two others (LmjF.15.1480 and LmjF.27.2030). Our data suggest that these genomic regions probably collapsed during the genome assembly due to the existence of gene duplications and/or repeated regions surrounding the missed genes. Conclusion: RNA-seq data helped to reconstruct some genomic regions misassembled during the L. major Friedlin genome assembly, which is otherwise quite robust. On the other hand, this study shows that data derived from massive sequencing approaches, including RNA-Seq, should be carefully inspected to improve current genome definition and gene annotations.
引用
收藏
页数:13
相关论文
共 26 条
[1]   Leishmaniasis Worldwide and Global Estimates of Its Incidence [J].
Alvar, Jorge ;
Velez, Ivan D. ;
Bern, Caryn ;
Herrero, Merce ;
Desjeux, Philippe ;
Cano, Jorge ;
Jannin, Jean ;
den Boer, Margriet .
PLOS ONE, 2012, 7 (05)
[2]   TriTrypDB: a functional genomic resource for the Trypanosomatidae [J].
Aslett, Martin ;
Aurrecoechea, Cristina ;
Berriman, Matthew ;
Brestelli, John ;
Brunk, Brian P. ;
Carrington, Mark ;
Depledge, Daniel P. ;
Fischer, Steve ;
Gajria, Bindu ;
Gao, Xin ;
Gardner, Malcolm J. ;
Gingle, Alan ;
Grant, Greg ;
Harb, Omar S. ;
Heiges, Mark ;
Hertz-Fowler, Christiane ;
Houston, Robin ;
Innamorato, Frank ;
Iodice, John ;
Kissinger, Jessica C. ;
Kraemer, Eileen ;
Li, Wei ;
Logan, Flora J. ;
Miller, John A. ;
Mitra, Siddhartha ;
Myler, Peter J. ;
Nayak, Vishal ;
Pennington, Cary ;
Phan, Isabelle ;
Pinney, Deborah F. ;
Ramasamy, Gowthaman ;
Rogers, Matthew B. ;
Roos, David S. ;
Ross, Chris ;
Sivam, Dhileep ;
Smith, Deborah F. ;
Srinivasamoorthy, Ganesh ;
Stoeckert, Christian J., Jr. ;
Subramanian, Sandhya ;
Thibodeau, Ryan ;
Tivey, Adrian ;
Treatman, Charles ;
Velarde, Giles ;
Wang, Haiming .
NUCLEIC ACIDS RESEARCH, 2010, 38 :D457-D462
[3]   Members of a large retroposon family are determinants of post-transcriptional gene expression in Leishmania [J].
Bringaud, Frederic ;
Mueller, Michaela ;
Cerqueira, Gustavo Coutinho ;
Smith, Martin ;
Rochette, Annie ;
El-Sayed, Najib M. A. ;
Papadopoulou, Barbara ;
Ghedin, Elodie .
PLOS PATHOGENS, 2007, 3 (09) :1291-1307
[4]   BLAST plus : architecture and applications [J].
Camacho, Christiam ;
Coulouris, George ;
Avagyan, Vahram ;
Ma, Ning ;
Papadopoulos, Jason ;
Bealer, Kevin ;
Madden, Thomas L. .
BMC BIOINFORMATICS, 2009, 10
[5]   Global gene expression in Leishmania [J].
Cohen-Freue, Gabriela ;
Holzer, Timothy R. ;
Forney, James D. ;
McMaster, W. Robert .
INTERNATIONAL JOURNAL FOR PARASITOLOGY, 2007, 37 (10) :1077-1086
[6]   MULTIPLE SEQUENCE ALIGNMENT WITH HIERARCHICAL-CLUSTERING [J].
CORPET, F .
NUCLEIC ACIDS RESEARCH, 1988, 16 (22) :10881-10890
[7]   The genome sequence of Trypanosoma cruzi, etiologic agent of Chagas disease [J].
El-Sayed, NM ;
Myler, PJ ;
Bartholomeu, DC ;
Nilsson, D ;
Aggarwal, G ;
Tran, AN ;
Ghedin, E ;
Worthey, EA ;
Delcher, AL ;
Blandin, G ;
Westenberger, SJ ;
Caler, E ;
Cerqueira, GC ;
Branche, C ;
Haas, B ;
Anupama, A ;
Arner, E ;
Åslund, L ;
Attipoe, P ;
Bontempi, E ;
Bringaud, F ;
Burton, P ;
Cadag, E ;
Campbell, DA ;
Carrington, M ;
Crabtree, J ;
Darban, H ;
da Silveira, JF ;
de Jong, P ;
Edwards, K ;
Englund, PT ;
Fazelina, G ;
Feldblyum, T ;
Ferella, M ;
Frasch, AC ;
Gull, K ;
Horn, D ;
Hou, LH ;
Huang, YT ;
Kindlund, E ;
Ktingbeil, M ;
Kluge, S ;
Koo, H ;
Lacerda, D ;
Levin, MJ ;
Lorenzi, H ;
Louie, T ;
Machado, CR ;
McCulloch, R ;
McKenna, A .
SCIENCE, 2005, 309 (5733) :409-415
[8]   Uridine insertion/deletion RNA editing in trypanosome mitochondria -: a review [J].
Estévez, AM ;
Simpson, L .
GENE, 1999, 240 (02) :247-260
[9]   Genomic organization and expression of the HSP70 locus in New and Old World Leishmania species [J].
Folgueira, C. ;
Canavate, C. ;
Chicharro, C. ;
Requena, J. M. .
PARASITOLOGY, 2007, 134 :369-377
[10]   Expression profiling by whole-genome interspecies microarray hybridization reveals differential gene expression in procyclic promastigotes, lesion-derived amastigotes, and axenic amastigotes in Leishmania mexicana [J].
Holzer, TR ;
McMaster, WR ;
Forney, JD .
MOLECULAR AND BIOCHEMICAL PARASITOLOGY, 2006, 146 (02) :198-218