PacBio assembly of a Plasmodium knowlesi genome sequence with Hi-C correction and manual annotation of the SIC Avar gene family

被引:27
作者
Lapp, S. A. [1 ]
Geraldo, J. A. [2 ,3 ]
Chien, J. -T. [1 ,4 ]
Ay, F. [5 ]
Pakala, S. B. [6 ,7 ]
Batugedara, G. [8 ]
Humphrey, J. [6 ,7 ]
Debarry, J. D. [6 ,7 ]
Le Roch, K. G. [8 ]
Galinski, M. R. [1 ,4 ,10 ]
Kissinger, J. C. [6 ,7 ,11 ]
机构
[1] Emory Univ, Yerkes Natl Primate Res Ctr, Emory Vaccine Ctr, Atlanta, GA 30322 USA
[2] Univ Fed Minas Gerais, Belo Horizonte, MG, Brazil
[3] Rene Rachou Res Ctr CPqRR FIOCRUZ, Belo Horizonte, MG, Brazil
[4] Emory Univ, Dept Math & Comp Sci, Atlanta, GA 30322 USA
[5] La Jolla Inst Allergy & Immunol, La Jolla, CA 92037 USA
[6] Univ Georgia, Inst Bioinformat, Athens, GA 30602 USA
[7] Univ Georgia, Ctr Trop & Emerging Global Dis, Athens, GA 30602 USA
[8] Univ Calif Riverside, Inst Integrat Genome Biol, Ctr Dis & Vector Res, Dept Cell Biol & Neurosci, Riverside, CA 92521 USA
[9] Malaria Host Pathogen Interact Ctr, Atlanta, GA USA
[10] Emory Univ, Dept Med, Div Infect Dis, Atlanta, GA 30322 USA
[11] Univ Georgia, Dept Genet, Athens, GA 30602 USA
基金
美国国家卫生研究院;
关键词
Plasmodium knowlesi; PacBio; Hi-C; SICAvar; MaHPIC; genome; sequence; annotation; antigenic variation; ANTIGENIC VARIATION; VARIANT ANTIGEN; ERYTHROCYTE-MEMBRANE; ZOONOTIC MALARIA; HUMAN INFECTIONS; EXPRESSION; REVEALS; ORGANIZATION; SOFTWARE; MONKEYS;
D O I
10.1017/S0031182017001329
中图分类号
R38 [医学寄生虫学]; Q [生物科学];
学科分类号
07 ; 0710 ; 09 ; 100103 ;
摘要
Plasmodium knowlesi has risen in importance as a zoonotic parasite that has been causing regular episodes of malaria throughout South East Asia. The P. knowlesi genome sequence generated in 2008 highlighted and confirmed many similarities and differences in Plasmodium species, including a global view of several multigene families, such as the large SIC Avar multigene family encoding the variant antigens known as the schizont-infected cell agglutination proteins. However, repetitive DNA sequences are the bane of any genome project, and this and other Plasmodium genome projects have not been immune to the gaps, rearrangements and other pitfalls created by these genomic features. Today, long-read PacBio and chromatin conformation technologies are overcoming such obstacles. Here, based on the use of these technologies, we present a highly refined de novo P. knowlesi genome sequence of the Pk1(A+) clone. This sequence and annotation, referred to as the 'MaHPIC Pk genome sequence', includes manual annotation of the SIC Avar gene family with 136 full-length members categorized as type I or II. This sequence provides a framework that will permit a better understanding of the SICAvar repertoire, selective pressures acting on this gene family and mechanisms of antigenic variation in this species and other pathogens.
引用
收藏
页码:71 / 84
页数:14
相关论文
共 52 条
  • [41] Pasini E. M., 2017, PARASITOLOGY
  • [42] A 3D Map of the Human Genome at Kilobase Resolution Reveals Principles of Chromatin Looping
    Rao, Suhas S. P.
    Huntley, Miriam H.
    Durand, Neva C.
    Stamenova, Elena K.
    Bochkov, Ivan D.
    Robinson, James T.
    Sanborn, Adrian L.
    Machol, Ido
    Omer, Arina D.
    Lander, Eric S.
    Aiden, Erez Lieberman
    [J]. CELL, 2014, 159 (07) : 1665 - 1680
  • [43] Artemis: sequence visualization and annotation
    Rutherford, K
    Parkhill, J
    Crook, J
    Horsnell, T
    Rice, P
    Rajandream, MA
    Barrell, B
    [J]. BIOINFORMATICS, 2000, 16 (10) : 944 - 945
  • [44] HiC-Pro: an optimized and flexible pipeline for Hi-C data processing
    Servant, Nicolas
    Varoquaux, Nelle
    Lajoie, Bryan R.
    Viara, Eric
    Chen, Chong-Jian
    Vert, Jean-Philippe
    Heard, Edith
    Dekker, Job
    Barillot, Emmanuel
    [J]. GENOME BIOLOGY, 2015, 16
  • [45] Estimating Geographical Variation in the Risk of Zoonotic Plasmodium knowlesi Infection in Countries Eliminating Malaria
    Shearer, Freya M.
    Huang, Zhi
    Weiss, Daniel J.
    Wiebe, Antoinette
    Gibson, Harry S.
    Battle, Katherine E.
    Pigott, David M.
    Brady, Oliver J.
    Putaporntip, Chaturong
    Jongwutiwes, Somchai
    Lau, Yee Ling
    Manske, Magnus
    Amato, Roberto
    Elyazar, Iqbal R. F.
    Vythilingam, Indra
    Bhatt, Samir
    Gething, Peter W.
    Singh, Balbir
    Golding, Nick
    Hay, Simon I.
    Moyes, Catherine L.
    [J]. PLOS NEGLECTED TROPICAL DISEASES, 2016, 10 (08):
  • [46] A large focus of naturally acquired Plasmodium knowlesi infections in human beings
    Singh, B
    Sung, LK
    Matusop, A
    Radhakrishnan, A
    Shamsul, SSG
    Cox-Singh, J
    Thomas, A
    Conway, DJ
    [J]. LANCET, 2004, 363 (9414) : 1017 - 1024
  • [47] Human Infections and Detection of Plasmodium knowlesi
    Singh, Balbir
    Daneshvar, Cyrus
    [J]. CLINICAL MICROBIOLOGY REVIEWS, 2013, 26 (02) : 165 - 184
  • [48] SyMAP v3.4: a turnkey synteny system with application to plant genomes
    Soderlund, Carol
    Bomhoff, Matthew
    Nelson, William M.
    [J]. NUCLEIC ACIDS RESEARCH, 2011, 39 (10) : e68
  • [49] ParsEval: parallel comparison and analysis of gene structure annotations
    Standage, Daniel S.
    Brendel, Volker P.
    [J]. BMC BIOINFORMATICS, 2012, 13
  • [50] Using native and syntenically mapped cDNA alignments to improve de novo gene finding
    Stanke, Mario
    Diekhans, Mark
    Baertsch, Robert
    Haussler, David
    [J]. BIOINFORMATICS, 2008, 24 (05) : 637 - 644