Hla-mapper: An application to optimize the mapping of HLA sequences produced by massively parallel sequencing procedures

被引:34
作者
Castelli, Erick C. [1 ,2 ]
Paz, Michelle A. [1 ]
Souza, Andreia S. [1 ]
Ramalho, Jaqueline [1 ]
Mendes-Junior, Celso Teixeira [3 ]
机构
[1] Sao Paulo State Univ UNESP, Mol Genet & Bioinformat Lab, Expt Res Unit UNIPEX, Sch Med, Botucatu, SP, Brazil
[2] Sao Paulo State Univ UNESP, Pathol Dept, Sch Med, BR-18618687 Botucatu, SP, Brazil
[3] Univ Sao Paulo, Dept Quim, Fac Filosofia Ciencias & Letras Ribeirao Preto, BR-14040901 Ribeirao Preto, SP, Brazil
基金
巴西圣保罗研究基金会;
关键词
MHC; HLA; Next Generation Sequencing (NGS); Second Generation Sequencing; Variability; Polymorphisms; Typing; Aligners; Mapping tool; BRAZILIAN POPULATION-SAMPLE; REGION VARIABILITY; READ ALIGNMENT; HAPLOTYPES; FRAMEWORK;
D O I
10.1016/j.humimm.2018.06.010
中图分类号
R392 [医学免疫学]; Q939.91 [免疫学];
学科分类号
100102 ;
摘要
A challenging task when more than one HLA gene is evaluated together by second-generation sequencing is to achieve a reliable read mapping. The polymorphic and repetitive nature of HLA genes might bias the read mapping process, usually underestimating variability at very polymorphic segments, or overestimating variability at some segments. To overcome this issue we developed hla-mapper, which takes into account HLA sequences derived from the IPD-IMGT/HLA database and unpublished HLA sequences to apply a scoring system. This comprehends the evaluation of each read pair, addressing them to the most likely HLA gene they were derived from. Ma-mapper provides a reliable map of HLA sequences, allowing accurate downstream analysis such as variant calling, haplotype inference, and allele typing. Moreover, hla-mapper supports whole genome, exome, and targeted sequencing data. To assess the software performance in comparison with traditional mapping algorithms, we used three different simulated datasets to compare the results obtained with hla-mapper, BWA MEM, and Bowtie2. Overall, hla-mapper presented a superior performance, mainly for the classical HLA class I genes, minimizing wrong mapping and cross-mapping that are typically observed when using BWA MEM or Bowtie2 with a single reference genome.
引用
收藏
页码:678 / 684
页数:7
相关论文
共 25 条
[1]   HLA-F coding and regulatory segments variability determined by massively parallel sequencing procedures in a Brazilian population sample [J].
Ayala Lima, Thalitta Hetamaro ;
Buttura, Renato Vidal ;
Donadi, Eduardo Antonio ;
Veiga-Castelli, Luciana Caricati ;
Mendes-Junior, Celso Teixeira ;
Castelli, Erick C. .
HUMAN IMMUNOLOGY, 2016, 77 (10) :841-853
[2]  
Bauer DC., 2016, Briefings in Bioinformatics, P1
[3]  
Brandt D. Y., 2015, G3-GENES GENOM GENET, P931
[4]   HLA-G variability and haplotypes detected by massively parallel sequencing procedures in the geographicaly distinct population samples of Brazil and Cyprus [J].
Castelli, Erick C. ;
Gerasimou, Petroula ;
Paz, Michelle A. ;
Ramalho, Jaqueline ;
Porto, Iane O. P. ;
Lima, Thalitta H. A. ;
Souza, Andreia S. ;
Veiga-Castelli, Luciana C. ;
Collares, Cristhianna V. A. ;
Donadi, Eduardo A. ;
Mendes-Junior, Celso T. ;
Costeas, Paul .
MOLECULAR IMMUNOLOGY, 2017, 83 :115-126
[5]   HLA-E coding and 3′ untranslated region variability determined by next-generation sequencing in two West-African population samples [J].
Castelli, Erick C. ;
Mendes-Junior, Celso T. ;
Sabbagh, Audrey ;
Porto, Iane O. P. ;
Garcia, Andre ;
Ramalho, Jaqueline ;
Lima, Thalitta H. A. ;
Massaro, Juliana D. ;
Dias, Fabricio C. ;
Collares, Cristhianna V. A. ;
Jamonneau, Vincent ;
Bucheton, Bruno ;
Camara, Mamadou ;
Donadi, Eduardo A. .
HUMAN IMMUNOLOGY, 2015, 76 (12) :945-953
[6]   Effect of read-mapping biases on detecting allele-specific expression from RNA-sequencing data [J].
Degner, Jacob F. ;
Marioni, John C. ;
Pai, Athma A. ;
Pickrell, Joseph K. ;
Nkadori, Everlyne ;
Gilad, Yoav ;
Pritchard, Jonathan K. .
BIOINFORMATICS, 2009, 25 (24) :3207-3212
[7]   A framework for variation discovery and genotyping using next-generation DNA sequencing data [J].
DePristo, Mark A. ;
Banks, Eric ;
Poplin, Ryan ;
Garimella, Kiran V. ;
Maguire, Jared R. ;
Hartl, Christopher ;
Philippakis, Anthony A. ;
del Angel, Guillermo ;
Rivas, Manuel A. ;
Hanna, Matt ;
McKenna, Aaron ;
Fennell, Tim J. ;
Kernytsky, Andrew M. ;
Sivachenko, Andrey Y. ;
Cibulskis, Kristian ;
Gabriel, Stacey B. ;
Altshuler, David ;
Daly, Mark J. .
NATURE GENETICS, 2011, 43 (05) :491-+
[8]   Implications of the polymorphism of HLA-G on its function, regulation, evolution and disease association [J].
Donadi, Eduardo A. ;
Castelli, Erick C. ;
Arnaiz-Villena, Antonio ;
Roger, Michel ;
Rey, Diego ;
Moreau, Philippe .
CELLULAR AND MOLECULAR LIFE SCIENCES, 2011, 68 (03) :369-395
[9]   Phase-defined complete sequencing of the HLA genes by next-generation sequencing [J].
Hosomichi, Kazuyoshi ;
Jinam, Timothy A. ;
Mitsunaga, Shigeki ;
Nakaoka, Hirofumi ;
Inoue, Ituro .
BMC GENOMICS, 2013, 14
[10]   HLAreporter: a tool for HLA typing from next generation sequencing data [J].
Huang, Yazhi ;
Yang, Jing ;
Ying, Dingge ;
Zhang, Yan ;
Shotelersuk, Vorasuk ;
Hirankarn, Nattiya ;
Sham, Pak Chung ;
Lau, Yu Lung ;
Yang, Wanling .
GENOME MEDICINE, 2015, 7