Dual redundant sequencing strategy: Full-length gene characterisation of 1056 novel and confirmatory HLA alleles

被引:51
|
作者
Albrecht, V. [1 ]
Zweiniger, C. [1 ]
Surendranath, V. [1 ]
Lang, K. [1 ]
Schoefl, G. [1 ]
Dahl, A. [2 ]
Winkler, S. [3 ]
Lange, V. [1 ]
Boehme, I. [1 ]
Schmidt, A. H. [1 ,4 ]
机构
[1] DKMS Life Sci Lab, Dresden, Germany
[2] CRTD, Deep Sequencing Grp, Dresden, Germany
[3] Max Planck Inst Mol Cell Biol & Genet, DNA Sequencing, Dresden, Germany
[4] DKMS, Tubingen, Germany
关键词
full-length gene sequencing; HLA typing; NGS; novel HLA alleles; PacBio; IMGT/HLA DATABASE; TRANSPLANTATION; TECHNOLOGIES; IPD;
D O I
10.1111/tan.13057
中图分类号
Q2 [细胞生物学];
学科分类号
071009 ; 090102 ;
摘要
The high-throughput department of DKMS Life Science Lab encounters novel human leukocyte antigen (HLA) alleles on a daily basis. To characterise these alleles, we have developed a system to sequence the whole gene from 5'- to 3'-UTR for the HLA loci A, B, C, DQB1 and DPB1 for submission to the European Molecular Biology Laboratory - European Nucleotide Archive (EMBL-ENA) and the IPD-IMGT/HLA Database. Our workflow is based on a dual redundant sequencing strategy. Using shotgun sequencing on an Illumina MiSeq instrument and single molecule real-time (SMRT) sequencing on a PacBio RS II instrument, we are able to achieve highly accurate HLA full-length consensus sequences. Remaining conflicts are resolved using the R package DR2S (Dual Redundant Reference Sequencing). Given the relatively high throughput of this strategy, we have developed the semi-automated web service TypeLoader, to aid in the submission of sequences to the EMBL-ENA and the IPD-IMGT/HLA Database. In the IPD-IMGT/HLA Database release 3.24.0 (April 2016; prior to the submission of the sequences described here), only 5.2% of all known HLA alleles have been fully characterised together with intronic and UTR sequences. So far, we have applied our strategy to characterise and submit 1056 HLA alleles, thereby more than doubling the number of fully characterised alleles. Given the increasing application of next generation sequencing (NGS) for full gene characterisation in clinical practice, extending the HLA database concomitantly is highly desirable. Therefore, we propose this dual redundant sequencing strategy as a workflow for submission of novel full-length alleles and characterisation of sequences that are as yet incomplete. This would help to mitigate the predominance of partially known alleles in the database.
引用
收藏
页码:79 / 87
页数:9
相关论文
共 50 条
  • [21] Genomic full-length sequence of two HLA-A alleles, A*02:48 and A*02:251, identified by cloning and sequencing
    Lu, L.
    Xu, Y. P.
    TISSUE ANTIGENS, 2015, 86 (05): : 377 - 379
  • [22] Cloning and sequencing full-length HLA-B and -C genes
    Cox, ST
    McWhinnie, AJ
    Robinson, J
    Marsh, SGE
    Parham, P
    Madrigal, JA
    Little, AM
    TISSUE ANTIGENS, 2003, 61 (01): : 20 - 48
  • [23] IDENTIFICATION OF GENOMIC FULL-LENGTH SEQUENCE OF HLA-E IN CHINESE INDIVIDUALS AND TWO NOVEL HLA-E ALLELES
    Xu, Yun-Ping
    Wang Song-Xing
    HLA, 2017, 89 (06) : 449 - 449
  • [24] The genomic full-length sequence of the HLA-A02:304 allele, identified by full-length group-specific sequencing
    Hu, Bin
    Feng, Zhihui
    Jiao, Shuxian
    Pang, Shutao
    HLA, 2022, 100 (05) : 506 - 508
  • [25] The genomic full-length sequence of the HLA-A*24:257 allele, identified by full-length group-specific sequencing
    Hu, Bin
    Feng, Zhihui
    Jiao, Shuxian
    Pang, Shutao
    HLA, 2021, 98 (06) : 536 - 538
  • [26] The genomic full-length sequence of the HLA-A02:344 allele, identified by full-length group-specific sequencing
    Chi, Xiaoyun
    Liu, Li
    Jiao, Shuxian
    Pang, Shutao
    HLA, 2022, 100 (03) : 254 - 256
  • [27] Genomic full-length sequence of the HLA-A*11:97 allele, identified by full-length group-specific sequencing
    Hu, Bin
    Jiao, Shuxian
    Chi, Xiaoyun
    Pang, Shutao
    HLA, 2020, 96 (05) : 625 - 626
  • [28] Genomic full-length sequence of the HLA-A*24:233 allele, identified by full-length group-specific sequencing
    Hu, Bin
    Jiao, Shuxian
    An, Run
    Pang, Shutao
    HLA, 2021, 97 (04) : 356 - +
  • [29] Genomic full-length sequence of HLA-B*40:54 was identified by full-length group-specific sequencing
    Chen, Peng
    Xu, Yunping
    HLA, 2019, 93 (06) : 487 - +
  • [30] The genomic full-length sequence of the HLA-A*03:30 allele, identified by full-length group-specific sequencing
    Chi, Xiaoyun
    Fang, Ruifeng
    Jiao, Shuxian
    Hu, Bin
    Pang, Shutao
    HLA, 2021, 98 (05) : 469 - 471