Template-specific optimization of NGS genotyping pipelines reveals allele-specific variation in MHC gene expression

Cited by: 0
Authors
Efstratiou, Artemis [1, 2]
Gaigher, Arnaud [1, 2, 3]
Kuenzel, Sven [4]
Teles, Ana [1, 2, 5]
Lenz, Tobias L. [1, 2]
Affiliations
[1] Univ Hamburg, Dept Biol, Res Unit Evolutionary Immunogen, Martin Luther King Pl 3, D-20146 Hamburg, Germany
[2] Max Planck Inst Evolutionary Biol, Res Grp Evolutionary Immunogen, Plon, Germany
[3] Univ Porto, Res Ctr Biodivers & Genet Resources, CIBIO InBio, Vairao, Portugal
[4] Max Planck Inst Evolutionary Biol, Dept Evolutionary Genet, Plon, Germany
[5] Max Planck Inst Evolutionary Biol, Dept Evolutionary Ecol, Plon, Germany
Funding
Swiss National Science Foundation;
Keywords
amplicon genotyping; AmpliSAS; bioinformatics pipelines; major histocompatibility complex; multigene families; next-generation sequencing; CLASS-I PATHWAY; SEQUENCE POLYMORPHISM; STICKLEBACK; SELECTION; INFECTION; INFERENCE; EVOLUTION; TELEOST; NUMBER; VIRUS;
DOI
10.1111/1755-0998.13935
Chinese Library Classification
Q5 [Biochemistry]; Q7 [Molecular Biology];
Discipline classification codes
071010; 081704;
Abstract
Using high-throughput sequencing for precise genotyping of multi-locus gene families, such as the major histocompatibility complex (MHC), remains challenging, due to the complexity of the data and difficulties in distinguishing genuine from erroneous variants. Several dedicated genotyping pipelines for data from high-throughput sequencing, such as next-generation sequencing (NGS), have been developed to tackle the ensuing risk of artificially inflated diversity. Here, we thoroughly assess three such multi-locus genotyping pipelines for NGS data, the DOC method, AmpliSAS and ACACIA, using MHC class II beta data sets of three-spined stickleback gDNA, cDNA and "artificial" plasmid samples with known allelic diversity. We show that genotyping of gDNA and plasmid samples at optimal pipeline parameters was highly accurate and reproducible across methods. However, for cDNA data, the gDNA-optimal parameter configuration yielded decreased overall genotyping precision and consistency between pipelines. Further adjustments of key clustering parameters were required to account for higher error rates and larger variation in sequencing depth per allele, highlighting the importance of template-specific pipeline optimization for reliable genotyping of multi-locus gene families. Through accurate paired gDNA-cDNA typing and MHC-II haplotype inference, we show that MHC-II allele-specific expression levels correlate negatively with allele number across haplotypes. Lastly, sibship-assisted cDNA-typing of MHC-I revealed novel variants linked in haplotype blocks, and a higher-than-previously-reported individual MHC-I allelic diversity. In conclusion, we provide novel genotyping protocols for the three-spined stickleback MHC-I and -II genes, and evaluate the performance of popular NGS-genotyping pipelines. We also show that fine-tuned genotyping of paired gDNA-cDNA samples facilitates amplification bias-corrected MHC allele expression analysis.
Pages: 21