A reference haplotype panel for genome-wide imputation of short tandem repeats

被引:46
|
作者
Saini, Shubham [1 ]
Mitra, Ileena [2 ]
Mousavi, Nima [3 ]
Fotsing, Stephanie Feupe [2 ,4 ]
Gymrek, Melissa [1 ,5 ]
机构
[1] Univ Calif San Diego, Dept Comp Sci & Engn, 9500 Gilman Dr, La Jolla, CA 92093 USA
[2] Univ Calif San Diego, Bioinformat & Syst Biol Program, 9500 Gilman Dr, La Jolla, CA 92093 USA
[3] Univ Calif San Diego, Dept Elect & Comp Engn, 9500 Gilman Dr, La Jolla, CA 92093 USA
[4] Univ Calif San Diego, Dept Biomed Informat, 9500 Gilman Dr, La Jolla, CA 92093 USA
[5] Univ Calif San Diego, Dept Med, 9500 Gilman Dr, La Jolla, CA 92093 USA
基金
美国国家卫生研究院; 美国国家科学基金会;
关键词
GENE-EXPRESSION VARIATION; LINKAGE DISEQUILIBRIUM; DNA METHYLATION; CAG REPEAT; EXPANSION; MICROSATELLITE; VARIANTS; MUTATION; DISEASE; ASSOCIATION;
D O I
10.1038/s41467-018-06694-0
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Short tandem repeats (STRs) are involved in dozens of Mendelian disorders and have been implicated in complex traits. However, genotyping arrays used in genome-wide association studies focus on single nucleotide polymorphisms (SNPs) and do not readily allow identification of STR associations. We leverage next-generation sequencing (NGS) from 479 families to create a SNP + STR reference haplotype panel. Our panel enables imputing STR genotypes into SNP array data when NGS is not available for directly genotyping STRs. Imputed genotypes achieve mean concordance of 97% with observed genotypes in an external dataset compared to 71% expected under a naive model. Performance varies widely across STRs, with near perfect concordance at bi-allelic STRs vs. 70% at highly polymorphic repeats. Imputation increases power over individual SNPs to detect STR associations with gene expression. Imputing STRs into existing SNP datasets will enable the first large-scale STR association studies across a range of complex traits.
引用
收藏
页数:11
相关论文
共 50 条
  • [1] Genome-wide detection of somatic mosaicism at short tandem repeats
    Sehgal, Aarushi
    Jam, Helyaneh Ziaei
    Shen, Andrew
    Gymrek, Melissa
    BIOINFORMATICS, 2024, 40 (08)
  • [2] Genome-wide meta-analysis of short-tandem repeats for Parkinson's disease risk using genotype imputation
    Ohlei, Olena
    Paul, Kimberly
    Nielsen, Susan Searles
    Gmelin, David
    Dobricic, Valerija
    Altmann, Vivian
    Schilling, Marcel
    Bronstein, Jeff M.
    Franke, Andre
    Wittig, Michael
    Parkkinen, Laura
    Hansen, Johnni
    Checkoway, Harvey
    Ritz, Beate
    Bertram, Lars
    Lill, Christina M.
    BRAIN COMMUNICATIONS, 2024, 6 (03)
  • [3] Profiling the Genome-Wide Landscape of Short Tandem Repeats by Long-Read Sequencing
    Liu, Zhenhua
    Zhao, Guihu
    Xiao, Yuhui
    Zeng, Sheng
    Yuan, Yanchun
    Zhou, Xun
    Fang, Zhenghuan
    He, Runcheng
    Li, Bin
    Zhao, Yuwen
    Pan, Hongxu
    Wang, Yige
    Yu, Guoliang
    Peng, I-Feng
    Wang, Depeng
    Meng, Qingtuan
    Xu, Qian
    Sun, Qiying
    Yan, Xinxiang
    Shen, Lu
    Jiang, Hong
    Xia, Kun
    Wang, Junling
    Guo, Jifeng
    Liang, Fan
    Li, Jinchen
    Tang, Beisha
    FRONTIERS IN GENETICS, 2022, 13
  • [4] PHARP: a pig haplotype reference panel for genotype imputation
    Wang, Zhen
    Zhang, Zhenyang
    Chen, Zitao
    Sun, Jiabao
    Cao, Caiyun
    Wu, Fen
    Xu, Zhong
    Zhao, Wei
    Sun, Hao
    Guo, Longyu
    Zhang, Zhe
    Wang, Qishan
    Pan, Yuchun
    SCIENTIFIC REPORTS, 2022, 12 (01)
  • [5] Genome-wide detection of tandem DNA repeats that are expanded in autism
    Trost, Brett
    Engchuan, Worrawat
    Nguyen, Charlotte M.
    Thiruvahindrapuram, Bhooma
    Dolzhenko, Egor
    Backstrom, Ian
    Mirceta, Mila
    Mojarad, Bahareh A.
    Yin, Yue
    Dov, Alona
    Chandrakumar, Induja
    Prasolava, Tanya
    Shum, Natalie
    Hamdan, Omar
    Pellecchia, Giovanna
    Howe, Jennifer L.
    Whitney, Joseph
    Klee, Eric W.
    Baheti, Saurabh
    Amaral, David G.
    Anagnostou, Evdokia
    Elsabbagh, Mayada
    Fernandez, Bridget A.
    Ny Hoang
    Lewis, M. E. Suzanne
    Liu, Xudong
    Sjaarda, Calvin
    Smith, Isabel M.
    Szatmari, Peter
    Zwaigenbaum, Lonnie
    Glazer, David
    Hartley, Dean
    Stewart, A. Keith
    Eberle, Michael A.
    Sato, Nozomu
    Pearson, Christopher E.
    Scherer, Stephen W.
    Yuen, Ryan K. C.
    NATURE, 2020, 586 (7827) : 80 - +
  • [6] The construction of a haplotype reference panel using extremely low coverage whole genome sequences and its application in genome-wide association studies and genomic prediction in Duroc pigs
    Zhang, Zhe
    Ma, Peipei
    Zhang, Zhenyang
    Wang, Zhen
    Wang, Qishan
    Pan, Yuchun
    GENOMICS, 2022, 114 (01) : 340 - 350
  • [7] Genome-wide enhancer-associated tandem repeats are expanded in cardiomyopathy
    Mitina, Aleksandra
    Khan, Mahreen
    Lesurf, Robert
    Yin, Yue
    Engchuan, Worrawat
    Hamdan, Omar
    Pellecchia, Giovanna
    Trost, Brett
    Backstrom, Ian
    Guo, Keyi
    Pallotto, Linda M.
    Doong, Phoenix Hoi Lam
    Wang, Zhuozhi
    Nalpathamkalam, Thomas
    Thiruvahindrapuram, Bhooma
    Papaz, Tanya
    Pearson, Christopher E.
    Ragoussis, Jiannis
    Subbarao, Padmaja
    Azad, Meghan B.
    Turvey, Stuart E.
    Mandhane, Piushkumar
    Moraes, Theo J.
    Simons, Elinor
    Scherer, Stephen W.
    Lougheed, Jane
    Mondal, Tapas
    Smythe, John
    Altamirano-Diaz, Luis
    Oechslin, Erwin
    Mital, Seema
    Yuen, Ryan K. C.
    EBIOMEDICINE, 2024, 101
  • [8] Genome-wide contribution of common short-tandem repeats to Parkinson's disease genetic risk
    Bustos, Bernabe, I
    Billingsley, Kimberley
    Blauwendraat, Cornelis
    Gibbs, J. Raphael
    Gan-Or, Ziv
    Krainc, Dimitri
    Singleton, Andrew B.
    Lubbe, Steven J.
    BRAIN, 2023, 146 (01) : 65 - 74
  • [9] Effect of Genome-Wide Genotyping and Reference Panels on Rare Variants Imputation
    Zheng, Hou-Feng
    Ladouceur, Martin
    Greenwood, Celia M. T.
    Richards, J. Brent
    JOURNAL OF GENETICS AND GENOMICS, 2012, 39 (10) : 545 - 550
  • [10] Sequencing and characterizing short tandem repeats in the human genome
    Tanudisastro, Hope A.
    Deveson, Ira W.
    Dashnow, Harriet
    Macarthur, Daniel G.
    NATURE REVIEWS GENETICS, 2024, 25 (07) : 460 - 475