Characterization of genome-wide STR variation in 6487 human genomes

Cited by: 28
Authors
Shi, Yirong [1, 2]
Niu, Yiwei [1, 3]
Zhang, Peng [1]
Luo, Huaxia [1]
Liu, Shuai [1, 3]
Zhang, Sijia [1, 3]
Wang, Jiajia [1]
Li, Yanyan [1]
Liu, Xinyue [1, 2]
Song, Tingrui [1]
Xu, Tao [4, 5]
He, Shunmin [1, 3]
Affiliations
[1] Chinese Acad Sci, Inst Biophys, Ctr Big Data Res Hlth, Key Lab RNA Biol, Beijing 100101, Peoples R China
[2] Univ Chinese Acad Sci, Beijing 100049, Peoples R China
[3] Univ Chinese Acad Sci, Coll Life Sci, Beijing 100049, Peoples R China
[4] Chinese Acad Sci, Inst Biophys, CAS Ctr Excellence Biomacromolecules, Natl Lab Biomacromolecules, Beijing 100101, Peoples R China
[5] Shandong First Med Univ & Shandong Acad Med Sci, Jinan 250117, Shandong, Peoples R China
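Funding
China Postdoctoral Science Foundation; National Natural Science Foundation of China; National Key Research and Development Program of China;
Keywords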
TANDEM REPEATS; GENE-EXPRESSION; FRAGILE-X; MICROSATELLITE REPEAT; STRUCTURAL VARIATION; POPULATION; MUTATIONS; DNA; DISCOVERY; EVOLUTION;
DOI
10.1038/s41467-023-37690-8
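Chinese Library Classification
O [Mathematical sciences and chemistry]; P [Astronomy and earth sciences]; Q [Biological sciences]; N [General natural sciences];
Subject classification codes
07; 0710; 09;
Abstract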
Short tandem repeats (STRs) are abundant and highly mutagenic in the human genome. Many STR loci have been associated with a range of human genetic disorders. However, most population-scale studies on STR variation in humans have focused on European ancestry cohorts or are limited by sequencing depth. Here, we depicted a comprehensive map of 366,013 polymorphic STRs (pSTRs) constructed from 6487 deeply sequenced genomes, comprising 3983 Chinese samples (~31.5x, NyuWa) and 2504 samples from the 1000 Genomes Project (~33.3x, 1KGP). We found that STR mutations were affected by motif length, chromosome context and epigenetic features. We identified 3273 and 1117 pSTRs whose repeat numbers were associated with gene expression and 3' UTR alternative polyadenylation, respectively. We also implemented population analysis, investigated population differentiated signatures, and genotyped 60 known disease-causing STRs. Overall, this study further extends the scale of STR variation in humans and propels our understanding of the semantics of STRs.
Pages: 18
Related papers
50 in total
  • [1] Characterization of the Poplar Pan-Genome by Genome-Wide Identification of Structural Variation
    Pinosio, Sara
    Giacomello, Stefania
    Faivre-Rampant, Patricia
    Taylor, Gail
    Jorge, Veronique
    Le Paslier, Marie Christine
    Zaina, Giusi
    Bastien, Catherine
    Cattonaro, Federica
    Marroni, Fabio
    Morgante, Michele
    MOLECULAR BIOLOGY AND EVOLUTION, 2016, 33 (10) : 2706 - 2719
  • [2] Genome-wide investigation of VNTR motif polymorphisms in 8,222 genomes: Implications for biological regulation and human traits
    Zhang, Sijia
    Song, Qiao
    Zhang, Peng
    Wang, Xiaona
    Guo, Rong
    Li, Yanyan
    Liu, Shuai
    Yan, Xiaoyu
    Zhang, Jingjing
    Niu, Yiwei
    Shi, Yirong
    Song, Tingrui
    Xu, Tao
    He, Shunmin
    CELL GENOMICS, 2024, 4 (12):
  • [3] Analysis of Brachypodium genomes with genome-wide optical maps
    Zhu, Tingting
    Hu, Zhaorong
    Rodriguez, Juan C.
    Deal, Karin R.
    Dvorak, Jan
    Vogel, John P.
    Liu, Zhiyong
    Luo, Ming-Cheng
    GENOME, 2018, 61 (08) : 559 - 565
  • [4] Genome-wide characterization of centromeric satellites from multiple mammalian genomes
    Alkan, Can
    Cardone, Maria Francesca
    Catacchio, Claudia Rita
    Antonacci, Francesca
    O'Brien, Stephen J.
    Ryder, Oliver A.
    Purgato, Stefania
    Zoli, Monica
    Della Valle, Giuliano
    Eichler, Evan E.
    Ventura, Mario
    GENOME RESEARCH, 2011, 21 (01) : 137 - 145
  • [5] A genome-wide perspective of genetic variation in human metabolism
    Illig, Thomas
    Gieger, Christian
    Zhai, Guangju
    Roemisch-Margl, Werner
    Wang-Sattler, Rui
    Prehn, Cornelia
    Altmaier, Elisabeth
    Kastenmueller, Gabi
    Kato, Bernet S.
    Mewes, Hans-Werner
    Meitinger, Thomas
    de Angelis, Martin Hrabe
    Kronenberg, Florian
    Soranzo, Nicole
    Wichmann, H-Erich
    Spector, Tim D.
    Adamski, Jerzy
    Suhre, Karsten
    NATURE GENETICS, 2010, 42 (02) : 137 - U66
  • [6] Genome-Wide Variation in Betacoronaviruses
    LaTourrette, Katherine
    Holste, Natalie M.
    Rodriguez-Pena, Rosalba
    Leme, Raquel Arruda
    Garcia-Ruiz, Hernan
    JOURNAL OF VIROLOGY, 2021, 95 (15)
  • [7] Genome-wide analysis of G-quadruplexes in herpesvirus genomes
    Biswas, Banhi
    Kandpal, Manish
    Jauhari, Utkarsh Kumar
    Vivekanandan, Perumal
    BMC GENOMICS, 2016, 17
  • [8] A Genome-Wide Landscape of Retrocopies in Primate Genomes
    Navarro, Fabio C. P.
    Galante, Pedro A. F.
    GENOME BIOLOGY AND EVOLUTION, 2015, 7 (08) : 2265 - 2275
  • [9] Pervasive, Genome-Wide Transcription in the Organelle Genomes of Diverse Plastid-Bearing Protists
    Lima, Matheus Sanita
    Smith, David Roy
    G3-GENES GENOMES GENETICS, 2017, 7 (11) : 3789 - 3796
  • [10] Genome-wide characterization and analysis of microsatellite sequences in camelid species
    Manee, Manee M.
    Algarni, Abdulmalek T.
    Alharbi, Sultan N.
    Al-Shomrani, Badr M.
    Ibrahim, Mohanad A.
    Binghadir, Sarah A.
    Al-Fageeh, Mohamed B.
    MAMMAL RESEARCH, 2020, 65 (02) : 359 - 373