Characterization of genome-wide STR variation in 6487 human genomes

被引:28
|
作者
Shi, Yirong [1 ,2 ]
Niu, Yiwei [1 ,3 ]
Zhang, Peng [1 ]
Luo, Huaxia [1 ]
Liu, Shuai [1 ,3 ]
Zhang, Sijia [1 ,3 ]
Wang, Jiajia [1 ]
Li, Yanyan [1 ]
Liu, Xinyue [1 ,2 ]
Song, Tingrui [1 ]
Xu, Tao [4 ,5 ]
He, Shunmin [1 ,3 ]
机构
[1] Chinese Acad Sci, Inst Biophys, Ctr Big Data Res Hlth, Key Lab RNA Biol, Beijing 100101, Peoples R China
[2] Univ Chinese Acad Sci, Beijing 100049, Peoples R China
[3] Univ Chinese Acad Sci, Coll Life Sci, Beijing 100049, Peoples R China
[4] Chinese Acad Sci, Inst Biophys, CAS Ctr Excellence Biomacromolecules, Natl Lab Biomacromolecules, Beijing 100101, Peoples R China
[5] Shandong First Med Univ & Shandong Acad Med Sci, Jinan 250117, Shandong, Peoples R China
基金
中国国家自然科学基金; 中国博士后科学基金; 国家重点研发计划;
关键词
TANDEM REPEATS; GENE-EXPRESSION; FRAGILE-X; MICROSATELLITE REPEAT; STRUCTURAL VARIATION; POPULATION; MUTATIONS; DNA; DISCOVERY; EVOLUTION;
D O I
10.1038/s41467-023-37690-8
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Short tandem repeats (STRs) are abundant and highly mutagenic in the human genome. Many STR loci have been associated with a range of human genetic disorders. However, most population-scale studies on STR variation in humans have focused on European ancestry cohorts or are limited by sequencing depth. Here, we depicted a comprehensive map of 366,013 polymorphic STRs (pSTRs) constructed from 6487 deeply sequenced genomes, comprising 3983 Chinese samples (similar to 31.5x, NyuWa) and 2504 samples from the 1000 Genomes Project (similar to 33.3x, 1KGP). We found that STR mutations were affected by motif length, chromosome context and epigenetic features. We identified 3273 and 1117 pSTRs whose repeat numbers were associated with gene expression and 3 ' UTR alternative polyadenylation, respectively. We also implemented population analysis, investigated population differentiated signatures, and genotyped 60 known disease-causing STRs. Overall, this study further extends the scale of STR variation in humans and propels our understanding of the semantics of STRs.
引用
收藏
页数:18
相关论文
共 50 条
  • [21] Genome-wide localization of the polyphenol quercetin in human monocytes
    Atrahimovich, Dana
    Samson, Avraham O.
    Barsheshet, Yifthah
    Vaya, Jacob
    Khatib, Soliman
    Reuveni, Eli
    BMC GENOMICS, 2019, 20 (1)
  • [22] Genome-wide Identification and Characterization of Enhancers Across 10 Human Tissues
    Xiong, Lili
    Kang, Ran
    Ding, Ruofan
    Kang, Wenyuan
    Zhang, Yiming
    Liu, Wenrong
    Huang, Qingqing
    Meng, Junhua
    Guo, Zhiyun
    INTERNATIONAL JOURNAL OF BIOLOGICAL SCIENCES, 2018, 14 (10): : 1321 - 1332
  • [23] Assessing genome-wide copy number variation in the Han Chinese population
    Lu, Jianqi
    Lou, Haiyi
    Fu, Ruiqing
    Lu, Dongsheng
    Zhang, Feng
    Wu, Zhendong
    Zhang, Xi
    Li, Changhua
    Fang, Baijun
    Pu, Fangfang
    Wei, Jingning
    Wei, Qian
    Zhang, Chao
    Wang, Xiaoji
    Lu, Yan
    Yan, Shi
    Yang, Yajun
    Jin, Li
    Xu, Shuhua
    JOURNAL OF MEDICAL GENETICS, 2017, 54 (10) : 685 - 692
  • [24] Genome-wide characterization of mammalian promoters with distal enhancer functions
    Dao, Lan T. M.
    Galindo-Albarran, Ariel O.
    Castro-Mondragon, Jaime A.
    Andrieu-Soler, Charlotte
    Medina-Rivera, Alejandra
    Souaid, Charbel
    Charbonnier, Guillaume
    Griffon, Aurelien
    Vanhille, Laurent
    Stephen, Tharshana
    Alomairi, Jaafar
    Martin, David
    Torres, Magali
    Fernandez, Nicolas
    Soler, Eric
    van Helden, Jacques
    Puthier, Denis
    Spicuglia, Salvatore
    NATURE GENETICS, 2017, 49 (07) : 1073 - +
  • [25] Genome-wide identification and characterization of WOX genes in Cucumis sativus
    Han, Ni
    Tang, Rui
    Chen, Xueqian
    Xu, Zhixuan
    Ren, Zhonghai
    Wang, Lina
    GENOME, 2021, 64 (08) : 761 - 776
  • [26] Genome-Wide Characterization of Insertion and Deletion Variation in Chicken Using Next Generation Sequencing
    Yan, Yiyuan
    Yi, Guoqiang
    Sun, Congjiao
    Qu, Lujiang
    Yang, Ning
    PLOS ONE, 2014, 9 (08):
  • [27] Genome-wide map of regulatory interactions in the human genome
    Heidari, Nastaran
    Phanstiel, Douglas H.
    He, Chao
    Grubert, Fabian
    Jahanbani, Fereshteh
    Kasowski, Maya
    Zhang, Michael Q.
    Snyder, Michael P.
    GENOME RESEARCH, 2014, 24 (12) : 1905 - 1917
  • [28] The application of genome-wide SNP genotyping methods in studies on livestock genomes
    Gurgul, Artur
    Semik, Ewelina
    Pawlina, Klaudia
    Szmatoa, Tomasz
    Jasielczuk, Igor
    Bugno-Poniewierska, Monika
    JOURNAL OF APPLIED GENETICS, 2014, 55 (02) : 197 - 208
  • [29] Genome-wide identification and characterization of exapted transposable elements in the large genome of sunflower (Helianthus annuus L.)
    Ventimiglia, Maria
    Marturano, Giovanni
    Vangelisti, Alberto
    Usai, Gabriele
    Simoni, Samuel
    Cavallini, Andrea
    Giordani, Tommaso
    Natali, Lucia
    Zuccolo, Andrea
    Mascagni, Flavia
    PLANT JOURNAL, 2023, 113 (04) : 734 - 748
  • [30] Genome-wide patterns of copy number variation in the Chinese yak genome
    Zhang, Xiao
    Wang, Kun
    Wang, Lizhong
    Yang, Yongzhi
    Ni, Zhengqiang
    Xie, Xiuyue
    Shao, Xuemin
    Han, Jin
    Wan, Dongshi
    Qiu, Qiang
    BMC GENOMICS, 2016, 17