Characterization of genome-wide STR variation in 6487 human genomes

Cited by: 28
Authors
Shi, Yirong [1, 2]
Niu, Yiwei [1, 3]
Zhang, Peng [1]
Luo, Huaxia [1]
Liu, Shuai [1, 3]
Zhang, Sijia [1, 3]
Wang, Jiajia [1]
Li, Yanyan [1]
Liu, Xinyue [1, 2]
Song, Tingrui [1]
Xu, Tao [4, 5]
He, Shunmin [1, 3]
Affiliations
[1] Chinese Acad Sci, Inst Biophys, Ctr Big Data Res Hlth, Key Lab RNA Biol, Beijing 100101, Peoples R China
[2] Univ Chinese Acad Sci, Beijing 100049, Peoples R China
[3] Univ Chinese Acad Sci, Coll Life Sci, Beijing 100049, Peoples R China
[4] Chinese Acad Sci, Inst Biophys, CAS Ctr Excellence Biomacromolecules, Natl Lab Biomacromolecules, Beijing 100101, Peoples R China
[5] Shandong First Med Univ & Shandong Acad Med Sci, Jinan 250117, Shandong, Peoples R China
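Funding
National Natural Science Foundation of China; National Key Research and Development Program of China; China Postdoctoral Science Foundation
Keywords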
TANDEM REPEATS; GENE-EXPRESSION; FRAGILE-X; MICROSATELLITE REPEAT; STRUCTURAL VARIATION; POPULATION; MUTATIONS; DNA; DISCOVERY; EVOLUTION;
DOI
10.1038/s41467-023-37690-8
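Chinese Library Classification
O [Mathematical and physical sciences and chemistry]; P [Astronomy and earth sciences]; Q [Biological sciences]; N [General natural sciences]
Discipline classification codes
07; 0710; 09
Abstract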
Short tandem repeats (STRs) are abundant and highly mutagenic in the human genome. Many STR loci have been associated with a range of human genetic disorders. However, most population-scale studies on STR variation in humans have focused on European ancestry cohorts or are limited by sequencing depth. Here, we depicted a comprehensive map of 366,013 polymorphic STRs (pSTRs) constructed from 6487 deeply sequenced genomes, comprising 3983 Chinese samples (~31.5x, NyuWa) and 2504 samples from the 1000 Genomes Project (~33.3x, 1KGP). We found that STR mutations were affected by motif length, chromosome context and epigenetic features. We identified 3273 and 1117 pSTRs whose repeat numbers were associated with gene expression and 3' UTR alternative polyadenylation, respectively. We also implemented population analysis, investigated population differentiated signatures, and genotyped 60 known disease-causing STRs. Overall, this study further extends the scale of STR variation in humans and propels our understanding of the semantics of STRs.
Pages: 18