Characterization of genome-wide STR variation in 6487 human genomes

Cited by: 28
Authors
Shi, Yirong [1,2]
Niu, Yiwei [1,3]
Zhang, Peng [1]
Luo, Huaxia [1]
Liu, Shuai [1,3]
Zhang, Sijia [1,3]
Wang, Jiajia [1]
Li, Yanyan [1]
Liu, Xinyue [1,2]
Song, Tingrui [1]
Xu, Tao [4,5]
He, Shunmin [1,3]
Affiliations
[1] Chinese Acad Sci, Inst Biophys, Ctr Big Data Res Hlth, Key Lab RNA Biol, Beijing 100101, Peoples R China
[2] Univ Chinese Acad Sci, Beijing 100049, Peoples R China
[3] Univ Chinese Acad Sci, Coll Life Sci, Beijing 100049, Peoples R China
[4] Chinese Acad Sci, Inst Biophys, CAS Ctr Excellence Biomacromolecules, Natl Lab Biomacromolecules, Beijing 100101, Peoples R China
[5] Shandong First Med Univ & Shandong Acad Med Sci, Jinan 250117, Shandong, Peoples R China
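Funding
National Natural Science Foundation of China; National Key Research and Development Program of China; China Postdoctoral Science Foundation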
Keywords
TANDEM REPEATS; GENE-EXPRESSION; FRAGILE-X; MICROSATELLITE REPEAT; STRUCTURAL VARIATION; POPULATION; MUTATIONS; DNA; DISCOVERY; EVOLUTION;
DOI
10.1038/s41467-023-37690-8
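Chinese Library Classification
O [Mathematical sciences and chemistry]; P [Astronomy and earth sciences]; Q [Biological sciences]; N [Natural sciences, general]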
Discipline classification codes
07; 0710; 09
Abstract
Short tandem repeats (STRs) are abundant and highly mutagenic in the human genome. Many STR loci have been associated with a range of human genetic disorders. However, most population-scale studies on STR variation in humans have focused on European ancestry cohorts or are limited by sequencing depth. Here, we depicted a comprehensive map of 366,013 polymorphic STRs (pSTRs) constructed from 6487 deeply sequenced genomes, comprising 3983 Chinese samples (~31.5x, NyuWa) and 2504 samples from the 1000 Genomes Project (~33.3x, 1KGP). We found that STR mutations were affected by motif length, chromosome context and epigenetic features. We identified 3273 and 1117 pSTRs whose repeat numbers were associated with gene expression and 3' UTR alternative polyadenylation, respectively. We also implemented population analysis, investigated population differentiated signatures, and genotyped 60 known disease-causing STRs. Overall, this study further extends the scale of STR variation in humans and propels our understanding of the semantics of STRs.
Pages: 18
Related papers
50 in total
  • [31] Genome-Wide Structural Variation Detection by Genome Mapping on Nanochannel Arrays
    Mak, Angel C. Y.
    Lai, Yvonne Y. Y.
    Lam, Ernest T.
    Kwok, Tsz-Piu
    Leung, Alden K. Y.
    Poon, Annie
    Mostovoy, Yulia
    Hastie, Alex R.
    Stedman, William
    Anantharaman, Thomas
    Andrews, Warren
    Zhou, Xiang
    Pang, Andy W. C.
    Dai, Heng
    Chu, Catherine
    Lin, Chin
    Wu, Jacob J. K.
    Li, Catherine M. L.
    Li, Jing-Woei
    Yim, Aldrin K. Y.
    Chan, Saki
    Sibert, Justin
    Dzakula, Zeljko
    Cao, Han
    Yiu, Siu-Ming
    Chan, Ting-Fung
    Yip, Kevin Y.
    Xiao, Ming
    Kwok, Pui-Yan
    GENETICS, 2016, 202 (01) : 351 - +
  • [32] The crucial role of genome-wide genetic variation in conservation
    Kardos, Marty
    Armstrong, Ellie E.
    Fitzpatrick, Sarah W.
    Hauser, Samantha
    Hedrick, Philip W.
    Miller, Joshua M.
    Tallmon, David A.
    Funk, W. Chris
    PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2021, 118 (48)
  • [33] Genome-Wide Computational Analysis of Dioxin Response Element Location and Distribution in the Human, Mouse, and Rat Genomes
    Dere, Edward
    Forgacs, Agnes L.
    Zacharewski, Timothy R.
    Burgoon, Lyle D.
    CHEMICAL RESEARCH IN TOXICOLOGY, 2011, 24 (04) : 494 - 504
  • [34] Genome-wide mapping and characterization of microsatellites in the swamp eel genome
    Li, Zhigang
    Chen, Feng
    Huang, Chunhua
    Zheng, Weixin
    Yu, Chunlai
    Cheng, Hanhua
    Zhou, Rongjia
    SCIENTIFIC REPORTS, 2017, 7
  • [35] A genome-wide association study identifies multiple loci for variation in human ear morphology
    Adhikari, Kaustubh
    Reales, Guillermo
    Smith, Andrew J. P.
    Konka, Esra
    Palmen, Jutta
    Quinto-Sanchez, Mirsha
    Acuna-Alonzo, Victor
    Jaramillo, Claudia
    Arias, William
    Fuentes, Macarena
    Pizarro, Maria
    Barquera Lozano, Rodrigo
    Macin Perez, Gaston
    Gomez-Valdes, Jorge
    Villamil-Ramirez, Hugo
    Hunemeier, Tabita
    Ramallo, Virginia
    Silva de Cerqueira, Caio C.
    Hurtado, Malena
    Villegas, Valeria
    Granja, Vanessa
    Gallo, Carla
    Poletti, Giovanni
    Schuler-Faccini, Lavinia
    Salzano, Francisco M.
    Bortolini, Maria-Catira
    Canizales-Quinteros, Samuel
    Rothhammer, Francisco
    Bedoya, Gabriel
    Calderon, Rosario
    Rosique, Javier
    Cheeseman, Michael
    Bhutta, Mahmood F.
    Humphries, Steve E.
    Gonzalez-Jose, Rolando
    Headon, Denis
    Balding, David
    Ruiz-Linares, Andres
    NATURE COMMUNICATIONS, 2015, 6
  • [36] Genome-wide Identification and Characterization of Fixed Human-Specific Regulatory Regions
    Marnetto, Davide
    Molineris, Ivan
    Grassi, Elena
    Provero, Paolo
    AMERICAN JOURNAL OF HUMAN GENETICS, 2014, 95 (01) : 39 - 48
  • [37] Genome-wide analyses of human perisylvian cerebral cortical patterning
    Abrahams, B. S.
    Tentler, D.
    Perederiy, J. V.
    Oldham, M. C.
    Coppola, G.
    Geschwind, D. H.
    PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2007, 104 (45) : 17849 - 17854
  • [38] Genome-wide chromatin analysis in mature mouse and human spermatozoa
    Hisano, Mizue
    Erkek, Serap
    Dessus-Babus, Sophie
    Ramos, Liliana
    Stadler, Michael B.
    Peters, Antoine H. F. M.
    NATURE PROTOCOLS, 2013, 8 (12) : 2449 - 2470
  • [39] The landscape of human STR variation
    Willems, Thomas
    Gymrek, Melissa
    Highnam, Gareth
    Mittelman, David
    Erlich, Yaniv
    GENOME RESEARCH, 2014, 24 (11) : 1894 - 1904
  • [40] Genome-wide characterization of human minisatellite VNTRs: population-specific alleles and gene expression differences
    Rasekh, Marzieh Eslami
    Hernandez, Yozen
    Drinan, Samantha D.
    Bass, Juan I. Fuxman
    Benson, Gary
    NUCLEIC ACIDS RESEARCH, 2021, 49 (08) : 4308 - 4324