WebSTR: A Population-wide Database of Short Tandem Repeat Variation in Humans

被引:12
作者
Lundstrom, Oxana [1 ,2 ,3 ]
Verbiest, Max Adriaan [3 ,4 ,5 ]
Xia, Feifei [3 ,4 ,5 ]
Jam, Helyaneh Ziaei [7 ]
Zlobec, Inti [6 ]
Anisimova, Maria [3 ,4 ]
Gymrek, Melissa [7 ,8 ]
机构
[1] Stockholm Univ, Dept Biochem & Biophys, Stockholm, Sweden
[2] Vildly AB, Kalmar, Sweden
[3] Zurich Univ Appl Sci ZHAW, Inst Computat Life Sci, Sch Life Sci & Facil Management, Wadenswil, Switzerland
[4] Swiss Inst Bioinformat SIB, Lausanne, Switzerland
[5] Univ Zurich, Dept Mol Life Sci, Zurich, Switzerland
[6] Univ Bern, Inst Tissue Med & Pathol, Bern, Switzerland
[7] Univ Calif San Diego, Dept Comp Sci & Engn, La Jolla, CA 92093 USA
[8] Univ Calif San Diego, Dept Med, La Jolla, CA USA
基金
瑞士国家科学基金会;
关键词
human genetic variation; short tandem repeats; database; API; web portal; GENE-EXPRESSION;
D O I
10.1016/j.jmb.2023.168260
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Short tandem repeats (STRs) are consecutive repetitions of one to six nucleotide motifs. They are hyper -variable due to the high prevalence of repeat unit insertions or deletions primarily caused by polymerase slippage during replication. Genetic variation at STRs has been shown to influence a range of traits in humans, including gene expression, cancer risk, and autism. Until recently STRs have been poorly studied since they pose significant challenges to bioinformatics analyses. Moreover, genome-wide analysis of STR variation in population-scale cohorts requires large amounts of data and computational resources. However, the recent advent of genome-wide analysis tools has resulted in multiple large genome-wide datasets of STR variation spanning nearly two million genomic loci in thousands of individuals from diverse populations. Here we present WebSTR, a database of genetic variation and other characteris-tics of genome-wide STRs across human populations. WebSTR is based on reference panels of more than 1.7 million human STRs created with state of the art repeat annotation methods and can easily be extended to include additional cohorts or species. It currently contains data based on STR genotypes for individuals from the 1000 Genomes Project, H3Africa, the Genotype-Tissue Expression (GTEx) Project and colorectal cancer patients from the TCGA dataset. WebSTR is implemented as a relational database with programmatic access available through an API and a web portal for browsing data. The web portal is publicly available at https://webstr.ucsd.edu.(c) 2023 The Authors. Published by Elsevier Ltd. This is an open access article under the CC BY license (http://creativecom-mons.org/licenses/by/4.0/).
引用
收藏
页数:8
相关论文
共 50 条
  • [41] Mutation analysis of 19 commonly used short tandem repeat loci in a Guangdong Han population
    Xiao, Cheng
    Peng, Zhiyong
    Chen, Feilong
    Yan, Hui
    Zhu, Bofeng
    Tai, Yunchun
    Qiu, Pingming
    Liu, Chao
    Song, Xuheng
    Wu, Zihao
    Chen, Ling
    LEGAL MEDICINE, 2018, 32 : 92 - 97
  • [42] VARIATION IN SHORT TANDEM REPEAT SEQUENCES - A SURVEY OF 12 MICROSATELLITE LOCI FOR USE AS FORENSIC IDENTIFICATION MARKERS
    URQUHART, A
    KIMPTON, CP
    DOWNES, TJ
    GILL, P
    INTERNATIONAL JOURNAL OF LEGAL MEDICINE, 1994, 107 (01) : 13 - 20
  • [43] Genetic variation at nine short tandem repeat loci among islanders of the eastern Adriatic coast of Croatia
    Klaric, IM
    Pericic, M
    Lauc, LB
    Janicijevic, B
    Kubat, M
    Pavicic, D
    Rudan, I
    Wang, N
    Jin, L
    Chakraborty, R
    Deka, R
    Rudan, P
    HUMAN BIOLOGY, 2005, 77 (04) : 471 - 486
  • [44] LUSTR: a new customizable tool for calling genome-wide germline and somatic short tandem repeat variants
    Lu, Jinfeng
    Toro, Camilo
    Adams, David R.
    Acosta, Maria T.
    Adam, Margaret
    Alvarez, Raquel L.
    Alvey, Justin
    Amendola, Laura
    Andrews, Ashley
    Ashley, Euan A.
    Bacino, Carlos A.
    Bademci, Guney
    Balasubramanyam, Ashok
    Baldridge, Dustin
    Bale, Jim
    Bamshad, Michael
    Barbouth, Deborah
    Bayrak-Toydemir, Pinar
    Beck, Anita
    Beggs, Alan H.
    Behrens, Edward
    Bejerano, Gill
    Bellen, Hugo J.
    Bennett, Jimmy
    Berg-Rood, Beverly
    Bernstein, Jonathan A.
    Berry, Gerard T.
    Bican, Anna
    Bivona, Stephanie
    Blue, Elizabeth
    Bohnsack, John
    Bonner, Devon
    Botto, Lorenzo
    Boyd, Brenna
    Briere, Lauren C.
    Brown, Gabrielle
    Burke, Elizabeth A.
    Burrage, Lindsay C.
    Butte, Manish J.
    Byers, Peter
    Byrd, William E.
    Carey, John
    Carrasquillo, Olveen
    Cassini, Thomas
    Chang, Ta Chen Peter
    Chanprasert, Sirisak
    Chao, Hsiao-Tuan
    Chinn, Ivan
    Clark, Gary D.
    Coakley, Terra R.
    BMC GENOMICS, 2024, 25 (01)
  • [45] LUSTR: a new customizable tool for calling genome-wide germline and somatic short tandem repeat variants
    Jinfeng Lu
    Camilo Toro
    David R. Adams
    Cristiane Araujo Martins Moreno
    Wan-Ping Lee
    Yuk Yee Leung
    Mathew B. Harms
    Badri Vardarajan
    Erin L. Heinzen
    BMC Genomics, 25
  • [46] Seventeen Y-chromosomal short tandem repeat haplotypes in seven groups of population living in Taiwan
    Hsiao-Lin Hwa
    Li-Hui Tseng
    Tsang-Ming Ko
    Yih-Yuan Chang
    Hsiang-Yi Yin
    Yi-Ning Su
    James Chun-I Lee
    International Journal of Legal Medicine, 2010, 124 : 295 - 300
  • [47] Population genetic study of 10 short tandem repeat loci from 600 domestic dogs in Korea
    Moon, Seo Hyun
    Jang, Yoon-Jeong
    Han, Myun Soo
    Cho, Myung-Haing
    JOURNAL OF VETERINARY SCIENCE, 2016, 17 (03) : 391 - 398
  • [48] Seventeen Y-chromosomal short tandem repeat haplotypes in seven groups of population living in Taiwan
    Hwa, Hsiao-Lin
    Tseng, Li-Hui
    Ko, Tsang-Ming
    Chang, Yih-Yuan
    Yin, Hsiang-Yi
    Su, Yi-Ning
    Lee, James Chun-I
    INTERNATIONAL JOURNAL OF LEGAL MEDICINE, 2010, 124 (04) : 295 - 300
  • [49] A new allele of the short tandem repeat locus D21S11 in a Venezuelan population
    Lander, N
    Tovar, F
    Chiurillo, MA
    Ramirez, JL
    JOURNAL OF FORENSIC SCIENCES, 2006, 51 (03) : 695 - 695
  • [50] Structure and polymorphism of novel X-chromosome short tandem repeat loci in a Chinese Han population
    Zhu, Y. S.
    Wu, H.
    Lai, J. H.
    GENETICS AND MOLECULAR RESEARCH, 2015, 14 (04) : 15044 - 15049