WebSTR: A Population-wide Database of Short Tandem Repeat Variation in Humans

被引:12
作者
Lundstrom, Oxana [1 ,2 ,3 ]
Verbiest, Max Adriaan [3 ,4 ,5 ]
Xia, Feifei [3 ,4 ,5 ]
Jam, Helyaneh Ziaei [7 ]
Zlobec, Inti [6 ]
Anisimova, Maria [3 ,4 ]
Gymrek, Melissa [7 ,8 ]
机构
[1] Stockholm Univ, Dept Biochem & Biophys, Stockholm, Sweden
[2] Vildly AB, Kalmar, Sweden
[3] Zurich Univ Appl Sci ZHAW, Inst Computat Life Sci, Sch Life Sci & Facil Management, Wadenswil, Switzerland
[4] Swiss Inst Bioinformat SIB, Lausanne, Switzerland
[5] Univ Zurich, Dept Mol Life Sci, Zurich, Switzerland
[6] Univ Bern, Inst Tissue Med & Pathol, Bern, Switzerland
[7] Univ Calif San Diego, Dept Comp Sci & Engn, La Jolla, CA 92093 USA
[8] Univ Calif San Diego, Dept Med, La Jolla, CA USA
基金
瑞士国家科学基金会;
关键词
human genetic variation; short tandem repeats; database; API; web portal; GENE-EXPRESSION;
D O I
10.1016/j.jmb.2023.168260
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Short tandem repeats (STRs) are consecutive repetitions of one to six nucleotide motifs. They are hyper -variable due to the high prevalence of repeat unit insertions or deletions primarily caused by polymerase slippage during replication. Genetic variation at STRs has been shown to influence a range of traits in humans, including gene expression, cancer risk, and autism. Until recently STRs have been poorly studied since they pose significant challenges to bioinformatics analyses. Moreover, genome-wide analysis of STR variation in population-scale cohorts requires large amounts of data and computational resources. However, the recent advent of genome-wide analysis tools has resulted in multiple large genome-wide datasets of STR variation spanning nearly two million genomic loci in thousands of individuals from diverse populations. Here we present WebSTR, a database of genetic variation and other characteris-tics of genome-wide STRs across human populations. WebSTR is based on reference panels of more than 1.7 million human STRs created with state of the art repeat annotation methods and can easily be extended to include additional cohorts or species. It currently contains data based on STR genotypes for individuals from the 1000 Genomes Project, H3Africa, the Genotype-Tissue Expression (GTEx) Project and colorectal cancer patients from the TCGA dataset. WebSTR is implemented as a relational database with programmatic access available through an API and a web portal for browsing data. The web portal is publicly available at https://webstr.ucsd.edu.(c) 2023 The Authors. Published by Elsevier Ltd. This is an open access article under the CC BY license (http://creativecom-mons.org/licenses/by/4.0/).
引用
收藏
页数:8
相关论文
共 50 条
  • [1] Genetic variation for five short tandem repeat loci in a Central China population sample
    Ying, B. W.
    Fan, H.
    Liu, T. T.
    Zhao, Z. H.
    Liang, Z. H.
    Feng, S.
    Yuan, W. A.
    Yun, L. B.
    JOURNAL OF FORENSIC SCIENCES, 2006, 51 (05) : 1201 - 1201
  • [2] Genome-wide evaluation of the effect of short tandem repeat variation on local DNA methylation
    Martin-Trujillo, Alejandro
    Garg, Paras
    Patel, Nihir
    Jadhav, Bharati
    Sharp, Andrew J.
    GENOME RESEARCH, 2023, 33 (02) : 184 - 196
  • [3] The overdue promise of short tandem repeat variation for heritability
    Press, Maximilian O.
    Carlson, Keisha D.
    Queitsch, Christine
    TRENDS IN GENETICS, 2014, 30 (11) : 504 - 512
  • [4] Genetic variation for 15 short tandem repeat loci in an El Salvadoran (Central America) population
    Monterrosa, JC
    Morales, JA
    García, O
    JOURNAL OF FORENSIC SCIENCES, 2006, 51 (02) : 451 - 452
  • [5] Population data on eight short tandem repeat loci in the Barbadian population
    Henry, RAC
    Alleyne, LE
    Budowle, B
    JOURNAL OF FORENSIC SCIENCES, 2006, 51 (02) : 440 - 441
  • [6] Population data of 8 short tandem repeat loci in the Thai population
    Sueblinvong, T
    Kongsrisook, U
    FORENSIC SCIENCE INTERNATIONAL, 1999, 103 (03) : 199 - 205
  • [7] Polymorphism of three short tandem repeat loci in Chinese population
    Ran, P
    Zhang, BL
    Zhou, B
    Bai, P
    Zhou, XP
    Chen, K
    Li, YB
    Hou, YP
    Wu, J
    JOURNAL OF FORENSIC SCIENCES, 2006, 51 (02) : 450 - 450
  • [8] Analysis of short tandem repeat polymorphisms in the southwest Chinese population
    Zhang, Weijuan
    Xu, Jiejie
    JOURNAL OF FORENSIC SCIENCES, 2006, 51 (06) : 1421 - 1421
  • [9] Y chromosomal short tandem repeat haplotypes in the Japanese population
    Tamura, Akiyoshi
    Iwata, Misa
    Takase, Izumi
    Miyazaki, Tokiko
    Fukunishi, Shinya
    Nishio, Hajime
    Suzuki, Koichi
    JOURNAL OF FORENSIC SCIENCES, 2006, 51 (06) : 1431 - 1433
  • [10] Genetic variation and population structure of Botswana populations as identified with AmpFLSTR Identifiler short tandem repeat (STR) loci
    Tau, Tiroyamodimo
    Wally, Anthony
    Fanie, Thokozile Patricia
    Ngono, Goitseone Lorato
    Mpoloka, Sununguko Wata
    Davison, Sean
    D'Amato, Maria Eugenia
    SCIENTIFIC REPORTS, 2017, 7