Observed Antibody Space: A diverse database of cleaned, annotated, and translated unpaired and paired antibody sequences

Cited by: 111
Authors
Olsen, Tobias H. [1]
Boyles, Fergus [1]
Deane, Charlotte M. [1]
Affiliations
[1] Univ Oxford, Dept Stat, Oxford OX1 3LB, England
Funding
Engineering and Physical Sciences Research Council (EPSRC), UK
Keywords
annotated antibody sequences; antibody database; antibody repertoire; antibody sequence; BCR-seq; Observed Antibody Space (OAS); receptor repertoire
DOI
10.1002/pro.4205
Chinese Library Classification (CLC)
Q5 [Biochemistry]; Q7 [Molecular Biology]
Discipline classification codes
071010; 081704
Abstract
The antibody repertoires of individuals and groups have been used to explore disease states, understand vaccine responses, and drive therapeutic development. The arrival of B-cell receptor repertoire sequencing has enabled researchers to get a snapshot of these antibody repertoires, and as more data are generated, increasingly in-depth studies are possible. However, most publicly available data only exist as raw FASTQ files, making the data hard to access, process, and compare. The Observed Antibody Space (OAS) database was created in 2018 to offer clean, annotated, and translated repertoire data. In this paper, we describe an update to OAS that has been driven by the increasing volume of data and the appearance of paired (VH/VL) sequence data. OAS is now accessible via a new web server, with standardized search parameters and a new sequence-based search option. The new database provides both nucleotides and amino acids for every sequence, with additional sequence annotations to make the data Minimal Information about Adaptive Immune Receptor Repertoire compliant, and comments on potential problems with the sequence. OAS now contains 25 new studies, including severe acute respiratory syndrome coronavirus 2 data and paired sequencing data. The new database is accessible at , and all data are freely available for download.
Pages: 141-146
Number of pages: 6