CDD: NCBI's conserved domain database

被引:2597
作者
Marchler-Bauer, Aron [1 ]
Derbyshire, Myra K. [1 ]
Gonzales, Noreen R. [1 ]
Lu, Shennan [1 ]
Chitsaz, Farideh [1 ]
Geer, Lewis Y. [1 ]
Geer, Renata C. [1 ]
He, Jane [1 ]
Gwadz, Marc [1 ]
Hurwitz, David I. [1 ]
Lanczycki, Christopher J. [1 ]
Lu, Fu [1 ]
Marchler, Gabriele H. [1 ]
Song, James S. [1 ]
Thanki, Narmada [1 ]
Wang, Zhouxi [1 ]
Yamashita, Roxanne A. [1 ]
Zhang, Dachuan [1 ]
Zheng, Chanjuan [1 ]
Bryant, Stephen H. [1 ]
机构
[1] NIH, Natl Lib Med, Natl Ctr Biotechnol Informat, Bethesda, MD 20894 USA
基金
美国国家卫生研究院;
关键词
D O I
10.1093/nar/gku1221
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
NCBI's CDD, the Conserved Domain Database, enters its 15(th) year as a public resource for the annotation of proteins with the location of conserved domain footprints. Going forward, we strive to improve the coverage and consistency of domain annotation provided by CDD. We maintain a live search system as well as an archive of pre-computed domain annotation for sequences tracked in NCBI's Entrez protein database, which can be retrieved for single sequences or in bulk. We also maintain import procedures so that CDD contains domain models and domain definitions provided by several collections available in the public domain, as well as those produced by an in-house curation effort. The curation effort aims at increasing coverage and providing finer-grained classifications of common protein domains, for which a wealth of functional and structural data has become available. CDD curation generates alignment models of representative sequence fragments, which are in agreement with domain boundaries as observed in protein 3D structure, and which model the structurally conserved cores of domain families as well as annotate conserved features.
引用
收藏
页码:D222 / D226
页数:5
相关论文
共 13 条
  • [1] Annotation of functional sites with the Conserved Domain Database
    Derbyshire, Myra K.
    Lanczycki, Christopher J.
    Bryant, Stephen H.
    Marchler-Bauer, Aron
    [J]. DATABASE-THE JOURNAL OF BIOLOGICAL DATABASES AND CURATION, 2012,
  • [2] Pfam: the protein families database
    Finn, Robert D.
    Bateman, Alex
    Clements, Jody
    Coggill, Penelope
    Eberhardt, Ruth Y.
    Eddy, Sean R.
    Heger, Andreas
    Hetherington, Kirstie
    Holm, Liisa
    Mistry, Jaina
    Sonnhammer, Erik L. L.
    Tate, John
    Punta, Marco
    [J]. NUCLEIC ACIDS RESEARCH, 2014, 42 (D1) : D222 - D230
  • [3] CDART: Protein homology by domain architecture
    Geer, LY
    Domrachev, M
    Lipman, DJ
    Bryant, SH
    [J]. GENOME RESEARCH, 2002, 12 (10) : 1619 - 1623
  • [4] Novel β-Propeller of the BTB-Kelch Protein Krp1 Provides a Binding Site for Lasp-1 That Is Necessary for Pseudopodial Extension
    Gray, Christopher H.
    McGarry, Lynn C.
    Spence, Heather J.
    Riboldi-Tunnicliffe, Alan
    Ozanne, Bradford W.
    [J]. JOURNAL OF BIOLOGICAL CHEMISTRY, 2009, 284 (44) : 30498 - 30507
  • [5] TIGRFAMs and Genome Properties in 2013
    Haft, Daniel H.
    Selengut, Jeremy D.
    Richter, Roland A.
    Harkins, Derek
    Basu, Malay K.
    Beck, Erin
    [J]. NUCLEIC ACIDS RESEARCH, 2013, 41 (D1) : D387 - D395
  • [6] The National Center for Biotechnology Information's Protein Clusters Database
    Klimke, William
    Agarwala, Richa
    Badretdin, Azat
    Chetvernin, Slava
    Ciufo, Stacy
    Fedorov, Boris
    Kiryutin, Boris
    O'Neill, Kathleen
    Resch, Wolfgang
    Resenchuk, Sergei
    Schafer, Susan
    Tolstoy, Igor
    Tatusova, Tatiana
    [J]. NUCLEIC ACIDS RESEARCH, 2009, 37 : D216 - D223
  • [7] SMART: recent updates, new developments and status in 2015
    Letunic, Ivica
    Doerks, Tobias
    Bork, Peer
    [J]. NUCLEIC ACIDS RESEARCH, 2015, 43 (D1) : D257 - D260
  • [8] MMDB and VAST+: tracking structural similarities between macromolecular complexes
    Madej, Thomas
    Lanczycki, Christopher J.
    Zhang, Dachuan
    Thiessen, Paul A.
    Geer, Renata C.
    Marchler-Bauer, Aron
    Bryant, Stephen H.
    [J]. NUCLEIC ACIDS RESEARCH, 2014, 42 (D1) : D297 - D303
  • [9] CDD: a curated Entrez database of conserved domain alignments
    Marchler-Bauer, A
    Anderson, JB
    DeWeese-Scott, C
    Fedorova, ND
    Geer, LY
    He, SQ
    Hurwitz, DI
    Jackson, JD
    Jacobs, AR
    Lanczycki, CJ
    Liebert, CA
    Liu, CL
    Madej, T
    Marchler, GH
    Mazumder, R
    Nikolskaya, AN
    Panchenko, AR
    Rao, BS
    Shoemaker, BA
    Simonyan, V
    Song, JS
    Thiessen, PA
    Vasudevan, S
    Wang, YL
    Yamashita, RA
    Yin, JJ
    Bryant, SH
    [J]. NUCLEIC ACIDS RESEARCH, 2003, 31 (01) : 383 - 387
  • [10] CD-Search: protein domain annotations on the fly
    Marchler-Bauer, A
    Bryant, SH
    [J]. NUCLEIC ACIDS RESEARCH, 2004, 32 : W327 - W331