SEQATOMS: a web tool for identifying missing regions in PDB in sequence context

被引:19
作者
Brandt, Bernd W. [1 ]
Heringa, Jaap [1 ]
Leunissen, Jack A. M. [2 ]
机构
[1] Vrije Univ Amsterdam, Ctr Integrat Bioinformat, IBIVU, NL-1081 HV Amsterdam, Netherlands
[2] Wageningen Univ, Lab Bioinformat, NL-6700 ET Wageningen, Netherlands
关键词
D O I
10.1093/nar/gkn237
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
With over 46 000 proteins, the Protein Data Bank (PDB) is the most important database with structural information of biological macromolecules. PDB files contain sequence and coordinate information. Residues present in the sequence can be absent from the coordinate section, which means their position in space is unknown. Similarity searches are routinely carried out against sequences taken from PDB SEQRES. However, there no distinction is made between residues that have a known or unknown position in the 3D protein structure. We present a FASTA sequence database that is produced by combining the sequence and coordinate information. All residues absent from the PDB coordinate section are masked with lower-case letters, thereby providing a view of these residues in the context of the entire protein sequence, which facilitates inspecting missing regions. We also provide a masked version of the CATH domain database. A user-friendly BLAST interface is available for similarity searching. In contrast to standard (stand-alone) BLAST output, which only contains upper-case letters, our output retains the lower-case letters of the masked regions. Thus, our server can be used to perform BLAST searching case-sensitively. Here, we have applied it to the study of missing regions in their sequence context. SEQATOMS is available at http://www.bioinformatics.nl/tools/seqatoms/.
引用
收藏
页码:W255 / W259
页数:5
相关论文
共 13 条
[1]   Gapped BLAST and PSI-BLAST: a new generation of protein database search programs [J].
Altschul, SF ;
Madden, TL ;
Schaffer, AA ;
Zhang, JH ;
Zhang, Z ;
Miller, W ;
Lipman, DJ .
NUCLEIC ACIDS RESEARCH, 1997, 25 (17) :3389-3402
[2]   The worldwide Protein Data Bank (wwPDB): ensuring a single, uniform archive of PDB data [J].
Berman, Helen ;
Henrick, Kim ;
Nakamura, Haruki ;
Markley, John L. .
NUCLEIC ACIDS RESEARCH, 2007, 35 :D301-D303
[3]   Flexible nets - The roles of intrinsic disorder in protein interaction networks [J].
Dunker, AK ;
Cortese, MS ;
Romero, P ;
Iakoucheva, LM ;
Uversky, VN .
FEBS JOURNAL, 2005, 272 (20) :5129-5148
[4]   FoldUnfold: web server for the prediction of disordered regions in protein chain [J].
Galzitskaya, Oxana V. ;
Garbuzynskiy, Sergiy O. ;
Lobanov, Michail Yu. .
BIOINFORMATICS, 2006, 22 (23) :2948-2949
[5]   The CATH domain structure database: new protocols and classification levels give a more comprehensive resource for exploring evolution [J].
Greene, Lesley H. ;
Lewis, Tony E. ;
Addou, Sarah ;
Cuff, Alison ;
Dallman, Tim ;
Dibley, Mark ;
Redfern, Oliver ;
Pearl, Frances ;
Nambudiry, Rekha ;
Reid, Adam ;
Sillitoe, Ian ;
Yeats, Corin ;
Thornton, Janet M. ;
Orengo, Christine A. .
NUCLEIC ACIDS RESEARCH, 2007, 35 :D291-D297
[6]   GlobPlot: exploring protein sequences for globularity and disorder [J].
Linding, R ;
Russell, RB ;
Neduva, V ;
Gibson, TJ .
NUCLEIC ACIDS RESEARCH, 2003, 31 (13) :3701-3708
[7]   Protein disorder prediction: Implications for structural proteomics [J].
Linding, R ;
Jensen, LJ ;
Diella, F ;
Bork, P ;
Gibson, TJ ;
Russell, RB .
STRUCTURE, 2003, 11 (11) :1453-1459
[8]   Intrinsic disorder in transcription factors [J].
Liu, Jiangang ;
Perumal, Narayanan B. ;
Oldfield, Christopher J. ;
Su, Eric W. ;
Uversky, Vladimir N. ;
Dunker, A. Keith .
BIOCHEMISTRY, 2006, 45 (22) :6873-6888
[9]   DisProt: the database of disordered proteins [J].
Sickmeier, Megan ;
Hamilton, Justin A. ;
LeGall, Tanguy ;
Vacic, Vladimir ;
Cortese, Marc S. ;
Tantos, Agnes ;
Szabo, Beata ;
Tompa, Peter ;
Chen, Jake ;
Uversky, Vladimir N. ;
Obradovic, Zoran ;
Dunker, A. Keith .
NUCLEIC ACIDS RESEARCH, 2007, 35 :D786-D793
[10]   The bioperl toolkit:: Perl modules for the life sciences [J].
Stajich, JE ;
Block, D ;
Boulez, K ;
Brenner, SE ;
Chervitz, SA ;
Dagdigian, C ;
Fuellen, G ;
Gilbert, JGR ;
Korf, I ;
Lapp, H ;
Lehväslaiho, H ;
Matsalla, C ;
Mungall, CJ ;
Osborne, BI ;
Pocock, MR ;
Schattner, P ;
Senger, M ;
Stein, LD ;
Stupka, E ;
Wilkinson, MD ;
Birney, E .
GENOME RESEARCH, 2002, 12 (10) :1611-1618