Mining combinatorial data in protein sequences and structures

被引:6
|
作者
Jacchieri, SG [1 ]
机构
[1] Fdn Antonio Prudente, Ctr Pesquisas, BR-01509090 Sao Paulo, SP, Brazil
关键词
combinatorial search; high structural propensity; peptide fragment;
D O I
10.1023/A:1016286720984
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Combinatorial searches of structural and physical chemical properties involving the components of libraries of dipeptide, tripeptide and tetra peptide fragments were carried out in the Protein Data Bank and the SwissProt databases. The properties investigated are structural propensities, co localization of peptide fragments in protein sequences, interactions between peptide fragments in close structural proximity and the participation of physical chemical profiles in the distribution of structural motifs among peptide fragments. The results obtained for each combinatorial search in the study are classified according to the structural motifs alpha-helix, beta-sheet, reverse turn I and reverse turn II. The application of combinatorial data mined in protein databases to the design of new peptide libraries is discussed. The present findings have implications for the study of protein structure which are also discussed.
引用
收藏
页码:145 / 152
页数:8
相关论文
共 50 条
  • [41] Amyloidogenic sequences in native protein structures
    Tzotzos, Susan
    Doig, Andrew J.
    PROTEIN SCIENCE, 2010, 19 (02) : 327 - 348
  • [42] Translating gene sequences to protein structures
    New Media Cent of Loyola Coll, Baltimore, United States
    Sci Comput Autom, 7 (53-54):
  • [43] Compressed Data Structures for Dynamic Sequences
    Munro, J. Ian
    Nekrich, Yakov
    ALGORITHMS - ESA 2015, 2015, 9294 : 891 - 902
  • [44] Deciphering membrane protein structures from protein sequences
    Tilman Flock
    AJ Venkatakrishnan
    KR Vinothkumar
    M Madan Babu
    Genome Biology, 13
  • [45] Deciphering membrane protein structures from protein sequences
    Flock, Tilman
    Venkatakrishnan, A. J.
    Vinothkumar, K. R.
    Babu, M. Madan
    GENOME BIOLOGY, 2012, 13 (06)
  • [46] Data mining in protein interactomics
    Chen, JY
    Sivachenko, AY
    IEEE ENGINEERING IN MEDICINE AND BIOLOGY MAGAZINE, 2005, 24 (03): : 95 - 102
  • [47] Identifying Factors Controlling Protein Release from Combinatorial Biomaterial Libraries via Hybrid Data Mining Methods
    Li, Xue
    Petersen, Latrisha
    Broderick, Scott
    Narasimhan, Balaji
    Rajan, Krishna
    ACS COMBINATORIAL SCIENCE, 2011, 13 (01) : 50 - 58
  • [48] Mining substructures in protein data
    Hadzic, Fedja
    Dillon, Tharam S.
    Sidhu, Amandeep S.
    Chang, Elizabeth
    Tan, Henry
    ICDM 2006: SIXTH IEEE INTERNATIONAL CONFERENCE ON DATA MINING, WORKSHOPS, 2006, : 213 - 217
  • [49] Data Mining the Protein Data Bank to Identify and Characterise Chameleon Coil Sequences that Form Symmetric Homodimer β-Sheet Interfaces
    Laibe, Johanna
    Broutin, Melanie
    Caffrey, Aaron
    Pierscionek, Barbara
    Nebel, Jean-Christophe
    BIOINFORMATICS AND BIOMEDICAL ENGINEERING, IWBBIO 2017, PT II, 2017, 10209 : 118 - 126
  • [50] GRAPHLET DATA MINING OF ENERGETICAL INTERACTION PATTERNS IN PROTEIN 3D STRUCTURES
    Henneges, Carsten
    Roettig, Marc
    Kohlbacher, Oliver
    Zell, Andreas
    ICFC 2010/ ICNC 2010: PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON FUZZY COMPUTATION AND INTERNATIONAL CONFERENCE ON NEURAL COMPUTATION, 2010, : 190 - 195