Mining combinatorial data in protein sequences and structures

被引:6
|
作者
Jacchieri, SG [1 ]
机构
[1] Fdn Antonio Prudente, Ctr Pesquisas, BR-01509090 Sao Paulo, SP, Brazil
关键词
combinatorial search; high structural propensity; peptide fragment;
D O I
10.1023/A:1016286720984
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Combinatorial searches of structural and physical chemical properties involving the components of libraries of dipeptide, tripeptide and tetra peptide fragments were carried out in the Protein Data Bank and the SwissProt databases. The properties investigated are structural propensities, co localization of peptide fragments in protein sequences, interactions between peptide fragments in close structural proximity and the participation of physical chemical profiles in the distribution of structural motifs among peptide fragments. The results obtained for each combinatorial search in the study are classified according to the structural motifs alpha-helix, beta-sheet, reverse turn I and reverse turn II. The application of combinatorial data mined in protein databases to the design of new peptide libraries is discussed. The present findings have implications for the study of protein structure which are also discussed.
引用
收藏
页码:145 / 152
页数:8
相关论文
共 50 条
  • [21] Special issue on combinatorial optimization in data mining
    Chaovalitwongse, Wanpracha Art
    Seref, Onur
    JOURNAL OF COMBINATORIAL OPTIMIZATION, 2008, 15 (03) : 223 - 224
  • [22] A framework for data mining on combinatorial game theory
    Hooks, David
    Ding, Qin
    JOURNAL OF COMPUTATIONAL METHODS IN SCIENCES AND ENGINEERING, 2009, 9 (01) : S91 - S98
  • [23] Applying Combinatorial Testing to Data Mining Algorithms
    Chandrasekaran, Jaganmohan
    Feng, Huadong
    Lei, Yu
    Kuhn, D. Richard
    Kacker, Raghu
    10TH IEEE INTERNATIONAL CONFERENCE ON SOFTWARE TESTING, VERIFICATION AND VALIDATION WORKSHOPS - ICSTW 2017, 2017, : 253 - 261
  • [24] Special Issue on Combinatorial Optimization in Data Mining
    Wanpracha Art Chaovalitwongse
    Onur Seref
    Journal of Combinatorial Optimization, 2008, 15 : 223 - 224
  • [25] Data mining patented antibody sequences
    Krawczyk, Konrad
    Buchanan, Andrew
    Marcatili, Paolo
    MABS, 2021, 13 (01)
  • [26] Data mining for motifs in DNA sequences
    Bell, DA
    Guan, JW
    ROUGH SETS, FUZZY SETS, DATA MINING, AND GRANULAR COMPUTING, 2003, 2639 : 507 - 514
  • [27] Genome sequences and protein structures
    Teichmann, Sarah
    Park, Jong
    Chothia, Cyrus
    Proceedings of the Annual International Conference on Computational Molecular Biology, RECOMB, 1999,
  • [28] Evolution of protein sequences and structures
    Wood, TC
    Pearson, WR
    JOURNAL OF MOLECULAR BIOLOGY, 1999, 291 (04) : 977 - 995
  • [29] Optimization of classifiers for data mining based on combinatorial semigroups
    Kelarev, A. V.
    Yearwood, J. L.
    Watters, P. A.
    SEMIGROUP FORUM, 2011, 82 (02) : 242 - 251