The effect of relational background knowledge on learning of protein three-dimensional fold signatures

被引:13
|
作者
Turcotte, M
Muggleton, SH
Sternberg, MJE
机构
[1] Imperial Canc Res Fund, Biomolec Modelling Lab, London WC2A 3PX, England
[2] Univ York, Dept Comp Sci, York YO1 5DD, N Yorkshire, England
基金
英国工程与自然科学研究理事会; 英国生物技术与生命科学研究理事会;
关键词
inductive logic programming; scientific discovery; protein fold;
D O I
10.1023/A:1007672817406
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
As a form of Machine Learning the study of Inductive Logic Programming (ILP) is motivated by a central belief: relational description languages are better tin terms of accuracy and understandability) than propositional ones for certain real-world applications. This claim is investigated here for a particular application in structural molecular biology, that of constructing readable descriptions of the major protein folds. To the authors' knowledge Machine Learning has not previously been applied systematically to this task. In this application, the domain expert (third author) identified a natural divide between essentially propositional features and more structurally-oriented relational ones. The following null hypotheses are tested: 1) for a given ILP system (Progol) provision of relational background knowledge does not increase predictive accuracy, 2) a good propositional learning system (C5.0) without relational background knowledge will outperform Progol with relational background knowledge, 3) relational background knowledge does not produce improved explanatory insight. Null hypotheses 1) and 2) are both refuted on cross-validation results carried out over 20 of the most populated protein folds. Hypothesis 3 is refuted by demonstration of various insightful rules discovered only in the relationally-oriented learned rules.
引用
收藏
页码:81 / 95
页数:15
相关论文
共 50 条
  • [1] The Effect of Relational Background Knowledge on Learning of Protein Three-Dimensional Fold Signatures
    Marcel Turcotte
    Stephen H. Muggleton
    Michael J.E. Sternberg
    Machine Learning, 2001, 43 : 81 - 95
  • [2] Generating protein three-dimensional fold signatures using inductive logic programming
    Turcotte, M
    Muggleton, SH
    Sternberg, MJE
    COMPUTERS & CHEMISTRY, 2001, 26 (01): : 57 - 64
  • [3] Research on Virtual Three-Dimensional Clothing Styles and Fold Effect Show
    Li, Jingjing
    PROCEEDINGS OF THE 2016 2ND WORKSHOP ON ADVANCED RESEARCH AND TECHNOLOGY IN INDUSTRY APPLICATIONS, 2016, 81 : 261 - 264
  • [4] Determining the three-dimensional fold of a protein from approximate constraintsA simulation study
    Kizhake V. Soman
    Werner Braun
    Cell Biochemistry and Biophysics, 2001, 34 : 283 - 304
  • [5] Determining the three-dimensional fold of a protein from approximate constraints - A simulation study
    Soman, KV
    Braun, W
    CELL BIOCHEMISTRY AND BIOPHYSICS, 2001, 34 (03) : 283 - 304
  • [6] Three-dimensional invariants and classification of Ladar signatures
    Sadjadi, F
    AUTOMATIC TARGET RECOGNITION XIV, 2004, 5426 : 124 - 132
  • [7] Three-dimensional morphological signatures for nuclear forensics
    Chung, Brandon
    Roberts, David
    Talbot, William
    Ashley, Denise
    Teslich, Nick
    ABSTRACTS OF PAPERS OF THE AMERICAN CHEMICAL SOCIETY, 2017, 254
  • [8] Photoactive yellow protein: A structural prototype for the three-dimensional fold of the PAS domain superfamily
    Pellequer, JL
    Wager-Smith, KA
    Kay, SA
    Getzoff, ED
    PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 1998, 95 (11) : 5884 - 5890
  • [9] Three-dimensional structure of iminodisuccinate epimerase defines the fold of the MmgE/PrpD protein family
    Lohkamp, Bernhard
    Baeuerle, Bettina
    Rieger, Paul-Gerhard
    Schneider, Gunter
    JOURNAL OF MOLECULAR BIOLOGY, 2006, 362 (03) : 555 - 566
  • [10] A three-dimensional model of vocal fold abduction/adduction
    Hunter, EJ
    Titze, IR
    Alipour, F
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2004, 115 (04): : 1747 - 1759