Sequence analysis and rule development of predicting protein stability change upon mutation using decision tree model

Authors
Liang-Tsung Huang
M. Michael Gromiha
Shinn-Ying Ho
Affiliations
[1] Feng-Chia University, Institute of Information Engineering and Computer Science
[2] Ming-Dao University, Department of Computer Science and Information Engineering
[3] National Institute of Advanced Industrial Science and Technology (AIST), Computational Biology Research Center (CBRC)
[4] National Chiao Tung University, Department of Biological Science and Technology, and Institute of Bioinformatics
Source
Journal of Molecular Modeling | 2007 / Vol. 13, No. 8
Keywords
Bioinformatics; Data mining; Decision trees; Prediction; Protein stability
DOI
Not available
Abstract
Understanding the mechanism of protein stability change is one of the most challenging tasks. Recently, the prediction of protein stability change caused by single point mutations has become a topic of interest in molecular biology. However, it is desirable to acquire further knowledge from large databases to provide new insights into the nature of these changes. This paper presents an interpretable prediction tree method (named iPTREE-2) that can accurately predict changes in protein stability upon mutation from sequence-based information and analyze sequence characteristics from the viewpoint of composition and order. Because it is based on a regression tree algorithm, iPTREE-2 can also identify important factors and develop rules for the purpose of data mining. On a dataset of 1859 different single point mutations from the thermodynamic database ProTherm, iPTREE-2 yields a correlation coefficient of 0.70 between predicted and experimental values. In the data mining task, detailed analysis of the sequences suggests compositional specificity of residues in different ranges of stability change and implies the existence of certain patterns. In building rules, we found that the residues at the mutation site in the wild-type and mutant proteins play an important role. The present study demonstrates that iPTREE-2 can serve the purpose of predicting protein stability change, especially when more understandable knowledge is required.
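The general idea in the abstract (sequence-based features, a regression tree, and a correlation coefficient between predicted and experimental values) can be illustrated with a minimal sketch. This is not the iPTREE-2 implementation: the feature layout, the function names (`encode_mutation`, `fit_tree`, `predict`, `pearson`), the window size, and the tree settings are all assumptions made for illustration only.

```python
from math import sqrt

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"


def encode_mutation(sequence, position, mutant, window=3):
    """Encode one point mutation from sequence information only.

    Hypothetical feature layout: index of the wild-type residue, index of
    the mutant residue, then the count of each amino acid among neighbours
    within `window` positions of the (0-based) mutation site.
    Treating residue indices as ordered numbers is a crude simplification.
    """
    wild = sequence[position]
    start = max(0, position - window)
    neighbours = sequence[start:position] + sequence[position + 1:position + 1 + window]
    composition = [neighbours.count(aa) for aa in AMINO_ACIDS]
    return [AMINO_ACIDS.index(wild), AMINO_ACIDS.index(mutant)] + composition


def sse(values):
    """Sum of squared deviations from the mean."""
    mean = sum(values) / len(values)
    return sum((v - mean) ** 2 for v in values)


def fit_tree(rows, targets, depth=0, max_depth=3, min_leaf=2):
    """Fit a small regression tree; each split minimises squared error.

    A leaf is a float (mean target); an internal node is a tuple
    (feature_index, threshold, left_subtree, right_subtree).
    """
    mean = sum(targets) / len(targets)
    if depth >= max_depth or len(rows) < 2 * min_leaf:
        return mean
    best = None
    for f in range(len(rows[0])):
        # Candidate thresholds: every observed value except the largest.
        for threshold in sorted(set(r[f] for r in rows))[:-1]:
            left = [t for r, t in zip(rows, targets) if r[f] <= threshold]
            right = [t for r, t in zip(rows, targets) if r[f] > threshold]
            if len(left) < min_leaf or len(right) < min_leaf:
                continue
            error = sse(left) + sse(right)
            if best is None or error < best[0]:
                best = (error, f, threshold)
    if best is None:
        return mean
    _, f, threshold = best
    left_idx = [i for i, r in enumerate(rows) if r[f] <= threshold]
    right_idx = [i for i, r in enumerate(rows) if r[f] > threshold]
    return (
        f,
        threshold,
        fit_tree([rows[i] for i in left_idx], [targets[i] for i in left_idx],
                 depth + 1, max_depth, min_leaf),
        fit_tree([rows[i] for i in right_idx], [targets[i] for i in right_idx],
                 depth + 1, max_depth, min_leaf),
    )


def predict(tree, row):
    """Walk from the root to a leaf and return the leaf mean."""
    while isinstance(tree, tuple):
        f, threshold, left, right = tree
        tree = left if row[f] <= threshold else right
    return tree


def pearson(xs, ys):
    """Pearson correlation coefficient (undefined if either input is constant)."""
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    return cov / sqrt(var_x * var_y)
```

Each root-to-leaf path in such a tree reads as a rule of the form "if feature f is at most t, then predict this stability change", which is the kind of interpretability the abstract refers to. In practice one would report `pearson` on held-out mutations rather than on the training data.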
Pages: 879 - 890
Page count: 11