Sequence analysis and rule development of predicting protein stability change upon mutation using decision tree model

被引:0
|
作者
Liang-Tsung Huang
M. Michael Gromiha
Shinn-Ying Ho
机构
[1] Feng-Chia University,Institute of Information Engineering and Computer Science
[2] Ming-Dao University,Department of Computer Science and Information Engineering
[3] National Institute of Advanced Industrial Science and Technology (AIST),Computational Biology Research Center (CBRC)
[4] National Chiao Tung University,Department of Biological Science and Technology, and Institute of Bioinformatics
来源
Journal of Molecular Modeling | 2007年 / 13卷
关键词
Bioinformatics; Data mining; Decision trees; Prediction; Protein stability;
D O I
暂无
中图分类号
学科分类号
摘要
Understanding the mechanism of the protein stability change is one of the most challenging tasks. Recently, the prediction of protein stability change affected by single point mutations has become an interesting topic in molecular biology. However, it is desirable to further acquire knowledge from large databases to provide new insights into the nature of them. This paper presents an interpretable prediction tree method (named iPTREE-2) that can accurately predict changes of protein stability upon mutations from sequence based information and analyze sequence characteristics from the viewpoint of composition and order. Therefore, iPTREE-2 based on a regression tree algorithm exhibits the ability of finding important factors and developing rules for the purpose of data mining. On a dataset of 1859 different single point mutations from thermodynamic database, ProTherm, iPTREE-2 yields a correlation coefficient of 0.70 between predicted and experimental values. In the task of data mining, detailed analysis of sequences reveals the possibility of the compositional specificity of residues in different ranges of stability change and implies the existence of certain patterns. As building rules, we found that the mutation residues in wild type and in mutant protein play an important role. The present study demonstrates that iPTREE-2 can serve the purpose of predicting protein stability change, especially when one requires more understandable knowledge.
引用
收藏
页码:879 / 890
页数:11
相关论文
共 36 条
  • [21] A NEW METHOD FOR PREDICTING VEGETATION DISTRIBUTIONS USING DECISION TREE ANALYSIS IN A GEOGRAPHIC INFORMATION-SYSTEM
    MOORE, DM
    LEES, BG
    DAVEY, SM
    ENVIRONMENTAL MANAGEMENT, 1991, 15 (01) : 59 - 71
  • [22] The diagnostic value of texture analysis in predicting WHO grades of meningiomas based on ADC maps: an attempt using decision tree and decision forest
    Yiping Lu
    Li Liu
    Shihai Luan
    Ji Xiong
    Daoying Geng
    Bo Yin
    European Radiology, 2019, 29 : 1318 - 1328
  • [23] The diagnostic value of texture analysis in predicting WHO grades of meningiomas based on ADC maps: an attempt using decision tree and decision forest
    Lu, Yiping
    Liu, Li
    Luan, Shihai
    Xiong, Ji
    Geng, Daoying
    Yin, Bo
    EUROPEAN RADIOLOGY, 2019, 29 (03) : 1318 - 1328
  • [24] Predicting Disease-Specific Survival for Patients With Primary Cholangiocarcinoma Undergoing Curative Resection by Using a Decision Tree Model
    Quan, Bing
    Li, Miao
    Lu, Shenxin
    Li, Jinghuan
    Liu, Wenfeng
    Zhang, Feng
    Chen, Rongxin
    Ren, Zhenggang
    Yin, Xin
    FRONTIERS IN ONCOLOGY, 2022, 12
  • [25] Analysis on land use change based on decision-tree model with comprehensive multi-scale characteristics
    Liang, Ming
    Sun, Yizhong
    Luo, Rong
    Hu, Zui
    Chen, Zhao
    Nongye Gongcheng Xuebao/Transactions of the Chinese Society of Agricultural Engineering, 2014, 30 (17): : 259 - 267
  • [26] Predicting membrane protein types using various decision tree classifiers based on various modes of general PseAAC for imbalanced datasets
    Sankari, E. Siva
    Manimegalai, D.
    JOURNAL OF THEORETICAL BIOLOGY, 2017, 435 : 208 - 217
  • [27] A Novel Clinical Prediction Model for Prognosis in Malignant Pleural Mesothelioma Using Decision Tree Analysis
    Brims, Fraser J. H.
    Meniawy, Tarek M.
    Duffus, Ian
    de Fonseka, Duneesha
    Segal, Amanda
    Creaney, Jenette
    Maskell, Nicholas
    Lake, Richard A.
    de Klerk, Nick
    Nowak, Anna K.
    JOURNAL OF THORACIC ONCOLOGY, 2016, 11 (04) : 573 - 582
  • [28] Surrogate Model Development for Slope Stability Analysis Using Machine Learning
    Li, Xianfeng
    Nishio, Mayuko
    Sugawara, Kentaro
    Iwanaga, Shoji
    Chun, Pang-jo
    SUSTAINABILITY, 2023, 15 (14)
  • [29] Analysis of Design Change Mechanism in Apartment Housing Projects Using Association Rule Mining (ARM) Model
    Kim, Moonhwan
    Lee, Joosung
    Kim, Jaejun
    APPLIED SCIENCES-BASEL, 2022, 12 (21):
  • [30] Development of a model for analysis of slope stability for circular mode failure using genetic algorithm
    Manouchehrian, Amin
    Gholamnejad, Javad
    Sharifzadeh, Mostafa
    ENVIRONMENTAL EARTH SCIENCES, 2014, 71 (03) : 1267 - 1277