Machine Learning Approaches for Protein-Protein Interaction Hot Spot Prediction: Progress and Comparative Assessment

Cited by: 53
Authors
Liu, Siyu [1]
Liu, Chuyao [1]
Deng, Lei [1]
Affiliations
[1] Cent South Univ, Sch Software, Changsha 410075, Hunan, Peoples R China
Source
MOLECULES | 2018 / Vol. 23 / Issue 10
Funding
National Natural Science Foundation of China;
Keywords
hot spots; protein-protein interaction; machine learning; performance evaluation; AMINO-ACID; SOLVENT ACCESSIBILITY; BINDING-ENERGY; DATABASE; RESIDUES; SELECTION; CONSERVATION; INFORMATION; SEQUENCE; SERVER;
DOI
10.3390/molecules23102535
Chinese Library Classification (CLC) number
Q5 [Biochemistry]; Q7 [Molecular Biology];
Discipline classification code
071010 ; 081704 ;
Abstract
Hot spots are the subset of interface residues that account for most of the binding free energy, and they play essential roles in the stability of protein binding. Effectively identifying which specific interface residues of protein-protein complexes form the hot spots is critical for understanding the principles of protein interactions, and it has broad application prospects in protein design and drug development. Experimental methods like alanine scanning mutagenesis are labor-intensive and time-consuming. At present, the experimentally measured hot spots are very limited. Hence, the use of computational approaches to predicting hot spots is becoming increasingly important. Here, we describe the basic concepts and recent advances of machine learning applications in inferring the protein-protein interaction hot spots, and assess the performance of widely used features, machine learning algorithms, and existing state-of-the-art approaches. We also discuss the challenges and future directions in the prediction of hot spots.
Pages: 15