An efficient Pearson correlation based improved random forest classification for protein structure prediction techniques

被引:24
作者
Kalaiselvi, B. [1 ]
Thangamani, M. [2 ]
机构
[1] Mahendra Engn Coll Women, Dept Comp Sci & Engn, Namakkal, Tamil Nadu, India
[2] Kongu Engn Coll, Dept Comp Sci & Engn, Perundurai, Erode, India
关键词
Amino acid features; Improved random forest classification; Protein structure; Weighted covariance; Weighted mean; Weighted pearson correlation; SECONDARY STRUCTURE PREDICTION;
D O I
10.1016/j.measurement.2020.107885
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
In biochemistry, the protein structure prediction from the primary sequence is a significant issue. Few research works are intended for performing protein structure prediction with assist of diverse data mining techniques. However, the existing technique does not provide enhanced performance for protein structure prediction. To resolve this limitation, Weighted Pearson Correlation based Improved Random Forest Classification (WPC-IRFC) Technique is introduced. The WPC-IRFC Technique is developed for enhancing the protein structure prediction performance with higher accuracy and lesser time. The WPC-IRFC uses Weighted Pearson Correlation (WPC) to select relevant amino acid features based on weighted mean and weighted covariance. After selecting the relevant amino acid features, WPC-IRFC Technique designs an Improved Random Forest Classification (IRFC) for predicting the protein structure from a big protein dataset (DS). IRFC significantly lessens the error rate of classification with aid of iteratively reweighted least squares model to accurately identify protein structures. (C) 2020 Elsevier Ltd. All rights reserved.
引用
收藏
页数:9
相关论文
共 25 条
  • [1] [Anonymous], INT J RES ADVENT TEC
  • [2] [Anonymous], EVOLUT COMPUT
  • [3] [Anonymous], INT J ENG SCI ADV TE
  • [4] [Anonymous], 2013, INT J COMPUT SCI MOB
  • [5] [Anonymous], 2017, INT J CURRENT ENG TE
  • [6] [Anonymous], INT J BIOSCI BIOTECH
  • [7] Chen J, 2007, IEEE ACM T COMPUT BI, V4, P572, DOI [10.1109/tcbb.2007.1055, 10.1109/TCBB.2007.1055]
  • [8] Cheng Jianlin, 2008, IEEE Rev Biomed Eng, V1, P41, DOI 10.1109/RBME.2008.2008239
  • [9] A Memetic Algorithm for 3D Protein Structure Prediction Problem
    Correa, Leonardo
    Borguesan, Bruno
    Farfan, Camilo
    Inostroza-Ponta, Mario
    Dorn, Marcio
    [J]. IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2018, 15 (03) : 690 - 704
  • [10] A Segmentation-Based Method to Extract Structural and Evolutionary Features for Protein Fold Recognition
    Dehzangi, Abdollah
    Paliwal, Kuldip
    Lyons, James
    Sharma, Alok
    Sattar, Abdul
    [J]. IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2014, 11 (03) : 510 - 519