ProtFold-DFG: protein fold recognition by combining Directed Fusion Graph and PageRank algorithm

被引:38
|
作者
Shao, Jiangyi [1 ]
Liu, Bin [1 ]
机构
[1] Beijing Inst Technol, Sch Comp Sci & Technol, Beijing, Peoples R China
基金
国家重点研发计划; 中国国家自然科学基金; 北京市自然科学基金;
关键词
protein fold recognition; Directed Fusion Graph; PageRank; KL divergence; transitive closure; HIDDEN MARKOV-MODELS; HOMOLOGY DETECTION; CLASSIFICATION; INFORMATION; ALIGNMENT; FEATURES;
D O I
10.1093/bib/bbaa192
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
As one of the most important tasks in protein structure prediction, protein fold recognition has attracted more and more attention. In this regard, some computational predictors have been proposed with the development of machine learning and artificial intelligence techniques. However, these existing computational methods are still suffering from some disadvantages. In this regard, we propose a new network-based predictor called ProtFold-DFG for protein fold recognition. We propose the Directed Fusion Graph (DFG) to fuse the ranking lists generated by different methods, which employs the transitive closure to incorporate more relationships among proteins and uses the KL divergence to calculate the relationship between two proteins so as to improve its generalization ability. Finally, the PageRank algorithm is performed on the DFG to accurately recognize the protein folds by considering the global interactions among proteins in the DFG. Tested on a widely used and rigorous benchmark data set, LINDAHL dataset, experimental results show that the ProtFold-DFG outperforms the other 35 competing methods, indicating that ProtFold-DFG will be a useful method for protein fold recognition.
引用
收藏
页数:11
相关论文
共 17 条
  • [1] FoldRec-C2C: protein fold recognition by combining cluster-to-cluster model and protein similarity network
    Shao, Jiangyi
    Yan, Ke
    Liu, Bin
    BRIEFINGS IN BIOINFORMATICS, 2021, 22 (03)
  • [2] Protein Fold Recognition by Combining Support Vector Machines and Pairwise Sequence Similarity Scores
    Yan, Ke
    Wen, Jie
    Liu, Jin-Xing
    Xu, Yong
    Liu, Bin
    IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2021, 18 (05) : 2008 - 2016
  • [3] PRFold-TNN: Protein Fold Recognition With an Ensemble Feature Selection Method Using PageRank Algorithm Based on Transformer
    Qin, Xinyi
    Zhang, Lu
    Liu, Min
    Liu, Guangzhong
    IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2024, 21 (06) : 1740 - 1751
  • [4] DeepSVM-fold: protein fold recognition by combining support vector machines and pairwise sequence similarity scores generated by deep learning networks
    Liu, Bin
    Li, Chen-Chen
    Yan, Ke
    BRIEFINGS IN BIOINFORMATICS, 2020, 21 (05) : 1733 - 1741
  • [5] Fusion of classifiers for protein fold recognition
    Nanni, L
    NEUROCOMPUTING, 2005, 68 : 315 - 321
  • [6] ResCNNT-fold: Combining residual convolutional neural network and Transformer for protein fold recognition from language model embeddings
    Qin, Xinyi
    Liu, Min
    Liu, Guangzhong
    COMPUTERS IN BIOLOGY AND MEDICINE, 2023, 166
  • [7] Protein Fold Recognition with Adaptive Local Hyperplane Algorithm
    Kecman, Vojislav
    Yang, Tao
    CIBCB: 2009 IEEE SYMPOSIUM ON COMPUTATIONAL INTELLIGENCE IN BIOINFORMATICS AND COMPUTATIONAL BIOLOGY, 2009, : 75 - +
  • [8] Protein fold recognition with support vector machines fusion network
    Shi, JY
    Pan, Q
    Zhang, SW
    Liang, Y
    PROGRESS IN BIOCHEMISTRY AND BIOPHYSICS, 2006, 33 (02) : 155 - 162
  • [9] An advanced pre-positioning method for the force-directed graph visualization based on pagerank algorithm
    Dong, Wenqiang
    Wang, Fulai
    Huang, Yu
    Xu, Guangluan
    Guo, Zhi
    Fu, Xingyu
    Fu, Kun
    COMPUTERS & GRAPHICS-UK, 2015, 47 : 24 - 33
  • [10] A Grammatical Inference Sequential Mining Algorithm for Protein Fold Recognition
    Soliman, Taysir Hassan A.
    Eldin, Ahmed Sharaf
    Ghareeb, Marwa M.
    Marie, Mohammed E.
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2014, 5 (12) : 97 - 106