PRFold-TNN: Protein Fold Recognition With an Ensemble Feature Selection Method Using PageRank Algorithm Based on Transformer

被引:0
|
作者
Qin, Xinyi [1 ]
Zhang, Lu [2 ]
Liu, Min [1 ]
Liu, Guangzhong [1 ]
机构
[1] Shanghai Maritime Univ, Coll Informat Engn, Shanghai 201306, Peoples R China
[2] Jiangnan Univ, Sch Internet Things Engn, Wuxi 214063, Jiangsu, Peoples R China
关键词
Feature extraction; Hidden Markov models; Amino acids; Protein sequence; Predictive models; Prediction algorithms; Vectors; ASTRAL; ensemble feature selection method; PageRank algorithm; protein fold recognition; transformer; SCORING MATRIX; CLASSIFICATION; PREDICTION; PROBABILITIES; NMR;
D O I
10.1109/TCBB.2024.3414497
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Understanding the tertiary structures of proteins is of great benefit to function in many aspects of human life. Protein fold recognition is a vital and salient means to know protein structure. Until now, researchers have successively proposed a variety of methods to realize protein fold recognition, but the novel and effective computational method is still needed to handle this problem with the continuous updating of protein structure databases. In this study, we develop a new protein structure dataset named AT and propose the PRFold-TNN model for protein fold recognition. First, different types of feature extraction methods including AAC, HMM, HMM-Bigram and ACC are selected to extract corresponding features for protein sequences. Then an ensemble feature selection method based on PageRank algorithm integrating various tree-based algorithms is used to screen the fusion features. Ultimately, the classifier based on the Transformer model achieves the final prediction. Experiments show that the prediction accuracy is 86.27% on the AT dataset and 88.91% on the independent test set, indicating that the model can demonstrate superior performance and generalization ability in the problem of protein fold recognition. Furthermore, we also carry out research on the DD, EDD and TG benchmark datasets, and make them achieve prediction accuracy of 88.41%, 97.91% and 95.16%, which are at least 3.0%, 0.8% and 2.5% higher than those of the state-of-the-art methods. It can be concluded that the PRFold-TNN model is more prominent.
引用
收藏
页码:1740 / 1751
页数:12
相关论文
共 24 条
  • [1] An ensemble feature selection algorithm based on PageRank centrality and fuzzy logic
    Joodaki, Mehdi
    Dowlatshahi, Mohammad Bagher
    Joodaki, Nazanin Zahra
    KNOWLEDGE-BASED SYSTEMS, 2021, 233
  • [2] An Effective Feature Selection Algorithm Based on the Class Similarity Used with a SVM-RDA Classifier to Protein Fold Recognition
    Chmielnicki, Wieslaw
    Stapor, Katarzyna
    HYBRID ARTIFICIAL INTELLIGENT SYSTEMS, PART II, 2011, 6679 : 205 - +
  • [3] Genetic algorithm based feature selection method development for pattern recognition
    Kim, Ho-Duck
    Park, Chang-Hyun
    Yang, Hyun-Chang
    Sim, Kwee-Bo
    2006 SICE-ICASE INTERNATIONAL JOINT CONFERENCE, VOLS 1-13, 2006, : 5382 - +
  • [4] A novel ensemble-based wrapper method for feature selection using extreme learning machine and genetic algorithm
    Xue, Xiaowei
    Yao, Min
    Wu, Zhaohui
    KNOWLEDGE AND INFORMATION SYSTEMS, 2018, 57 (02) : 389 - 412
  • [5] An optimal signal selection method based on feature neighborhood using for human gait mode recognition
    Zhang, Miao
    Sun, Ronglei
    BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2024, 93
  • [6] Feature Selection for Facial Emotion Recognition Using Cosine Similarity-Based Harmony Search Algorithm
    Saha, Soumyajit
    Ghosh, Manosij
    Ghosh, Soulib
    Sen, Shibaprasad
    Singh, Pawan Kumar
    Geem, Zong Woo
    Sarkar, Ram
    APPLIED SCIENCES-BASEL, 2020, 10 (08):
  • [7] Feature selection for facial emotion recognition using late hill-climbing based memetic algorithm
    Ghosh, Manosij
    Kundu, Tuhin
    Ghosh, Dipayan
    Sarkar, Ram
    MULTIMEDIA TOOLS AND APPLICATIONS, 2019, 78 (18) : 25753 - 25779
  • [8] Enhancing EEG-Based Mental Stress State Recognition Using an Improved Hybrid Feature Selection Algorithm
    Hag, Ala
    Handayani, Dini
    Altalhi, Maryam
    Pillai, Thulasyammal
    Mantoro, Teddy
    Kit, Mun Hou
    Al-Shargie, Fares
    SENSORS, 2021, 21 (24)
  • [9] EEG-based depression recognition using feature selection method with fuzzy label
    Li, Yalin
    Fang, Yixian
    Ren, Xiuxiu
    Gao, Leiting
    JOURNAL OF KING SAUD UNIVERSITY-COMPUTER AND INFORMATION SCIENCES, 2024, 36 (03)
  • [10] Feature Selection using Binary-ABC Algorithm for DWT-based Face Recognition
    Akhil, Malepati Bala Siva Sai
    Aashish, P.
    Manikantan, K.
    2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND COMPUTING RESEARCH (ICCIC), 2015, : 30 - 36