TAWFN: a deep learning framework for protein function prediction

被引:0
|
作者
Meng, Lu [1 ]
Wang, Xiaoran [1 ]
机构
[1] Northeastern Univ, Coll Informat Sci & Engn, 3-11 Wenhua Rd, Shenyang 110000, Liaoning, Peoples R China
基金
中国国家自然科学基金;
关键词
SEQUENCE; GENERATION;
D O I
10.1093/bioinformatics/btae571
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation Proteins play pivotal roles in biological systems, and precise prediction of their functions is indispensable for practical applications. Despite the surge in protein sequence data facilitated by high-throughput techniques, unraveling the exact functionalities of proteins still demands considerable time and resources. Currently, numerous methods rely on protein sequences for prediction, while methods targeting protein structures are scarce, often employing convolutional neural networks (CNN) or graph convolutional networks (GCNs) individually.Results To address these challenges, our approach starts from protein structures and proposes a method that combines CNN and GCN into a unified framework called the two-model adaptive weight fusion network (TAWFN) for protein function prediction. First, amino acid contact maps and sequences are extracted from the protein structure. Then, the sequence is used to generate one-hot encoded features and deep semantic features. These features, along with the constructed graph, are fed into the adaptive graph convolutional networks (AGCN) module and the multi-layer convolutional neural network (MCNN) module as needed, resulting in preliminary classification outcomes. Finally, the preliminary classification results are inputted into the adaptive weight computation network, where adaptive weights are calculated to fuse the initial predictions from both networks, yielding the final prediction result. To evaluate the effectiveness of our method, experiments were conducted on the PDBset and AFset datasets. For molecular function, biological process, and cellular component tasks, TAWFN achieved area under the precision-recall curve (AUPR) values of 0.718, 0.385, and 0.488 respectively, with corresponding Fmax scores of 0.762, 0.628, and 0.693, and Smin scores of 0.326, 0.483, and 0.454. The experimental results demonstrate that TAWFN exhibits promising performance, outperforming existing methods.Availability and implementation The TAWFN source code can be found at: https://github.com/ss0830/TAWFN.
引用
收藏
页数:9
相关论文
共 50 条
  • [31] A Deep Learning Framework for Predicting Protein Functions With Co-Occurrence of GO Terms
    Li, Min
    Shi, Wenbo
    Zhang, Fuhao
    Zeng, Min
    Li, Yaohang
    IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2023, 20 (02) : 833 - 842
  • [32] EITLEM-Kinetics: A deep-learning framework for kinetic parameter prediction of mutant enzymes
    Shen, Xiaowei
    Cui, Ziheng
    Long, Jianyu
    Zhang, Shiding
    Chen, Biqiang
    Tan, Tianwei
    CHEM CATALYSIS, 2024, 4 (09):
  • [33] Enhancing arrhythmia prediction through an adaptive deep reinforcement learning framework for ECG signal analysis
    Serhani, Mohamed Adel
    Ismail, Heba
    El-Kassabi, Hadeel T.
    Al Breiki, Hamda
    BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2025, 101
  • [34] A deep learning ensemble for function prediction of hypothetical proteins from pathogenic bacterial species
    Mishra, Sarthak
    Rastogi, Yash Pratap
    Jabin, Suraiya
    Kaur, Punit
    Amir, Mohammad
    Khatun, Shabnam
    COMPUTATIONAL BIOLOGY AND CHEMISTRY, 2019, 83
  • [35] PhosTransfer: A Deep Transfer Learning Framework for Kinase-Specific Phosphorylation Site Prediction in Hierarchy
    Xu, Ying
    Wilson, Campbell
    Leier, Andre
    Marquez-Lago, Tatiana T.
    Whisstock, James
    Song, Jiangning
    ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PAKDD 2020, PT II, 2020, 12085 : 384 - 395
  • [36] A Deep Retrieval-Enhanced Meta-Learning Framework for Enzyme Optimum pH Prediction
    Zhang, Liang
    Luo, Kuan
    Zhou, Ziyi
    Yu, Yuanxi
    Jiang, Fan
    Wu, Banghao
    Li, Mingchen
    Hong, Liang
    JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2025, : 3761 - 3770
  • [37] SUPERMAGO: Protein Function Prediction Based on Transformer Embeddings
    de Oliveira, Gabriel Bianchin
    Pedrini, Helio
    Dias, Zanoni
    PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 2024, : 981 - 996
  • [38] RBPsuite: RNA-protein binding sites prediction suite based on deep learning
    Pan, Xiaoyong
    Fang, Yi
    Li, Xianfeng
    Yang, Yang
    Shen, Hong-Bin
    BMC GENOMICS, 2020, 21 (01)
  • [39] ComplexContact: a web server for inter-protein contact prediction using deep learning
    Zeng, Hong
    Wang, Sheng
    Zhou, Tianming
    Zhao, Feifeng
    Li, Xiufeng
    Wu, Qing
    Xu, Jinbo
    NUCLEIC ACIDS RESEARCH, 2018, 46 (W1) : W432 - W437
  • [40] Analysis of deep learning methods for blind protein contact prediction in CASP12
    Wang, Sheng
    Sun, Siqi
    Xu, Jinbo
    PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 2018, 86 : 67 - 77