MR2CPPIS: Accurate prediction of protein–protein interaction sites based on multi-scale Res2Net with coordinate attention mechanism

被引:2
作者
Gong Y. [1 ,2 ]
Li R. [1 ,2 ]
Liu Y. [1 ,2 ]
Wang J. [3 ]
Cao B. [4 ]
Fu X. [1 ]
Li R. [1 ,2 ]
Chen D.Z. [5 ]
机构
[1] College of Computer Science and Electronic Engineering, Hunan University, Changsha
[2] Hunan Engineering Research Center of Advanced Embedded Computing and Intelligent Medical Systems, Hunan University, Changsha
[3] Peng Cheng Laboratory, Shenzhen
[4] College of Information and Electronic Engineering, Hunan City University, Yiyang
[5] Department of Computer Science and Engineering, University of Notre Dame, Notre Dame, 46556, IN
基金
中国国家自然科学基金;
关键词
Coordinate attention; Multi-scale; PPI sites prediction; Res2Net; Sequence-based method;
D O I
10.1016/j.compbiomed.2024.108543
中图分类号
学科分类号
摘要
Proteins play a vital role in various biological processes and achieve their functions through protein–protein interactions (PPIs). Thus, accurate identification of PPI sites is essential. Traditional biological methods for identifying PPIs are costly, labor-intensive, and time-consuming. The development of computational prediction methods for PPI sites offers promising alternatives. Most known deep learning (DL) methods employ layer-wise multi-scale CNNs to extract features from protein sequences. But, these methods usually neglect the spatial positions and hierarchical information embedded within protein sequences, which are actually crucial for PPI site prediction. In this paper, we propose MR2CPPIS, a novel sequence-based DL model that utilizes the multi-scale Res2Net with coordinate attention mechanism to exploit multi-scale features and enhance PPI site prediction capability. We leverage the multi-scale Res2Net to expand the receptive field for each network layer, thus capturing multi-scale information of protein sequences at a granular level. To further explore the local contextual features of each target residue, we employ a coordinate attention block to characterize the precise spatial position information, enabling the network to effectively extract long-range dependencies. We evaluate our MR2CPPIS on three public benchmark datasets (Dset 72, Dset 186, and PDBset 164), achieving state-of-the-art performance. The source codes are available at https://github.com/YyinGong/MR2CPPIS. © 2024 Elsevier Ltd
引用
收藏
相关论文
共 40 条
[31]  
Hou Q., Zhou D., Feng J., Coordinate attention for efficient mobile network design, pp. 13713-13722, (2021)
[32]  
Murzin A.G., Brenner S.E., Hubbard T., Chothia C., SCOP: A structural classification of proteins database for the investigation of sequences and structures, J. Mol. Biol., 247, 4, pp. 536-540, (1995)
[33]  
Tusnady G.E., Dosztanyi Z., Simon I., Transmembrane proteins in the Protein Data Bank: Identification and classification, Bioinformatics, 20, 17, pp. 2964-2972, (2004)
[34]  
Kyte J., Doolittle R.F., A simple method for displaying the hydropathic character of a protein, J. Mol. Biol., 157, 1, pp. 105-132, (1982)
[35]  
Meiler J., Muller M., Zeidler A., Schmaschke F., Generation and evaluation of dimension-reduced amino acid parameter representations by artificial neural networks, Mol. Model. Annu., 7, 9, pp. 360-369, (2001)
[36]  
Altschul S.F., Madden T.L., Schaffer A.A., Zhang J., Zhang Z., Miller W., Lipman D.J., Gapped BLAST and PSI-BLAST: A new generation of protein database search programs, Nucleic Acids Res., 25, 17, pp. 3389-3402, (1997)
[37]  
Sellami A., Hwang H., A robust deep convolutional neural network with batch-weighted loss for heartbeat classification, Expert Syst. Appl., 122, pp. 75-84, (2019)
[38]  
Zhang J., Kurgan L., SCRIBER: Accurate and partner type-specific prediction of protein-binding residues from proteins sequences, Bioinformatics, 35, 14, pp. i343-i353, (2019)
[39]  
Mou M., Pan Z., Zhou Z., Zheng L., Zhang H., Shi S., Li F., Sun X., Zhu F., A transformer-based ensemble framework for the prediction of protein–protein interaction sites, Research, 6, (2023)
[40]  
Hu J., Shen L., Sun G., pp. 7132-7141, (2018)