Combining pairwise structural similarity and deep learning interface contact prediction to estimate protein complex model accuracy in CASP15

被引:13
作者
Roy, Raj S. [1 ]
Liu, Jian [1 ]
Giri, Nabin [1 ]
Guo, Zhiye [1 ]
Cheng, Jianlin [1 ]
机构
[1] Univ Missouri, Dept Elect Engn & Comp Sci, NextGen Precis Hlth, Columbia, MO 65211 USA
基金
美国国家卫生研究院; 美国国家科学基金会;
关键词
deep learning; estimation of protein model accuracy; protein interface contact prediction; protein model quality assessment; protein quaternary structure prediction; QUALITY; DOCKING; POTENTIALS; NETWORKS;
D O I
10.1002/prot.26542
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Estimating the accuracy of quaternary structural models of protein complexes and assemblies (EMA) is important for predicting quaternary structures and applying them to studying protein function and interaction. The pairwise similarity between structural models is proven useful for estimating the quality of protein tertiary structural models, but it has been rarely applied to predicting the quality of quaternary structural models. Moreover, the pairwise similarity approach often fails when many structural models are of low quality and similar to each other. To address the gap, we developed a hybrid method (MULTICOM_qa) combining a pairwise similarity score (PSS) and an interface contact probability score (ICPS) based on the deep learning inter-chain contact prediction for estimating protein complex model accuracy. It blindly participated in the 15th Critical Assessment of Techniques for Protein Structure Prediction (CASP15) in 2022 and performed very well in estimating the global structure accuracy of assembly models. The average per-target correlation coefficient between the model quality scores predicted by MULTICOM_qa and the true quality scores of the models of CASP15 assembly targets is 0.66. The average per-target ranking loss in using the predicted quality scores to rank the models is 0.14. It was able to select good models for most targets. Moreover, several key factors (i.e., target difficulty, model sampling difficulty, skewness of model quality, and similarity between good/bad models) for EMA are identified and analyzed. The results demonstrate that combining the multi-model method (PSS) with the complementary single-model method (ICPS) is a promising approach to EMA.
引用
收藏
页码:1889 / 1902
页数:14
相关论文
共 50 条
[1]   Accurate prediction of protein structures and interactions using a three-track neural network [J].
Baek, Minkyung ;
DiMaio, Frank ;
Anishchenko, Ivan ;
Dauparas, Justas ;
Ovchinnikov, Sergey ;
Lee, Gyu Rie ;
Wang, Jue ;
Cong, Qian ;
Kinch, Lisa N. ;
Schaeffer, R. Dustin ;
Millan, Claudia ;
Park, Hahnbeom ;
Adams, Carson ;
Glassman, Caleb R. ;
DeGiovanni, Andy ;
Pereira, Jose H. ;
Rodrigues, Andria V. ;
van Dijk, Alberdina A. ;
Ebrecht, Ana C. ;
Opperman, Diederik J. ;
Sagmeister, Theo ;
Buhlheller, Christoph ;
Pavkov-Keller, Tea ;
Rathinaswamy, Manoj K. ;
Dalwadi, Udit ;
Yip, Calvin K. ;
Burke, John E. ;
Garcia, K. Christopher ;
Grishin, Nick V. ;
Adams, Paul D. ;
Read, Randy J. ;
Baker, David .
SCIENCE, 2021, 373 (6557) :871-+
[2]   DockQ: A Quality Measure for Protein-Protein Docking Models [J].
Basu, Sankar ;
Wallner, Bjorn .
PLOS ONE, 2016, 11 (08)
[3]   Predicting the structure of large protein complexes using AlphaFold and Monte Carlo tree search [J].
Bryant, Patrick ;
Pozzati, Gabriele ;
Zhu, Wensi ;
Shenoy, Aditi ;
Kundrotas, Petras ;
Elofsson, Arne .
NATURE COMMUNICATIONS, 2022, 13 (01)
[4]   3D-equivariant graph neural networks for protein model quality assessment [J].
Chen, Chen ;
Chen, Xiao ;
Morehead, Alex ;
Wu, Tianqi ;
Cheng, Jianlin .
BIOINFORMATICS, 2023, 39 (01)
[5]   ZDOCK: An initial-stage protein-docking algorithm [J].
Chen, R ;
Li, L ;
Weng, ZP .
PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 2003, 52 (01) :80-87
[6]  
Chen X., 2022, BIORXIV, DOI [10.1101/2022.05.19.492741, DOI 10.1101/2022.05.19.492741]
[7]   Estimation of model accuracy in CASP13 [J].
Chene, Jianlin ;
Choe, Myong-Ho ;
Elofsson, Arne ;
Han, Kun-Sop ;
Hoe, Jie ;
Maghrabi, Ali H. A. ;
McGuffin, Liam J. ;
Menendez-Hurtado, David ;
Olechnovic, Klinnent ;
Schwede, Torsten ;
Studer, Gabriel ;
Uziela, Karolis ;
Venclovas, Ceslovas ;
Wallner, Bjorn .
PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 2019, 87 (12) :1361-1377
[8]   Prediction of global and local quality of CASP8 models by MULTICOM series [J].
Cheng, Jianlin ;
Wang, Zheng ;
Tegge, Allison N. ;
Eickholt, Jesse .
PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 2009, 77 :181-184
[9]   Modeling of protein complexes in CASP14 with emphasis on the interaction interface prediction [J].
Dapkunas, Justas ;
Olechnovic, Kliment ;
Venclovas, Ceslovas .
PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 2021, 89 (12) :1834-1843
[10]   Predicting protein residue-residue contacts using deep networks and boosting [J].
Eickholt, Jesse ;
Cheng, Jianlin .
BIOINFORMATICS, 2012, 28 (23) :3066-3072