DeepUMQA: ultrafast shape recognition-based protein model quality assessment using deep learning

被引:36
作者
Guo, Sai-Sai [1 ]
Liu, Jun [1 ]
Zhou, Xiao-Gen [1 ]
Zhang, Gui-Jun [1 ]
机构
[1] Zhejiang Univ Technol, Coll Informat Engn, Hangzhou 310023, Peoples R China
关键词
LOCAL QUALITY; PREDICTION;
D O I
10.1093/bioinformatics/btac056
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: Protein model quality assessment is a key component of protein structure prediction. In recent research, the voxelization feature was used to characterize the local structural information of residues, but it may be insufficient for describing residue-level topological information. Design features that can further reflect residue-level topology when combined with deep learning methods are therefore crucial to improve the performance of model quality assessment. Results: We developed a deep-learning method, DeepUMQA, based on Ultrafast Shape Recognition (USR) for the residue-level single-model quality assessment. In the framework of the deep residual neural network, the residuelevel USR feature was introduced to describe the topological relationship between the residue and overall structure by calculating the first moment of a set of residue distance sets and then combined with 1D, 2D and voxelization features to assess the quality of the model. Experimental results on the CASP13, CASP14 test datasets and CAMEO blind test show that USR could supplement the voxelization features to comprehensively characterize residue structure information and significantly improve model assessment accuracy. The performance of DeepUMQA ranks among the top during the state-of-the-art single-model quality assessment methods, including ProQ2, ProQ3, ProQ3D, Ornate, VoroMQA, ProteinGCN, ResNetQA, QDeep, GraphQA, ModFOLD6, ModFOLD7, ModFOLD8, QMEAN3, QMEANDisCo3 and DeepAccNet.
引用
收藏
页码:1895 / 1903
页数:9
相关论文
共 63 条
[1]   AlphaFold at CASP13 [J].
AlQuraishi, Mohammed .
BIOINFORMATICS, 2019, 35 (22) :4862-4865
[2]   GraphQA: protein model quality assessment using graph convolutional networks [J].
Baldassarre, Federico ;
Hurtado, David Menendez ;
Elofsson, Arne ;
Azizpour, Hossein .
BIOINFORMATICS, 2021, 37 (03) :360-366
[3]  
Ballester PJ, 2007, J COMPUT CHEM, V28, P1711, DOI 10.1002/JCC.20681
[4]   QMEAN: A comprehensive scoring function for model quality assessment [J].
Benkert, Pascal ;
Tosatto, Silvio C. E. ;
Schomburg, Dietmar .
PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 2008, 71 (01) :261-277
[5]  
Bolboaca S-D, 2006, Leonardo J Sci, V5, P179
[6]   QAcon: single model quality assessment using protein structural and contact information with machine learning techniques [J].
Cao, Renzhi ;
Adhikari, Badri ;
Bhattacharya, Debswapna ;
Sun, Miao ;
Hou, Jie ;
Cheng, Jianlin .
BIOINFORMATICS, 2017, 33 (04) :586-588
[7]   Estimation of model accuracy in CASP13 [J].
Chene, Jianlin ;
Choe, Myong-Ho ;
Elofsson, Arne ;
Han, Kun-Sop ;
Hoe, Jie ;
Maghrabi, Ali H. A. ;
McGuffin, Liam J. ;
Menendez-Hurtado, David ;
Olechnovic, Klinnent ;
Schwede, Torsten ;
Studer, Gabriel ;
Uziela, Karolis ;
Venclovas, Ceslovas ;
Wallner, Bjorn .
PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 2019, 87 (12) :1361-1377
[8]   Prediction of global and local quality of CASP8 models by MULTICOM series [J].
Cheng, Jianlin ;
Wang, Zheng ;
Tegge, Allison N. ;
Eickholt, Jesse .
PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 2009, 77 :181-184
[9]   Relaxation of backbone bond geometry improves protein energy landscape modeling [J].
Conway, Patrick ;
Tyka, Michael D. ;
DiMaio, Frank ;
Konerding, David E. ;
Baker, David .
PROTEIN SCIENCE, 2014, 23 (01) :47-55
[10]   3D-Jury: a simple approach to improve protein structure predictions [J].
Ginalski, K ;
Elofsson, A ;
Fischer, D ;
Rychlewski, L .
BIOINFORMATICS, 2003, 19 (08) :1015-1018