Prediction of Protein Tertiary Structure via Regularized Template Classification Techniques

被引:3
作者
Alvarez-Machancoses, Oscar [1 ]
Luis Fernandez-Martinez, Juan [1 ]
Kloczkowski, Andrzej [2 ,3 ]
机构
[1] Univ Oviedo, Dept Math, Grp Inverse Problems Optimizat & Machine Learning, C Federico Garcia Lorca 18, Oviedo 33007, Spain
[2] Nationwide Childrens Hosp, Battelle Ctr Math Med, Columbus, OH 43210 USA
[3] Ohio State Univ, Dept Pediat, Columbus, OH 43210 USA
关键词
Protein Tertiary Structure; LDA classification; PSO; uncertainty analysis; STRUCTURE ALIGNMENT; MULTIPLE STRUCTURE; SEQUENCE; DATABASE; MODEL; GENE; REFINEMENT; ACCURACY;
D O I
10.3390/molecules25112467
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
We discuss the use of the regularized linear discriminant analysis (LDA) as a model reduction technique combined with particle swarm optimization (PSO) in protein tertiary structure prediction, followed by structure refinement based on singular value decomposition (SVD) and PSO. The algorithm presented in this paper corresponds to the category of template-based modeling. The algorithm performs a preselection of protein templates before constructing a lower dimensional subspace via a regularized LDA. The protein coordinates in the reduced spaced are sampled using a highly explorative optimization algorithm, regressive-regressive PSO (RR-PSO). The obtained structure is then projected onto a reduced space via singular value decomposition and further optimized via RR-PSO to carry out a structure refinement. The final structures are similar to those predicted by best structure prediction tools, such as Rossetta and Zhang servers. The main advantage of our methodology is that alleviates the ill-posed character of protein structure prediction problems related to high dimensional optimization. It is also capable of sampling a wide range of conformational space due to the application of a regularized linear discriminant analysis, which allows us to expand the differences over a reduced basis set.
引用
收藏
页数:17
相关论文
共 71 条
[1]   Combining multiple structure and sequence alignments to improve sequence detection and alignment: Application to the SH2 domains of Janus kinases [J].
Al-Lazikani, B ;
Sheinerman, FB ;
Honig, B .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2001, 98 (26) :14796-14801
[2]   Predicting protein tertiary structure and its uncertainty analysis via particle swarm sampling [J].
Alvarez, Oscar ;
Luis Fernandez-Martinez, Juan ;
Cernea Corbeanu, Ana ;
Fernandez-Muniz, Zulima ;
Kloczkowski, Andrzej .
JOURNAL OF MOLECULAR MODELING, 2019, 25 (03)
[3]  
alvarez-Machancoses, 2018, LECT NOTES COMPUTER, V10813, P211
[4]  
alvarez-Machancoses, 2018, J BIOINF COMPUT BIOL, V16, DOI 10.1142/S021972001850005129566640
[5]   Data growth and its impact on the SCOP database: new developments [J].
Andreeva, Antonina ;
Howorth, Dave ;
Chandonia, John-Marc ;
Brenner, Steven E. ;
Hubbard, Tim J. P. ;
Chothia, Cyrus ;
Murzin, Alexey G. .
NUCLEIC ACIDS RESEARCH, 2008, 36 :D419-D425
[6]   Automated server predictions in CASP7 [J].
Battey, James N. D. ;
Kopp, Jurgen ;
Bordoli, Lorenza ;
Read, Randy J. ;
Clarke, Neil D. ;
Schwede, Torsten .
PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 2007, 69 :68-82
[7]   Assessing sequence comparison methods with reliable structurally identified distant evolutionary relationships [J].
Brenner, SE ;
Chothia, C ;
Hubbard, TJP .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 1998, 95 (11) :6073-6078
[8]   PREDICTION OF THE FOLDING OF SHORT POLYPEPTIDE SEGMENTS BY UNIFORM CONFORMATIONAL SAMPLING [J].
BRUCCOLERI, RE ;
KARPLUS, M .
BIOPOLYMERS, 1987, 26 (01) :137-168
[9]  
Cernea A., 2018, COMPUTER VISION, V10814, P15
[10]   Homology modeling using parametric alignment ensemble generation with consensus and energy-based model selection [J].
Chivian, Dylan ;
Baker, David .
NUCLEIC ACIDS RESEARCH, 2006, 34 (17)