Prediction of Protein Tertiary Structure via Regularized Template Classification Techniques

被引：3

作者：

Alvarez-Machancoses, Oscar ^{[1
]}

Luis Fernandez-Martinez, Juan ^{[1
]}

Kloczkowski, Andrzej ^{[2
,3
]}

机构：

[1] Univ Oviedo, Dept Math, Grp Inverse Problems Optimizat & Machine Learning, C Federico Garcia Lorca 18, Oviedo 33007, Spain

[2] Nationwide Childrens Hosp, Battelle Ctr Math Med, Columbus, OH 43210 USA

[3] Ohio State Univ, Dept Pediat, Columbus, OH 43210 USA

来源：

MOLECULES | 2020年 / 25卷 / 11期

关键词：

Protein Tertiary Structure; LDA classification; PSO; uncertainty analysis; STRUCTURE ALIGNMENT; MULTIPLE STRUCTURE; SEQUENCE; DATABASE; MODEL; GENE; REFINEMENT; ACCURACY;

D O I：

10.3390/molecules25112467

中图分类号：

Q5 [生物化学]; Q7 [分子生物学];

学科分类号：

071010 ; 081704 ;

摘要：

We discuss the use of the regularized linear discriminant analysis (LDA) as a model reduction technique combined with particle swarm optimization (PSO) in protein tertiary structure prediction, followed by structure refinement based on singular value decomposition (SVD) and PSO. The algorithm presented in this paper corresponds to the category of template-based modeling. The algorithm performs a preselection of protein templates before constructing a lower dimensional subspace via a regularized LDA. The protein coordinates in the reduced spaced are sampled using a highly explorative optimization algorithm, regressive-regressive PSO (RR-PSO). The obtained structure is then projected onto a reduced space via singular value decomposition and further optimized via RR-PSO to carry out a structure refinement. The final structures are similar to those predicted by best structure prediction tools, such as Rossetta and Zhang servers. The main advantage of our methodology is that alleviates the ill-posed character of protein structure prediction problems related to high dimensional optimization. It is also capable of sampling a wide range of conformational space due to the application of a regularized linear discriminant analysis, which allows us to expand the differences over a reduced basis set.

引用

页数：17

共 71 条

[1] Combining multiple structure and sequence alignments to improve sequence detection and alignment: Application to the SH2 domains of Janus kinases [J].