How Molecular Size Impacts RMSD Applications in Molecular Dynamics Simulations

被引:411
作者
Sargsyan, Karen [1 ]
Grauffel, Cedric [1 ]
Lim, Carmay [1 ,2 ]
机构
[1] Acad Sinica, Inst Biomed Sci, Taipei 115, Taiwan
[2] Natl Tsinghua Univ, Dept Chem, Hsinchu 300, Taiwan
关键词
PROTEIN DOCKING; 3-DIMENSIONAL STRUCTURES; FORCE-FIELD; TRAJECTORIES; CHARMM; LIBRARY;
D O I
10.1021/acs.jctc.7b00028
中图分类号
O64 [物理化学(理论化学)、化学物理学];
学科分类号
070304 ; 081704 ;
摘要
The root-mean-square deviation (RMSD) is a similarity measure widely used in analysis of macromolecular structures and dynamics. As increasingly larger macro-molecular systems are being studied, dimensionality effects such as the "curse of dimensionality" (a diminishing ability to discriminate pairwise differences between conformations with increasing system size) may exist and significantly impact RMSD-based analyses. For such large bimolecular systems, whether the RMSD or other alternative similarity measures might suffer from this "curse" and lose the ability to discriminate different macromolecular structures had not been explicitly addressed. Here, we show such dimensionality effects for both weighted and nonweighted RMSD schemes. We also provide a mechanism for the emergence of the "curse of dimensionality" for RMSD from the law of large numbers by showing that the conformational distributions from which RMSDs are calculated become increasingly similar as the system size increases. Our findings suggest the use of weighted RMSD schemes for small proteins (less than 200 residues) and nonweighted RMSD for larger proteins when analyzing molecular dynamics trajectories.
引用
收藏
页码:1518 / 1524
页数:7
相关论文
共 25 条
[1]   Clustering high dimensional data [J].
Assent, Ira .
WILEY INTERDISCIPLINARY REVIEWS-DATA MINING AND KNOWLEDGE DISCOVERY, 2012, 2 (04) :340-350
[2]   Optimization of the Additive CHARMM All-Atom Protein Force Field Targeting Improved Sampling of the Backbone φ, ψ and Side-Chain χ1 and χ2 Dihedral Angles [J].
Best, Robert B. ;
Zhu, Xiao ;
Shim, Jihyun ;
Lopes, Pedro E. M. ;
Mittal, Jeetain ;
Feig, Michael ;
MacKerell, Alexander D., Jr. .
JOURNAL OF CHEMICAL THEORY AND COMPUTATION, 2012, 8 (09) :3257-3273
[3]   CHARMM - A PROGRAM FOR MACROMOLECULAR ENERGY, MINIMIZATION, AND DYNAMICS CALCULATIONS [J].
BROOKS, BR ;
BRUCCOLERI, RE ;
OLAFSON, BD ;
STATES, DJ ;
SWAMINATHAN, S ;
KARPLUS, M .
JOURNAL OF COMPUTATIONAL CHEMISTRY, 1983, 4 (02) :187-217
[4]   Biopython']python: freely available Python']Python tools for computational molecular biology and bioinformatics [J].
Cock, Peter J. A. ;
Antao, Tiago ;
Chang, Jeffrey T. ;
Chapman, Brad A. ;
Cox, Cymon J. ;
Dalke, Andrew ;
Friedberg, Iddo ;
Hamelryck, Thomas ;
Kauff, Frank ;
Wilczynski, Bartek ;
de Hoon, Michiel J. L. .
BIOINFORMATICS, 2009, 25 (11) :1422-1423
[5]   A 2ND GENERATION FORCE-FIELD FOR THE SIMULATION OF PROTEINS, NUCLEIC-ACIDS, AND ORGANIC-MOLECULES [J].
CORNELL, WD ;
CIEPLAK, P ;
BAYLY, CI ;
GOULD, IR ;
MERZ, KM ;
FERGUSON, DM ;
SPELLMEYER, DC ;
FOX, T ;
CALDWELL, JW ;
KOLLMAN, PA .
JOURNAL OF THE AMERICAN CHEMICAL SOCIETY, 1995, 117 (19) :5179-5197
[6]   Gaussian-weighted RMSD superposition of proteins: A structural comparison for flexible proteins and predicted protein structures [J].
Damm, Kelly L. ;
Carlson, Heather A. .
BIOPHYSICAL JOURNAL, 2006, 90 (12) :4558-4573
[7]   Comparing molecules and solids across structural and alchemical space [J].
De, Sandip ;
Bartok, Albert P. ;
Csanyi, Gabor ;
Ceriotti, Michele .
PHYSICAL CHEMISTRY CHEMICAL PHYSICS, 2016, 18 (20) :13754-13769
[8]   Human Frataxin Folds Via an Intermediate State. Role of the C-Terminal Region [J].
Faraj, Santiago E. ;
Gonzalez-Lebrero, Rodolfo M. ;
Roman, Ernesto A. ;
Santos, Javier .
SCIENTIFIC REPORTS, 2016, 6
[9]   Geometrical comparison of two protein structures using Wigner-D functions [J].
Fathi, S. M. Saberi ;
White, Diana T. ;
Tuszynski, Jack A. .
PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 2014, 82 (10) :2756-2769
[10]   SOLUTION FOR BEST ROTATION TO RELATE 2 SETS OF VECTORS [J].
KABSCH, W .
ACTA CRYSTALLOGRAPHICA SECTION A, 1976, 32 (SEP1) :922-923