Manifold learning in atomistic simulations: a conceptual review

被引:11
作者
Rydzewski, Jakub [1 ]
Chen, Ming [2 ]
Valsson, Omar [3 ]
机构
[1] Nicolaus Copernicus Univ, Inst Phys, Fac Phys Astron & Informat, Grudziadzka 5, PL-87100 Torun, Poland
[2] Purdue Univ, Dept Chem, W Lafayette, IN 47907 USA
[3] Univ North Texas, Dept Chem, Denton, TX 76201 USA
来源
MACHINE LEARNING-SCIENCE AND TECHNOLOGY | 2023年 / 4卷 / 03期
关键词
molecular dynamics; dimensionality reduction; manifold learning; enhanced sampling; MARKOV STATE MODELS; MOLECULAR-DYNAMICS SIMULATIONS; NONLINEAR DIMENSIONALITY REDUCTION; FREE-ENERGY LANDSCAPES; COLLECTIVE VARIABLES; DIFFUSION MAPS; VARIATIONAL APPROACH; LAPLACIAN EIGENMAPS; ORDER PARAMETERS; T-SNE;
D O I
10.1088/2632-2153/ace81a
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Analyzing large volumes of high-dimensional data requires dimensionality reduction: finding meaningful low-dimensional structures hidden in their high-dimensional observations. Such practice is needed in atomistic simulations of complex systems where even thousands of degrees of freedom are sampled. An abundance of such data makes gaining insight into a specific physical problem strenuous. Our primary aim in this review is to focus on unsupervised machine learning methods that can be used on simulation data to find a low-dimensional manifold providing a collective and informative characterization of the studied process. Such manifolds can be used for sampling long-timescale processes and free-energy estimation. We describe methods that can work on datasets from standard and enhanced sampling atomistic simulations. Unlike recent reviews on manifold learning for atomistic simulations, we consider only methods that construct low-dimensional manifolds based on Markov transition probabilities between high-dimensional samples. We discuss these techniques from a conceptual point of view, including their underlying theoretical frameworks and possible limitations.
引用
收藏
页数:32
相关论文
共 241 条
[1]   Principal component analysis [J].
Abdi, Herve ;
Williams, Lynne J. .
WILEY INTERDISCIPLINARY REVIEWS-COMPUTATIONAL STATISTICS, 2010, 2 (04) :433-459
[2]   Markov state modeling reveals alternative unbinding pathways for peptide-MHC complexes [J].
Abella, Jayvee R. ;
Antunes, Dinler ;
Jackson, Kyle ;
Lizee, Gregory ;
Clementi, Cecilia ;
Kavraki, Lydia E. .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2020, 117 (48) :30610-30618
[3]   Enhanced Sampling in Molecular Dynamics Using Metadynamics, Replica-Exchange, and Temperature-Acceleration [J].
Abrams, Cameron ;
Bussi, Giovanni .
ENTROPY, 2014, 16 (01) :163-199
[4]   Spectral multidimensional scaling [J].
Aflalo, Yonathan ;
Kimmel, Ron .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2013, 110 (45) :18052-18057
[5]   Mapping the Substrate Recognition Pathway in Cytochrome P450 [J].
Ahalawat, Navjeet ;
Mondal, Jagannath .
JOURNAL OF THE AMERICAN CHEMICAL SOCIETY, 2018, 140 (50) :17743-17752
[6]   Application of time series analysis on molecular dynamics simulations of proteins: A study of different conformational spaces by principal component analysis [J].
Alakent, B ;
Doruker, P ;
Camurdan, MC .
JOURNAL OF CHEMICAL PHYSICS, 2004, 121 (10) :4759-4769
[7]  
[Anonymous], 2005, P 22 INT C MACHINE L
[8]  
[Anonymous], 2016, DISTILL, DOI [DOI 10.23915/DISTILL.00002, 10.23915/distill.00002]
[9]  
[Anonymous], 2009, 2009005 TICCTR TILB
[10]  
[Anonymous], 2004, P 21 INT C MACH LEAR