Molecular dynamics simulations for genetic interpretation in protein coding regions: where we are, where to go and when

被引:16
作者
Galano-Frutos, Juan J. [1 ]
Garcia-Cebollada, Helena [2 ,3 ]
Sancho, Javier [3 ]
机构
[1] Univ Zaragoza, BIFI, Prot Folding & Mol Design ProtMol Grp, Zaragoza, Spain
[2] BIFI, Zaragoza, Spain
[3] Prot Folding & Mol Design ProtMol Grp, Zaragoza, Spain
关键词
genetic interpretation; mutation-effect prediction tools; single amino acid variation; molecular dynamics simulation; protein stability; large-scale phenotype prediction; AMINO-ACID SUBSTITUTIONS; PREDICTING STABILITY CHANGES; GENOME-WIDE ASSOCIATION; WEB-BASED TOOL; MISSENSE SUBSTITUTIONS; INTERACTION NETWORKS; COMPARING PROTEINS; SOMATIC MUTATIONS; WHOLE-EXOME; DISEASE;
D O I
10.1093/bib/bbz146
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
The increasing ease with which massive genetic information can be obtained from patients or healthy individuals has stimulated the development of interpretive bioinformatics tools as aids in clinical practice. Most such tools analyze evolutionary information and simple physical-chemical properties to predict whether replacement of one amino acid residue with another will be tolerated or cause disease. Those approaches achieve up to 80-85% accuracy as binary classifiers (neutral/pathogenic). As such accuracy is insufficient for medical decision to be based on, and it does not appear to be increasing, more precise methods, such as full-atom molecular dynamics (MD) simulations in explicit solvent, are also discussed. Then, to describe the goal of interpreting human genetic variations at large scale through MD simulations, we restrictively refer to all possible protein variants carrying single-amino-acid substitutions arising from single-nucleotide variations as the human variome. We calculate its size and develop a simple model that allows calculating the simulation time needed to have a 0.99 probability of observing unfolding events of any unstable variant. The knowledge of that time enables performing a binary classification of the variants (stable-potentially neutral/unstable-pathogenic). Our model indicates that the human variome cannot be simulated with present computing capabilities. However, if they continue to increase as per Moore's law, it could be simulated (at 65 degrees C) spending only 3 years in the task if we started in 2031. The simulation of individual protein variomes is achievable in short times starting at present. International coordination seems appropriate to embark upon massive MD simulations of protein variants.
引用
收藏
页码:3 / 19
页数:17
相关论文
共 204 条
[1]   Response to: Statistical Analysis of Missense Mutation Classifiers [J].
Acharya, Vishal ;
Nagarajaram, Hampapathalu A. .
HUMAN MUTATION, 2013, 34 (02) :407-407
[2]   Hansa: An Automated Method for Discriminating Disease and Neutral Human nsSNPs [J].
Acharya, Vishal ;
Nagarajaram, Hampapathalu A. .
HUMAN MUTATION, 2012, 33 (02) :332-337
[3]   A method and server for predicting damaging missense mutations [J].
Adzhubei, Ivan A. ;
Schmidt, Steffen ;
Peshkin, Leonid ;
Ramensky, Vasily E. ;
Gerasimova, Anna ;
Bork, Peer ;
Kondrashov, Alexey S. ;
Sunyaev, Shamil R. .
NATURE METHODS, 2010, 7 (04) :248-249
[4]   InterPreTS: protein Interaction Prediction through Tertiary Structure [J].
Aloy, P ;
Russell, RB .
BIOINFORMATICS, 2003, 19 (01) :161-162
[5]   A map of human genome variation from population-scale sequencing [J].
Altshuler, David ;
Durbin, Richard M. ;
Abecasis, Goncalo R. ;
Bentley, David R. ;
Chakravarti, Aravinda ;
Clark, Andrew G. ;
Collins, Francis S. ;
De la Vega, Francisco M. ;
Donnelly, Peter ;
Egholm, Michael ;
Flicek, Paul ;
Gabriel, Stacey B. ;
Gibbs, Richard A. ;
Knoppers, Bartha M. ;
Lander, Eric S. ;
Lehrach, Hans ;
Mardis, Elaine R. ;
McVean, Gil A. ;
Nickerson, DebbieA. ;
Peltonen, Leena ;
Schafer, Alan J. ;
Sherry, Stephen T. ;
Wang, Jun ;
Wilson, Richard K. ;
Gibbs, Richard A. ;
Deiros, David ;
Metzker, Mike ;
Muzny, Donna ;
Reid, Jeff ;
Wheeler, David ;
Wang, Jun ;
Li, Jingxiang ;
Jian, Min ;
Li, Guoqing ;
Li, Ruiqiang ;
Liang, Huiqing ;
Tian, Geng ;
Wang, Bo ;
Wang, Jian ;
Wang, Wei ;
Yang, Huanming ;
Zhang, Xiuqing ;
Zheng, Huisong ;
Lander, Eric S. ;
Altshuler, David L. ;
Ambrogio, Lauren ;
Bloom, Toby ;
Cibulskis, Kristian ;
Fennell, Tim J. ;
Gabriel, Stacey B. .
NATURE, 2010, 467 (7319) :1061-1073
[6]   High Throughput Sequencing: An Overview of Sequencing Chemistry [J].
Ambardar, Sheetal ;
Gupta, Rikita ;
Trakroo, Deepika ;
Lal, Rup ;
Vakhlu, Jyoti .
INDIAN JOURNAL OF MICROBIOLOGY, 2016, 56 (04) :394-404
[7]   Exploring the complete mutational space of the LDL receptor LA5 domain using molecular dynamics: linking SNPs with disease phenotypes in familial hypercholesterolemia [J].
Angarica, Vladimir Espinosa ;
Orozco, Modesto ;
Sancho, Javier .
HUMAN MOLECULAR GENETICS, 2016, 25 (06) :1233-1246
[8]   Translation of pre-spliced RNAs in the nuclear compartment generates peptides for the MHC class I pathway [J].
Apcher, Sebastien ;
Millot, Guy ;
Daskalogianni, Chrysoula ;
Scherl, Alexander ;
Manoury, Benedicte ;
Fahraeus, Robin .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2013, 110 (44) :17951-17956
[9]   A new and updated resource for codon usage tables [J].
Athey, John ;
Alexaki, Aikaterini ;
Osipova, Ekaterina ;
Rostovtsev, Alexandre ;
Santana-Quintero, Luis V. ;
Katneni, Upendra ;
Simonyan, Vahan ;
Kimchi-Sarfaty, Chava .
BMC BIOINFORMATICS, 2017, 18
[10]   The Effects of Mutations on Protein Function: A Comparative Study of Three Databases of Mutations in Humans [J].
Azia, Ariel ;
Uversky, Vladimir N. ;
Horovitz, Amnon ;
Unger, Ron .
ISRAEL JOURNAL OF CHEMISTRY, 2013, 53 (3-4) :217-226