Deep Learning for Deep Chemistry: Optimizing the Prediction of Chemical Patterns

被引:111
作者
Cova, Tania F. G. G. [1 ]
Pais, Alberto A. C. C. [1 ]
机构
[1] Univ Coimbra, Fac Sci & Technol, Dept Chem, Coimbra Chem Ctr,CQC, Coimbra, Portugal
来源
FRONTIERS IN CHEMISTRY | 2019年 / 7卷
关键词
machine-learning; deep-learning; optimization; models; molecular simulation; chemistry; NEURAL-NETWORKS; MOLECULAR-DYNAMICS; QUANTUM-MECHANICS; GENERATIVE MODELS; BIG DATA; MACHINE; COMPUTER; DESIGN; OPTIMIZATION; DISCOVERY;
D O I
10.3389/fchem.2019.00809
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
Computational Chemistry is currently a synergistic assembly between ab initio calculations, simulation, machine learning (ML) and optimization strategies for describing, solving and predicting chemical data and related phenomena. These include accelerated literature searches, analysis and prediction of physical and quantum chemical properties, transition states, chemical structures, chemical reactions, and also new catalysts and drug candidates. The generalization of scalability to larger chemical problems, rather than specialization, is now the main principle for transforming chemical tasks in multiple fronts, for which systematic and cost-effective solutions have benefited from ML approaches, including those based on deep learning (e.g. quantum chemistry, molecular screening, synthetic route design, catalysis, drug discovery). The latter class of ML algorithms is capable of combining raw input into layers of intermediate features, enabling bench-to-bytes designs with the potential to transform several chemical domains. In this review, the most exciting developments concerning the use of ML in a range of different chemical scenarios are described. A range of different chemical problems and respective rationalization, that have hitherto been inaccessible due to the lack of suitable analysis tools, is thus detailed, evidencing the breadth of potential applications of these emerging multidimensional approaches. Focus is given to the models, algorithms and methods proposed to facilitate research on compound design and synthesis, materials design, prediction of binding, molecular activity, and soft matter behavior. The information produced by pairing Chemistry and ML, through data-driven analyses, neural network predictions and monitoring of chemical systems, allows (i) prompting the ability to understand the complexity of chemical data, (ii) streamlining and designing experiments, (ii) discovering new molecular targets and materials, and also (iv) planning or rethinking forthcoming chemical challenges. In fact, optimization engulfs all these tasks directly.
引用
收藏
页数:22
相关论文
共 171 条
  • [1] Two-class support vector machine with new kernel function based on paths of features for predicting chemical activity
    Abu El-Atta, Ahmed H.
    Hassanien, Aboul Ella
    [J]. INFORMATION SCIENCES, 2017, 403 : 42 - 54
  • [2] On the use of neural network ensembles in QSAR and QSPR
    Agrafiotis, DK
    Cedeño, W
    Lobanov, VS
    [J]. JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES, 2002, 42 (04): : 903 - 911
  • [3] Design and Optimization of Catalysts Based on Mechanistic Insights Derived from Quantum Chemical Reaction Modeling
    Ahn, Seihwan
    Hong, Mannkyu
    Sundararajan, Mahesh
    Ess, Daniel H.
    Baik, Mu-Hyun
    [J]. CHEMICAL REVIEWS, 2019, 119 (11) : 6509 - 6560
  • [4] Predicting reaction performance in C-N cross-coupling using machine learning
    Ahneman, Derek T.
    Estrada, Jesus G.
    Lin, Shishi
    Dreher, Spencer D.
    Doyle, Abigail G.
    [J]. SCIENCE, 2018, 360 (6385) : 186 - 190
  • [5] Predicting the sequence specificities of DNA- and RNA-binding proteins by deep learning
    Alipanahi, Babak
    Delong, Andrew
    Weirauch, Matthew T.
    Frey, Brendan J.
    [J]. NATURE BIOTECHNOLOGY, 2015, 33 (08) : 831 - +
  • [6] [Anonymous], ARXIV190309010
  • [7] [Anonymous], 1995, REV COMPUTATIONAL CH, DOI DOI 10.1002/9780470125830
  • [8] [Anonymous], 2017, Moleculenet: A benchmark for molecular machine learning
  • [9] [Anonymous], 2015, Nature, DOI [10.1038/nature14539, DOI 10.1038/NATURE14539]
  • [10] [Anonymous], MACH LEARN DAT MIN R