GAtor: A First-Principles Genetic Algorithm for Molecular Crystal Structure Prediction

Cited by: 87
Authors
Curtis, Farren [1]
Li, Xiayue [2, 3]
Rose, Timothy [3]
Vazquez-Mayagoitia, Alvaro [4]
Bhattacharya, Saswata [5]
Ghiringhelli, Luca M. [6]
Marom, Noa [1, 3, 7]
Affiliations
[1] Carnegie Mellon Univ, Dept Phys, Pittsburgh, PA 15213 USA
[2] Google, Mountain View, CA 94030 USA
[3] Carnegie Mellon Univ, Dept Mat Sci & Engn, Pittsburgh, PA 15213 USA
[4] Argonne Natl Lab, Argonne Leadership Comp Facil, Lemont, IL 60439 USA
[5] Indian Inst Technol Delhi, Dept Phys, Hauz Khas, New Delhi 110016, India
[6] Max Planck Gesell, Fritz Haber Inst, Faradayweg 4-6, D-14195 Berlin, Germany
[7] Carnegie Mellon Univ, Dept Chem, 4400 5th Ave, Pittsburgh, PA 15213 USA
Funding
National Science Foundation (United States);
Keywords
DENSITY-FUNCTIONAL THEORY; SOURCE EVOLUTIONARY ALGORITHM; SMALL ORGANIC-MOLECULES; DER-WAALS INTERACTIONS; BLIND TEST; NONCOVALENT INTERACTIONS; GEOMETRY OPTIMIZATION; ENERGY LANDSCAPES; 1ST PRINCIPLES; EXCHANGE;
DOI
10.1021/acs.jctc.7b01152
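Chinese Library Classification number
O64 [Physical chemistry (theoretical chemistry), chemical physics];
Discipline classification code
070304; 081704;
Abstract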
We present the implementation of GAtor, a massively parallel, first-principles genetic algorithm (GA) for molecular crystal structure prediction. GAtor is written in Python and currently interfaces with the FHI-aims code to perform local optimizations and energy evaluations using dispersion-inclusive density functional theory (DFT). GAtor offers a variety of fitness evaluation, selection, crossover, and mutation schemes. Breeding operators designed specifically for molecular crystals provide a balance between exploration and exploitation. Evolutionary niching is implemented in GAtor by using machine learning to cluster the dynamically updated population by structural similarity and then employing a cluster-based fitness function. Evolutionary niching promotes uniform sampling of the potential energy surface by evolving several subpopulations, which helps overcome initial pool biases and selection biases (genetic drift). The various settings offered by GAtor increase the likelihood of locating numerous low-energy minima, including those located in disconnected, hard-to-reach regions of the potential energy landscape. The best structures generated are re-relaxed and re-ranked using a hierarchy of increasingly accurate DFT functionals and dispersion methods. GAtor is applied to a chemically diverse set of four past blind test targets, characterized by different types of intermolecular interactions. The experimentally observed structures and other low-energy structures are found for all four targets. In particular, for Target II, 5-cyano-3-hydroxythiophene, the top-ranked putative crystal structure is a Z' = 2 structure with P-1 symmetry and a scaffold packing motif, which has not been reported previously.
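The cluster-based fitness idea in the abstract can be illustrated with a minimal Python sketch. It is not GAtor's code or API: the function names, the normalized energy fitness, and the choice to divide each structure's fitness by the size of its cluster (a common fitness-sharing scheme) are all assumptions made for illustration. The cluster labels are taken as given; in practice they would come from a clustering step on structural descriptors.

```python
from collections import Counter
import random


def energy_fitness(energies):
    """Map energies to [0, 1]; the lowest energy gets fitness 1 (assumed form)."""
    e_min, e_max = min(energies), max(energies)
    if e_max == e_min:
        # Degenerate population: treat every structure as equally fit.
        return [1.0] * len(energies)
    return [(e_max - e) / (e_max - e_min) for e in energies]


def cluster_fitness(energies, labels):
    """Divide each energy fitness by the size of the structure's cluster.

    Structures in crowded clusters are penalized, so sparsely sampled
    regions keep a chance of being selected (assumed sharing scheme).
    """
    sizes = Counter(labels)
    base = energy_fitness(energies)
    return [f / sizes[lab] for f, lab in zip(base, labels)]


def roulette_select(fitness, rng):
    """Pick one index with probability proportional to its fitness."""
    return rng.choices(range(len(fitness)), weights=fitness, k=1)[0]


# Hypothetical population: three similar low-energy structures (cluster 0)
# and two structurally distinct ones (clusters 1 and 2).
energies = [-10.0, -9.9, -9.8, -9.5, -9.0]
labels = [0, 0, 0, 1, 2]

shared = cluster_fitness(energies, labels)
parent = roulette_select(shared, random.Random(0))
```

In this toy population the structure in the singleton cluster 1 ends up with the highest shared fitness even though it is not the lowest in energy, which is the kind of bias toward under-sampled regions that niching is meant to introduce.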
Pages: 2246 / +
Number of pages: 19