Data-Driven Strategies for Accelerated Materials Design

被引:283
作者
Pollice, Robert [1 ,2 ]
Gomes, Gabriel dos Passos [1 ,2 ]
Aldeghi, Matteo [1 ,2 ,3 ]
Hickman, Riley J. [1 ,2 ]
Krenn, Mario [1 ,2 ,3 ]
Lavigne, Cyrille [1 ,2 ]
Lindner-D'Addario, Michael [1 ,2 ]
Nigam, AkshatKumar [1 ,2 ]
Ser, Cher Tian [1 ,2 ]
Yao, Zhenpeng [1 ,2 ]
Aspuru-Guzik, Alan [1 ,2 ,3 ,4 ]
机构
[1] Univ Toronto, Dept Chem, Chem Phys Theory Grp, Toronto, ON M5S 3H6, Canada
[2] Univ Toronto, Dept Comp Sci, Toronto, ON M5S 3H6, Canada
[3] Vector Inst Artificial Intelligence, Toronto, ON M5G 1M1, Canada
[4] Canadian Inst Adv Res CIFAR, Toronto, ON M5G, Canada
基金
加拿大自然科学与工程研究理事会; 奥地利科学基金会; 瑞士国家科学基金会;
关键词
LIGHT-EMITTING-DIODES; CLEAN ENERGY PROJECT; ORGANIC PHOTOVOLTAICS; COMPUTATIONAL DISCOVERY; SELECTION BIAS; MICROARRAY; CANDIDATES; BATTERIES;
D O I
10.1021/acs.accounts.0c00785
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
The ongoing revolution of the natural sciences by the advent of machine learning and artificial intelligence sparked significant interest in the material science community in recent years. The intrinsically high dimensionality of the space of realizable materials makes traditional approaches ineffective for large-scale explorations. Modern data science and machine learning tools developed for increasingly complicated problems are an attractive alternative. An imminent climate catastrophe calls for a clean energy transformation by overhauling current technologies within only several years of possible action available. Tackling this crisis requires the development of new materials at an unprecedented pace and scale. For example, organic photovoltaics have the potential to replace existing silicon-based materials to a large extent and open up new fields of application. In recent years, organic light-emitting diodes have emerged as state-of-the-art technology for digital screens and portable devices and are enabling new applications with flexible displays. Reticular frameworks allow the atom-precise synthesis of nanomaterials and promise to revolutionize the field by the potential to realize multifunctional nanopartides with applications from gas storage, gas separation, and electrochemical energy storage to nanomedicine. In the recent decade, significant advances in all these fields have been facilitated by the comprehensive application of simulation and machine learning for property prediction, property optimization, and chemical space exploration enabled by considerable advances in computing power and algorithmic efficiency. In this Account, we review the most recent contributions of our group in this thriving field of machine learning for material science. We start with a summary of the most important material classes our group has been involved in, focusing on small molecules as organic electronic materials and crystalline materials. Specifically, we highlight the data-driven approaches we employed to speed up discovery and derive material design strategies. Subsequently, our focus lies on the data-driven methodologies our group has developed and employed, elaborating on high-throughput virtual screening, inverse molecular design, Bayesian optimization, and supervised learning. We discuss the general ideas, their working principles, and their use cases with examples of successful implementations in data-driven material discovery and design efforts. Furthermore, we elaborate on potential pitfalls and remaining challenges of these methods. Finally, we provide a brief outlook for the field as we foresee increasing adaptation and implementation of large scale data-driven approaches in material discovery and design campaigns.
引用
收藏
页码:849 / 860
页数:12
相关论文
共 74 条
[51]   DESIGN AND NATURAL-SCIENCE RESEARCH ON INFORMATION TECHNOLOGY [J].
MARCH, ST ;
SMITH, GF .
DECISION SUPPORT SYSTEMS, 1995, 15 (04) :251-266
[52]   Accelerated computational discovery of high-performance materials for organic photovoltaics by means of cheminformatics [J].
Olivares-Amaya, Roberto ;
Amador-Bedolla, Carlos ;
Hachmann, Johannes ;
Atahan-Evrenk, Sule ;
Sanchez-Carrera, Roel S. ;
Vogt, Leslie ;
Aspuru-Guzik, Alan .
ENERGY & ENVIRONMENTAL SCIENCE, 2011, 4 (12) :4849-4861
[53]   Organic Optoelectronic Materials: Mechanisms and Applications [J].
Ostroverkhova, Oksana .
CHEMICAL REVIEWS, 2016, 116 (22) :13279-13412
[54]  
Pollice R., 2020, CHEMRXIV, DOI [10.26434/CHEMRXIV.13087319.V1, DOI 10.26434/CHEMRXIV.13087319.V1]
[55]  
Ponrouch A, 2016, NAT MATER, V15, P169, DOI [10.1038/NMAT4462, 10.1038/nmat4462]
[56]   Learning from the Harvard Clean Energy Project: The Use of Neural Networks to Accelerate Materials Discovery [J].
Pyzer-Knapp, Edward O. ;
Li, Kewei ;
Aspuru-Guzik, Alan .
ADVANCED FUNCTIONAL MATERIALS, 2015, 25 (41) :6495-6502
[57]   What Is High-Throughput Virtual Screening? A Perspective from Organic Materials Discovery [J].
Pyzer-Knapp, Edward O. ;
Suh, Changwon ;
Gomez-Bombarelli, Rafael ;
Aguilera-Iparraguirre, Jorge ;
Aspuru-Guzik, Alan .
ANNUAL REVIEW OF MATERIALS RESEARCH, VOL 45, 2015, 45 :195-216
[58]   ChemOS: An orchestration software to democratize autonomous discovery [J].
Roch, Loic M. ;
Hase, Florian ;
Kreisbeck, Christoph ;
Tamayo-Mendoza, Teresa ;
Yunker, Lars P. E. ;
Hein, Jason E. ;
Aspuru-Guzik, Alan .
PLOS ONE, 2020, 15 (04)
[59]   ChemOS: Orchestrating autonomous experimentation [J].
Roch, Loic M. ;
Hase, Florian ;
Kreisbeck, Christoph ;
Tamayo-Mendoza, Teresa ;
Yunker, Lars P. E. ;
Hein, Jason E. ;
Aspuru-Guzik, Alan .
SCIENCE ROBOTICS, 2018, 3 (19)
[60]  
Roelofs R., 2019, P 33 INT C NEUR INF, V32nd, P9179, DOI DOI 10.5555/3454287.3455110