Combining machine learning and high-throughput experimentation to discover photocatalytically active organic molecules

Cited by: 63
Authors
Li, Xiaobo [1]
Maffettone, Phillip M. [1,2]
Che, Yu [1,3]
Liu, Tao [1]
Chen, Linjiang [1,3]
Cooper, Andrew I. [1,3]
Affiliations
[1] Univ Liverpool, Dept Chem & Mat Innovat Factory, 51 Oxford St, Liverpool L7 3NY, Merseyside, England
[2] Brookhaven Natl Lab, Natl Synchrotron Light Source 2, Upton, NY 11973 USA
[3] Univ Liverpool, Leverhulme Res Ctr Funct Mat Design, Mat Innovat Factory & Dept Chem, 51 Oxford St, Liverpool L7 3NY, Merseyside, England
Funding
UK Engineering and Physical Sciences Research Council (EPSRC);
Keywords
HYDROGEN-PRODUCTION; VISIBLE-LIGHT; CO2; REDUCTION; PHOTOSENSITIZERS; SYSTEMS; DESIGN; WATER; DYES; TEMPERATURE; FRAMEWORKS;
DOI
10.1039/d1sc02150h
Chinese Library Classification number
O6 [Chemistry];
Discipline classification code
0703;
Abstract
Light-absorbing organic molecules are useful components in photocatalysts, but it is difficult to formulate reliable structure-property design rules. More than 100 million unique chemical compounds are documented in the PubChem database, and a significant subset of these are pi-conjugated, light-absorbing molecules that might in principle act as photocatalysts. Nature has used natural selection to evolve photosynthetic assemblies; by contrast, our ability to navigate the enormous potential search space of organic photocatalysts in the laboratory is limited. Here, we integrate experiment, computation, and machine learning to address this challenge. A library of 572 aromatic organic molecules was assembled with diverse compositions and structures, selected on the basis of availability in our laboratory rather than on more sophisticated criteria. This training library was then assessed experimentally for sacrificial photocatalytic hydrogen evolution using a high-throughput, automated method. Quantum chemical calculations and machine learning were used to visualise, interpret, and ultimately predict the photocatalytic activities of these molecules, covering a much broader chemical space than previous polymer photocatalyst libraries. By applying unsupervised learning to the molecular structures, we identified structural features that were common in molecules with high catalytic activity. Further analysis using calculated molecular descriptors within a suite of supervised classification algorithms revealed that light absorption, exciton electron affinity, electron affinity, exciton binding energy, and singlet-triplet energy gap correlated with the photocatalytic performance. These trained predictive models can be used in future studies as filters to deprioritise or discard would-be low-activity candidate molecules from experiments, and to prioritise more favourable candidates.
As a demonstration, we used in silico experiments to show that it was possible to halve the experimental cost of finding 50% of the most active photocatalysts by using the machine learning model as an experimental advisor. We further showed that the ML advisor trained on the 572-molecule library could be used to make predictions for an unseen set of 96 molecules, achieving predictive accuracies equivalent to those for the initial training set. This marks a step toward the machine-learning-assisted discovery of molecular organic photocatalysts, and the approach might also be applied to problems beyond photocatalytic hydrogen evolution, such as CO2 reduction and photoredox chemistry.
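The "experimental advisor" idea in the abstract can be illustrated with a toy simulation. The sketch below is not the authors' model or data: the activity labels are synthetic, the 10% active rate and the noisy advisor score are assumptions made purely for illustration, and the function names `experiments_needed` and `simulate` are hypothetical. It only shows how one might count the experiments required to recover half of the active molecules when candidates are tested in random order versus in the order suggested by a scoring model.

```python
import math
import random


def experiments_needed(order, is_active, fraction=0.5):
    """Count experiments run, in the given order, until `fraction`
    of all active molecules has been found."""
    target = math.ceil(fraction * sum(is_active))
    found = 0
    for n_run, idx in enumerate(order, start=1):
        found += is_active[idx]
        if found >= target:
            return n_run
    return len(order)


def simulate(n_molecules=572, active_rate=0.1, noise=1.0, seed=0):
    """Compare random screening with advisor-ranked screening.

    All values here are synthetic assumptions: `active_rate` is an
    invented share of active molecules and `noise` controls how
    imperfect the hypothetical advisor score is (0.0 = perfect).
    """
    rng = random.Random(seed)
    is_active = [1 if rng.random() < active_rate else 0
                 for _ in range(n_molecules)]

    # Hypothetical advisor: the true label plus Gaussian noise,
    # standing in for a trained classifier's predicted score.
    scores = [a + rng.gauss(0.0, noise) for a in is_active]

    random_order = list(range(n_molecules))
    rng.shuffle(random_order)
    advisor_order = sorted(range(n_molecules),
                           key=lambda i: scores[i], reverse=True)

    return (experiments_needed(random_order, is_active),
            experiments_needed(advisor_order, is_active))


if __name__ == "__main__":
    n_random, n_advisor = simulate()
    print(f"random order:  {n_random} experiments")
    print(f"advisor order: {n_advisor} experiments")
```

Raising `noise` makes the hypothetical advisor less informative, so its advantage over random ordering shrinks. The numbers printed by this toy have no bearing on the results reported in the paper.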
Pages: 10742-10754
Page count: 13