ChemGymRL: A customizable interactive framework for reinforcement learning for digital chemistry

被引:1
作者
Beeler, Chris [1 ,2 ,3 ]
Subramanian, Sriram Ganapathi [2 ,4 ]
Sprague, Kyle [2 ]
Baula, Mark [2 ]
Chatti, Nouha [2 ]
Dawit, Amanuel [2 ]
Li, Xinkai [2 ]
Paquin, Nicholas [2 ]
Shahen, Mitchell [2 ]
Yang, Zihan [2 ]
Bellinger, Colin [3 ]
Crowley, Mark [2 ]
Tamblyn, Isaac [4 ,5 ]
机构
[1] Univ Ottawa, Dept Math & Stat, Ottawa, ON, Canada
[2] Univ Waterloo, Dept Elect & Comp Engn, Waterloo, ON, Canada
[3] Natl Res Council Canada, Digital Technol, Ottawa, ON, Canada
[4] Vector Inst Artificial Intelligence, Toronto, ON, Canada
[5] Univ Ottawa, Dept Phys, Ottawa, ON, Canada
来源
DIGITAL DISCOVERY | 2024年 / 3卷 / 04期
基金
加拿大自然科学与工程研究理事会;
关键词
D O I
10.1039/d3dd00183k
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
This paper provides a simulated laboratory for making use of reinforcement learning (RL) for material design, synthesis, and discovery. Since RL is fairly data intensive, training agents 'on-the-fly' by taking actions in the real world is infeasible and possibly dangerous. Moreover, chemical processing and discovery involves challenges which are not commonly found in RL benchmarks and therefore offer a rich space to work in. We introduce a set of highly customizable and open-source RL environments, ChemGymRL, implementing the standard gymnasium API. ChemGymRL supports a series of interconnected virtual chemical benches where RL agents can operate and train. The paper introduces and details each of these benches using well-known chemical reactions as illustrative examples, and trains a set of standard RL algorithms in each of these benches. Finally, discussion and comparison of the performances of several standard RL methods are provided in addition to a list of directions for future work as a vision for the further development and usage of ChemGymRL. Demonstration of a new open source Python library for simulating chemistry experiments as a gymnasium-API, reinforcement learning environment. Allowing learning policies for material design tasks or pipelines using a modular, extendable design.
引用
收藏
页码:742 / 758
页数:17
相关论文
共 90 条
[11]  
Da Silva FL, 2018, PROCEEDINGS OF THE 17TH INTERNATIONAL CONFERENCE ON AUTONOMOUS AGENTS AND MULTIAGENT SYSTEMS (AAMAS' 18), P1026
[12]  
Dabney W, 2018, AAAI CONF ARTIF INTE, P2892
[13]   Challenges of real-world reinforcement learning: definitions, benchmarks and analysis [J].
Dulac-Arnold, Gabriel ;
Levine, Nir ;
Mankowitz, Daniel J. ;
Li, Jerry ;
Paduraru, Cosmin ;
Gowal, Sven ;
Hester, Todd .
MACHINE LEARNING, 2021, 110 (09) :2419-2468
[14]   ARChemist: Autonomous Robotic Chemistry System Architecture [J].
Fakhruldeen, Hatem ;
Pizzuto, Gabriella ;
Glowacki, Jakub ;
Cooper, Andrew Ian .
2022 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION, ICRA 2022, 2022, :6013-6019
[15]  
Fedus W, 2020, INT C MACHINE LEARNI, P3061, DOI DOI 10.48550/ARXIV.2007.06700
[16]  
Fievez Mathilde, 2022, 2022 IEEE 49th Photovoltaics Specialists Conference (PVSC), P1072, DOI 10.1109/PVSC48317.2022.9938778
[17]   Materials Acceleration Platforms: On the way to autonomous experimentation [J].
Flores-Leonar, Martha M. ;
Mejia-Mendoza, Luis M. ;
Aguilar-Granda, Andres ;
Sanchez-Lengeling, Benjamin ;
Tribukait, Hermann ;
Amador-Bedolla, Carlos ;
Aspuru-Guzik, Alan .
CURRENT OPINION IN GREEN AND SUSTAINABLE CHEMISTRY, 2020, 25
[18]  
Fujimoto S, 2018, PR MACH LEARN RES, V80
[19]  
Fujimoto Scott, 2019, arXiv, DOI DOI 10.48550/ARXIV.1910.01708
[20]  
Gao Y, 2019, Arxiv, DOI arXiv:1802.05313