ChemGymRL: A customizable interactive framework for reinforcement learning for digital chemistry

被引：1

作者：

Beeler, Chris ^{[1
,2
,3
]}

Subramanian, Sriram Ganapathi ^{[2
,4
]}

Sprague, Kyle ^{[2
]}

Baula, Mark ^{[2
]}

Chatti, Nouha ^{[2
]}

Dawit, Amanuel ^{[2
]}

Li, Xinkai ^{[2
]}

Paquin, Nicholas ^{[2
]}

Shahen, Mitchell ^{[2
]}

Yang, Zihan ^{[2
]}

Bellinger, Colin ^{[3
]}

Crowley, Mark ^{[2
]}

Tamblyn, Isaac ^{[4
,5
]}

机构：

[1] Univ Ottawa, Dept Math & Stat, Ottawa, ON, Canada

[2] Univ Waterloo, Dept Elect & Comp Engn, Waterloo, ON, Canada

[3] Natl Res Council Canada, Digital Technol, Ottawa, ON, Canada

[4] Vector Inst Artificial Intelligence, Toronto, ON, Canada

[5] Univ Ottawa, Dept Phys, Ottawa, ON, Canada

来源：

DIGITAL DISCOVERY | 2024年 / 3卷 / 04期

基金：

加拿大自然科学与工程研究理事会;

关键词：

D O I：

10.1039/d3dd00183k

中图分类号：

O6 [化学];

学科分类号：

0703 ;

摘要：

This paper provides a simulated laboratory for making use of reinforcement learning (RL) for material design, synthesis, and discovery. Since RL is fairly data intensive, training agents 'on-the-fly' by taking actions in the real world is infeasible and possibly dangerous. Moreover, chemical processing and discovery involves challenges which are not commonly found in RL benchmarks and therefore offer a rich space to work in. We introduce a set of highly customizable and open-source RL environments, ChemGymRL, implementing the standard gymnasium API. ChemGymRL supports a series of interconnected virtual chemical benches where RL agents can operate and train. The paper introduces and details each of these benches using well-known chemical reactions as illustrative examples, and trains a set of standard RL algorithms in each of these benches. Finally, discussion and comparison of the performances of several standard RL methods are provided in addition to a list of directions for future work as a vision for the further development and usage of ChemGymRL. Demonstration of a new open source Python library for simulating chemistry experiments as a gymnasium-API, reinforcement learning environment. Allowing learning policies for material design tasks or pipelines using a modular, extendable design.

引用

页码：742 / 758

页数：17

共 90 条

[11]

Da Silva FL, 2018, PROCEEDINGS OF THE 17TH INTERNATIONAL CONFERENCE ON AUTONOMOUS AGENTS AND MULTIAGENT SYSTEMS (AAMAS' 18), P1026

[12]

Dabney W, 2018, AAAI CONF ARTIF INTE, P2892

[13] Challenges of real-world reinforcement learning: definitions, benchmarks and analysis [J].

Dulac-Arnold, Gabriel ;

Levine, Nir ;

Mankowitz, Daniel J. ;

Li, Jerry ;

Paduraru, Cosmin ;

Gowal, Sven ;

Hester, Todd .

MACHINE LEARNING, 2021, 110 (09) :2419-2468

[14] ARChemist: Autonomous Robotic Chemistry System Architecture [J].

Fakhruldeen, Hatem ;

Pizzuto, Gabriella ;

Glowacki, Jakub ;

Cooper, Andrew Ian .

2022 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION, ICRA 2022, 2022, :6013-6019

[15]

Fedus W, 2020, INT C MACHINE LEARNI, P3061, DOI DOI 10.48550/ARXIV.2007.06700

[16]

Fievez Mathilde, 2022, 2022 IEEE 49th Photovoltaics Specialists Conference (PVSC), P1072, DOI 10.1109/PVSC48317.2022.9938778

[17] Materials Acceleration Platforms: On the way to autonomous experimentation [J].

Flores-Leonar, Martha M. ;

Mejia-Mendoza, Luis M. ;

Aguilar-Granda, Andres ;

Sanchez-Lengeling, Benjamin ;

Tribukait, Hermann ;

Amador-Bedolla, Carlos ;

Aspuru-Guzik, Alan .

CURRENT OPINION IN GREEN AND SUSTAINABLE CHEMISTRY, 2020, 25

[18]

Fujimoto S, 2018, PR MACH LEARN RES, V80

[19]

Fujimoto Scott, 2019, arXiv, DOI DOI 10.48550/ARXIV.1910.01708

[20]

Gao Y, 2019, Arxiv, DOI arXiv:1802.05313

← 1 2 3 4 5 6 7 8 9 →