ChemGymRL: A customizable interactive framework for reinforcement learning for digital chemistry

被引：1

作者：

Beeler, Chris ^{[1
,2
,3
]}

Subramanian, Sriram Ganapathi ^{[2
,4
]}

Sprague, Kyle ^{[2
]}

Baula, Mark ^{[2
]}

Chatti, Nouha ^{[2
]}

Dawit, Amanuel ^{[2
]}

Li, Xinkai ^{[2
]}

Paquin, Nicholas ^{[2
]}

Shahen, Mitchell ^{[2
]}

Yang, Zihan ^{[2
]}

Bellinger, Colin ^{[3
]}

Crowley, Mark ^{[2
]}

Tamblyn, Isaac ^{[4
,5
]}

机构：

[1] Univ Ottawa, Dept Math & Stat, Ottawa, ON, Canada

[2] Univ Waterloo, Dept Elect & Comp Engn, Waterloo, ON, Canada

[3] Natl Res Council Canada, Digital Technol, Ottawa, ON, Canada

[4] Vector Inst Artificial Intelligence, Toronto, ON, Canada

[5] Univ Ottawa, Dept Phys, Ottawa, ON, Canada

来源：

DIGITAL DISCOVERY | 2024年 / 3卷 / 04期

基金：

加拿大自然科学与工程研究理事会;

关键词：

D O I：

10.1039/d3dd00183k

中图分类号：

O6 [化学];

学科分类号：

0703 ;

摘要：

This paper provides a simulated laboratory for making use of reinforcement learning (RL) for material design, synthesis, and discovery. Since RL is fairly data intensive, training agents 'on-the-fly' by taking actions in the real world is infeasible and possibly dangerous. Moreover, chemical processing and discovery involves challenges which are not commonly found in RL benchmarks and therefore offer a rich space to work in. We introduce a set of highly customizable and open-source RL environments, ChemGymRL, implementing the standard gymnasium API. ChemGymRL supports a series of interconnected virtual chemical benches where RL agents can operate and train. The paper introduces and details each of these benches using well-known chemical reactions as illustrative examples, and trains a set of standard RL algorithms in each of these benches. Finally, discussion and comparison of the performances of several standard RL methods are provided in addition to a list of directions for future work as a vision for the further development and usage of ChemGymRL. Demonstration of a new open source Python library for simulating chemistry experiments as a gymnasium-API, reinforcement learning environment. Allowing learning policies for material design tasks or pipelines using a modular, extendable design.

引用

页码：742 / 758

页数：17

共 90 条

[1]

Achiam J, 2017, PR MACH LEARN RES, V70

[2]

Andrychowicz Marcin, 2017, ADV NEURAL INFORM PR, V30

[3]

Bacon PL, 2017, AAAI CONF ARTIF INTE, P1726

[4]

Bellemare M. G., 2023, Distributional reinforcement learning

[5]

Bellemare MG, 2017, PR MACH LEARN RES, V70

[6] Autonomous chemical science and engineering enabled by self-driving laboratories [J].

Bennett, Jeffrey A. ;

Abolhasani, Milad .

CURRENT OPINION IN CHEMICAL ENGINEERING, 2022, 36

[7]

Bubliauskas A., 2022, Angew. Chem, V134

[8] Discovering New Chemistry with an Autonomous Robotic Platform Driven by a Reactivity-Seeking Neural Network [J].

Caramelli, Dario ;

Granda, Jaroslaw M. ;

Mehr, S. Hessam M. ;

Cambie, Dario ;

Henson, Alon B. ;

Cronin, Leroy .

ACS CENTRAL SCIENCE, 2021, 7 (11) :1821-1830

[9] Accelerated chemical space search using a quantum-inspired cluster expansion approach [J].

Choubisa, Hitarth ;

Abed, Jehad ;

Mendoza, Douglas ;

Matsumura, Hidetoshi ;

Sugimura, Masahiko ;

Yao, Zhenpeng ;

Wang, Ziyun ;

Sutherland, Brandon R. ;

Aspuru-Guzik, Alan ;

Sargent, Edward H. .

MATTER, 2023, 6 (02) :605-+

[10]

Chunduru R, 2022, Arxiv, DOI arXiv:2201.02628

← 1 2 3 4 5 6 7 8 9 →