ChemGymRL: A customizable interactive framework for reinforcement learning for digital chemistry

被引:1
作者
Beeler, Chris [1 ,2 ,3 ]
Subramanian, Sriram Ganapathi [2 ,4 ]
Sprague, Kyle [2 ]
Baula, Mark [2 ]
Chatti, Nouha [2 ]
Dawit, Amanuel [2 ]
Li, Xinkai [2 ]
Paquin, Nicholas [2 ]
Shahen, Mitchell [2 ]
Yang, Zihan [2 ]
Bellinger, Colin [3 ]
Crowley, Mark [2 ]
Tamblyn, Isaac [4 ,5 ]
机构
[1] Univ Ottawa, Dept Math & Stat, Ottawa, ON, Canada
[2] Univ Waterloo, Dept Elect & Comp Engn, Waterloo, ON, Canada
[3] Natl Res Council Canada, Digital Technol, Ottawa, ON, Canada
[4] Vector Inst Artificial Intelligence, Toronto, ON, Canada
[5] Univ Ottawa, Dept Phys, Ottawa, ON, Canada
来源
DIGITAL DISCOVERY | 2024年 / 3卷 / 04期
基金
加拿大自然科学与工程研究理事会;
关键词
D O I
10.1039/d3dd00183k
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
This paper provides a simulated laboratory for making use of reinforcement learning (RL) for material design, synthesis, and discovery. Since RL is fairly data intensive, training agents 'on-the-fly' by taking actions in the real world is infeasible and possibly dangerous. Moreover, chemical processing and discovery involves challenges which are not commonly found in RL benchmarks and therefore offer a rich space to work in. We introduce a set of highly customizable and open-source RL environments, ChemGymRL, implementing the standard gymnasium API. ChemGymRL supports a series of interconnected virtual chemical benches where RL agents can operate and train. The paper introduces and details each of these benches using well-known chemical reactions as illustrative examples, and trains a set of standard RL algorithms in each of these benches. Finally, discussion and comparison of the performances of several standard RL methods are provided in addition to a list of directions for future work as a vision for the further development and usage of ChemGymRL. Demonstration of a new open source Python library for simulating chemistry experiments as a gymnasium-API, reinforcement learning environment. Allowing learning policies for material design tasks or pipelines using a modular, extendable design.
引用
收藏
页码:742 / 758
页数:17
相关论文
共 90 条
[1]  
Achiam J, 2017, PR MACH LEARN RES, V70
[2]  
Andrychowicz Marcin, 2017, ADV NEURAL INFORM PR, V30
[3]  
Bacon PL, 2017, AAAI CONF ARTIF INTE, P1726
[4]  
Bellemare M. G., 2023, Distributional reinforcement learning
[5]  
Bellemare MG, 2017, PR MACH LEARN RES, V70
[6]   Autonomous chemical science and engineering enabled by self-driving laboratories [J].
Bennett, Jeffrey A. ;
Abolhasani, Milad .
CURRENT OPINION IN CHEMICAL ENGINEERING, 2022, 36
[7]  
Bubliauskas A., 2022, Angew. Chem, V134
[8]   Discovering New Chemistry with an Autonomous Robotic Platform Driven by a Reactivity-Seeking Neural Network [J].
Caramelli, Dario ;
Granda, Jaroslaw M. ;
Mehr, S. Hessam M. ;
Cambie, Dario ;
Henson, Alon B. ;
Cronin, Leroy .
ACS CENTRAL SCIENCE, 2021, 7 (11) :1821-1830
[9]   Accelerated chemical space search using a quantum-inspired cluster expansion approach [J].
Choubisa, Hitarth ;
Abed, Jehad ;
Mendoza, Douglas ;
Matsumura, Hidetoshi ;
Sugimura, Masahiko ;
Yao, Zhenpeng ;
Wang, Ziyun ;
Sutherland, Brandon R. ;
Aspuru-Guzik, Alan ;
Sargent, Edward H. .
MATTER, 2023, 6 (02) :605-+
[10]  
Chunduru R, 2022, Arxiv, DOI arXiv:2201.02628