Reinforcement learning in discrete action space applied to inverse defect design

被引:13
作者
Loeffler, Troy D. [1 ,2 ]
Banik, Suvo [1 ,2 ]
Patra, Tarak K. [2 ,3 ]
Sternberg, Michael [2 ]
Sankaranarayanan, Subramanian K. R. S. [1 ,2 ]
机构
[1] Univ Illinois, Dept Mech & Ind Engn, Chicago, IL 60607 USA
[2] Argonne Natl Lab, Ctr Nanoscale Mat, Lemont, IL 60439 USA
[3] Indian Inst Technol Madras, Dept Chem Engn, Chennai 600036, Tamil Nadu, India
来源
JOURNAL OF PHYSICS COMMUNICATIONS | 2021年 / 5卷 / 03期
关键词
machine learning; defect design; inverse design; DEEP NEURAL-NETWORKS; MOLYBDENUM-DISULFIDE; TREE-SEARCH; ALGORITHMS;
D O I
10.1088/2399-6528/abe591
中图分类号
O4 [物理学];
学科分类号
0702 ;
摘要
Reinforcement learning (RL) algorithms that include Monte Carlo Tree Search (MCTS) have found tremendous success in computer games such as Go, Shiga and Chess. Such learning algorithms have demonstrated super-human capabilities in navigating through an exhaustive discrete action search space. Motivated by their success in computer games, we demonstrate that RL can be applied to inverse materials design problems. We deploy RL for a representative case of the optimal atomic scale inverse design of extended defects via rearrangement of chalcogen (e.g. S) vacancies in 2D transition metal dichalcogenides (e.g. MoS2). These defect rearrangements and their dynamics are important from the perspective of tunable phase transition in 2D materials i.e. 2H (semi-conducting) to 1T (metallic) in MoS2. We demonstrate the ability of MCTS interfaced with a reactive molecular dynamics simulator to efficiently sample the defect phase space and perform inverse design-starting from randomly distributed S vacancies, the optimal defect rearrangement of defects corresponds a line defect of S vacancies. We compare MCTS performance with evolutionary optimization i.e. genetic algorithms and show that MCTS converges to a better optimal solution (lower objective) and in fewer evaluations compared to GA. We also comprehensively evaluate and discuss the effect of MCTS hyperparameters on the convergence to solution. Overall, our study demonstrates the effectives of using RL approaches that operate in discrete action space for inverse defect design problems.
引用
收藏
页数:10
相关论文
共 25 条
[1]   Surface Defects on Natural MoS2 [J].
Addou, Rafik ;
Colombo, Luigi ;
Wallace, Robert M. .
ACS APPLIED MATERIALS & INTERFACES, 2015, 7 (22) :11921-11929
[2]   Defect Dominated Charge Transport and Fermi Level Pinning in MoS2/Metal Contacts [J].
Bampoulis, Pantelis ;
van Bremen, Rik ;
Yao, Qirong ;
Poelsema, Bene ;
Zandvliet, Harold J. W. ;
Sotthewes, Kai .
ACS APPLIED MATERIALS & INTERFACES, 2017, 9 (22) :19278-19286
[3]   A Survey of Monte Carlo Tree Search Methods [J].
Browne, Cameron B. ;
Powley, Edward ;
Whitehouse, Daniel ;
Lucas, Simon M. ;
Cowling, Peter I. ;
Rohlfshagen, Philipp ;
Tavener, Stephen ;
Perez, Diego ;
Samothrakis, Spyridon ;
Colton, Simon .
IEEE TRANSACTIONS ON COMPUTATIONAL INTELLIGENCE AND AI IN GAMES, 2012, 4 (01) :1-43
[4]   Monte Carlo tree search for materials design and discovery [J].
Dieb, Thaer M. ;
Ju, Shenghong ;
Shiomi, Junichiro ;
Tsuda, Koji .
MRS COMMUNICATIONS, 2019, 9 (02) :532-536
[5]   MDTS: automatic complex materials design using Monte Carlo tree search [J].
Dieb, Thaer M. ;
Ju, Shenghong ;
Yoshizoe, Kazuki ;
Hou, Zhufeng ;
Shiomi, Junichiro ;
Tsuda, Koji .
SCIENCE AND TECHNOLOGY OF ADVANCED MATERIALS, 2017, 18 (01) :498-503
[6]  
Goldberg D. E., 1989, Genetic Algorithms in Search, Optimization and Machine Learning, V1st, DOI DOI 10.1109/ICETEEEM.2012.6494460
[7]   Searching the stable segregation configuration at the grain boundary by a Monte Carlo tree search [J].
Kiyohara, Shin ;
Mizoguchi, Teruyasu .
JOURNAL OF CHEMICAL PHYSICS, 2018, 148 (24)
[8]   Bandit based Monte-Carlo planning [J].
Kocsis, Levente ;
Szepesvari, Csaba .
MACHINE LEARNING: ECML 2006, PROCEEDINGS, 2006, 4212 :282-293
[9]  
Lin YC, 2014, NAT NANOTECHNOL, V9, P391, DOI [10.1038/nnano.2014.64, 10.1038/NNANO.2014.64]
[10]   Defect-Dominated Doping and Contact Resistance in MoS2 [J].
McDonnell, Stephen ;
Addou, Rafik ;
Buie, Creighton ;
Wallace, Robert M. ;
Hinkle, Christopher L. .
ACS NANO, 2014, 8 (03) :2880-2888