Small modular reactor reinforcement learning framework: Automating reactor core startup

被引:0
作者
Bae, Seong Jun [1 ]
Son, Hong Hyun [1 ]
Lee, Yongjae [1 ]
Yang, Jongin [2 ]
机构
[1] Korea Atom Energy Res Inst, Daejeon 34057, South Korea
[2] Kumoh Natl Inst Technol, 61 Daehak Ro, Gumi Si 39177, Gyeongsangbuk D, South Korea
基金
新加坡国家研究基金会;
关键词
Reinforcement learning; SMR simulation; Reactor core startup; INSTABILITY; FLOW;
D O I
10.1016/j.net.2024.10.009
中图分类号
TL [原子能技术]; O571 [原子核物理学];
学科分类号
0827 ; 082701 ;
摘要
A small modular reactor (SMR) has been considered a potential alternative for achieving carbon neutrality, and therefore, an increasing number of countries are performing extensive research and development. However, this is still in the development stage, and there are several technological or economical challenges that need to be overcome. Minimizing manual operations may be considered a wise approach to reduce the number of operators. Reactor core startup, which is a manual operation, is considered as an example. A method to automate the reactor core startup via the reinforcement learning (RL) algorithm is proposed in this paper. Further, an efficient SMR dynamic simulation model that performs simulations considering the action of the RL agent to achieve states and reward is developed. The suggested SMR dynamic simulation model is validated by the data available in the existing literature. The proposed method can perform automatic reactor core startup. The proposed framework that incorporates the SMR simulator to the RL algorithm is expected to be applied to various cases for reducing manual operations and contributing to realizing a higher level of SMR automation.
引用
收藏
页数:15
相关论文
共 37 条
[31]  
Shixiang Gu, 2017, 2017 IEEE International Conference on Robotics and Automation (ICRA), P3389, DOI 10.1109/ICRA.2017.7989385
[32]   DENSITY WAVE INSTABILITY IN ONCE-THROUGH BOILING FLOW SYSTEM .1. EXPERIMENT [J].
TAKITANI, K ;
TAKEMURA, T .
JOURNAL OF NUCLEAR SCIENCE AND TECHNOLOGY, 1978, 15 (05) :355-364
[33]  
van Hasselt H, 2015, Arxiv, DOI [arXiv:1509.06461, 10.48550/ARXIV.1509.06461, DOI 10.1609/AAAI.V30I1.10295, 10.1609/aaai.v30i1.10295]
[34]  
Vijayan P.K., 2019, SINGLE PHASE 2 PHASE, DOI DOI 10.1016/C2017-0-01142-4
[35]  
Wierstra D, 2019, arXiv, DOI DOI 10.48550/ARXIV.1509.02971
[36]   Boundary condition coupling methods and its application to BOP integrated transient simulation of SMART [J].
Yang, Jongin ;
Son, Hong Hyun ;
Lee, Yong Jae ;
Shin, Doyoung ;
Kim, Taejin ;
Choi, Seong Soo .
NUCLEAR ENGINEERING AND TECHNOLOGY, 2023, 55 (06) :1974-1987
[37]   A policy optimization algorithm based on sample adaptive reuse and dual-clipping for robotic action control [J].
Zhao, Li -yang ;
Chang, Tian-qing ;
Zhang, Jie ;
Zhang, Lei ;
Chu, Kai-xuan ;
Guo, Li -bin ;
Kong, De-peng .
APPLIED SOFT COMPUTING, 2023, 134