Accelerated Deep Reinforcement Learning for Fast Feedback of Beam Dynamics at KARA

被引:7
作者
Wang, Weija [1 ]
Caselle, Michele [1 ]
Boltz, Tobias [2 ]
Blomley, Edmund [2 ]
Brosi, Miriam [2 ]
Dritschler, Timo [1 ]
Ebersoldt, Andreas [1 ]
Kopmann, Andreas [1 ]
Garcia, Andrea Santamaria [3 ]
Schreiber, Patrick [3 ]
Bruendermann, Erik [2 ]
Weber, Marc [4 ]
Mueller, Anke-Susanne [2 ]
Fang, Yangwang [5 ]
机构
[1] Karlsruhe Inst Technol KIT, Inst Data Proc & Elect, D-76344 Eggenstein Leopoldshafen, Germany
[2] Karlsruhe Inst Technol KIT, Inst Beam Phys & Technol, D-76344 Eggenstein Leopoldshafen, Germany
[3] Karlsruhe Inst Technol KIT, Lab Applicat Synchrotron Radiat, D-76131 Karlsruhe, Germany
[4] Karlsruhe Inst Technol KIT, Div Phys & Math 5, D-76344 Eggenstein Leopoldshafen, Germany
[5] Northwestern Polytech Univ, Unmanned Syst Res Inst, Xian 710072, Peoples R China
关键词
Hardware; Radio frequency; Real-time systems; Detectors; Control systems; Reinforcement learning; Perturbation methods; Artificial intelligence; beam diagnostics; control systems; machine learning; particle accelerators; real-time control; reinforcement learning (RL);
D O I
10.1109/TNS.2021.3084515
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Coherent synchrotron radiation (CSR) is generated when the electron bunch length is in the order of the magnitude of the wavelength of the emitted radiation. The self-interaction of short electron bunches with their own electromagnetic fields changes the longitudinal beam dynamics significantly. Above a certain current threshold, the micro-bunching instability develops, characterized by the appearance of distinguishable substructures in the longitudinal phase space of the bunch. To stabilize the CSR emission, a real-time feedback control loop based on reinforcement learning (RL) is proposed. Informed by the available THz diagnostics, the feedback is designed to act on the radio frequency (RF) system of the storage ring to mitigate the micro-bunching dynamics. To satisfy low-latency requirements given by the longitudinal beam dynamics, the RL controller has been implemented on hardware (FPGA). In this article, a real-time feedback loop architecture and its performance is presented and compared with a software implementation using Keras-RL on CPU/GPU. The results obtained with the CSR simulation Inovesa demonstrate that the functionality of both platforms is equivalent. The training performance of the hardware implementation is similar to software solution, while it outperforms the Keras-RL implementation by an order of magnitude. The presented RL hardware controller is considered as an essential platform for the development of intelligent CSR control systems.
引用
收藏
页码:1794 / 1800
页数:7
相关论文
共 20 条
[1]  
Abadi M, 2016, PROCEEDINGS OF OSDI'16: 12TH USENIX SYMPOSIUM ON OPERATING SYSTEMS DESIGN AND IMPLEMENTATION, P265
[2]  
[Anonymous], 2016, ZYNQ ULTR MPSOC PROD
[3]  
Boltz T., 2019, P INT PART ACC C IPA, P104, DOI [10.18429/JACoW-IPAC2019, DOI 10.18429/JACOW-IPAC2019-MOPGW017]
[4]   Fast mapping of terahertz bursting thresholds and characteristics at synchrotron light sources [J].
Brosi, Miriam ;
Steinmann, Johannes L. ;
Blomley, Edmund ;
Bruendermann, Erik ;
Caselle, Michele ;
Hiller, Nicole ;
Kehrer, Benjamin ;
Mathis, Yves-Laurent ;
Nasse, Michael J. ;
Rota, Lorenzo ;
Schedler, Manuel ;
Schoenfeldt, Patrik ;
Schuh, Marcel ;
Schwarz, Markus ;
Weber, Marc ;
Mueller, Anke-Susanne .
PHYSICAL REVIEW ACCELERATORS AND BEAMS, 2016, 19 (11)
[5]  
Brundermann E., 2012, TERAHERTZ TECHNIQUES
[6]   A high-speed DAQ framework for future high-level trigger and event building clusters [J].
Caselle, M. ;
Perez, L. E. Ardila ;
Balzer, M. ;
Dritschler, T. ;
Kopmann, A. ;
Mohr, H. ;
Rota, L. ;
Vogelgesang, M. ;
Weber, M. .
JOURNAL OF INSTRUMENTATION, 2017, 12
[7]  
Chollet F., 2015, Keras
[8]  
Fu Y., 2017, White Paper WP486 (v1.0.1)
[9]   Caffe: Convolutional Architecture for Fast Feature Embedding [J].
Jia, Yangqing ;
Shelhamer, Evan ;
Donahue, Jeff ;
Karayev, Sergey ;
Long, Jonathan ;
Girshick, Ross ;
Guadarrama, Sergio ;
Darrell, Trevor .
PROCEEDINGS OF THE 2014 ACM CONFERENCE ON MULTIMEDIA (MM'14), 2014, :675-678
[10]  
Lillicrap T. P., 2015, PREPRINT