Model-free Reinforcement Learning with a Non-linear Reconstructor for closed-loop Adaptive Optics control with a pyramid wavefront sensor

被引:7
|
作者
Pou, B. [1 ,2 ]
Smith, J. [3 ]
Quinones, E. [1 ]
Martin, M. [2 ]
Gratadour, D. [4 ]
机构
[1] Barcelona Supercomputing Ctr BSC, C Jordi Girona 29, Barcelona 08034, Spain
[2] Univ Politecn Catalunya UPC, Comp Sci Dept, C Jordi Girona 31, Barcelona 08034, Spain
[3] Australian Natl Univ, Sch Comp, Canberra, Australia
[4] Univ PSL, Sorbonne Univ, Univ Paris Diderot, CNRS,LESIA,Observ Paris, Sorbonne Paris Cite,5 Pl Jules Janssen, F-92195 Meudon, France
来源
ADAPTIVE OPTICS SYSTEMS VIII | 2022年 / 12185卷
关键词
Reinforcement Learning; AO Control; Machine Learning; Pyramid Wavefront Sensor; NEURAL-NETWORKS;
D O I
10.1117/12.2627849
中图分类号
P1 [天文学];
学科分类号
0704 ;
摘要
We present a model-free reinforcement learning (RL) predictive model with a supervised learning non-linear reconstructor for adaptive optics (AO) control with a pyramid wavefront sensor (P-WFS). First, we analyse the additional problems of training an RL control method with a P-WFS compared to the Shack-Hartmann WFS. From those observations, we propose our solution: a combination of model-free RL for prediction with a non-linear reconstructor based on neural networks with a U-net architecture. We test the proposed method in simulation of closed-loop AO for an 8m telescope equipped with a 32x32 P-WFS and observe that both the predictive and non-linear reconstruction add additional benefits over an optimised integrator.
引用
收藏
页数:14
相关论文
共 19 条
  • [1] The pyramid wavefront sensor used in the closed-loop adaptive optics system
    Wang, Shengqian
    Wei, Kai
    Zheng, Wenjia
    Rao, Changhui
    ADAPTIVE OPTICS SYSTEMS V, 2016, 9909
  • [2] Testing the pyramid wavefront sensor without modulation used in the closed-loop adaptive optics system
    Wang, Shengqian
    Rao, Changhui
    Zhang, Ang
    Zhang, Xuejun
    Wei, Kai
    Tian, Yu
    Liao, Zhou
    Zhang, Cheng
    Xian, Hao
    Zhang, Xiaojun
    Wei, Ling
    ADAPTIVE OPTICS SYSTEMS III, 2012, 8447
  • [3] Integrating supervised and reinforcement learning for predictive control with an unmodulated pyramid wavefront sensor for adaptive optics
    Pou, Bartomeu
    Smith, Jeffrey
    Quinones, Eduardo
    Martin, Mario
    Gratadour, Damien
    OPTICS EXPRESS, 2024, 32 (21): : 37011 - 37035
  • [4] Model-free closed-loop wind farm control using reinforcement learning with recursive least squares
    Liew, Jaime
    Gocmen, Tuhfe
    Lio, Wai Hou
    Larsen, Gunner Chr.
    WIND ENERGY, 2024, 27 (11) : 1173 - 1187
  • [5] Reinforcement learning based closed-loop reference model adaptive flight control system design
    Yuksek, Burak
    Inalhan, Gokhan
    INTERNATIONAL JOURNAL OF ADAPTIVE CONTROL AND SIGNAL PROCESSING, 2021, 35 (03) : 420 - 440
  • [6] Model-Based Reinforcement Learning for Closed-Loop Dynamic Control of Soft Robotic Manipulators
    Thuruthel, Thomas George
    Falotico, Egidio
    Renda, Federico
    Laschi, Cecilia
    IEEE TRANSACTIONS ON ROBOTICS, 2019, 35 (01) : 124 - 134
  • [7] Adaptive phase shift control of thermoacoustic combustion instabilities using model-free reinforcement learning
    Alhazmi, Khalid
    Sarathy, S. Mani
    COMBUSTION AND FLAME, 2023, 257
  • [8] Model-free adaptive control design for nonlinear discrete-time processes with reinforcement learning techniques
    Liu, Dong
    Yang, Guang-Hong
    INTERNATIONAL JOURNAL OF SYSTEMS SCIENCE, 2018, 49 (11) : 2298 - 2308
  • [9] Data-Driven Model-Free Tracking Reinforcement Learning Control with VRFT-based Adaptive Actor-Critic
    Radac, Mircea-Bogdan
    Precup, Radu-Emil
    APPLIED SCIENCES-BASEL, 2019, 9 (09):
  • [10] A Model-free Reinforcement Learning Approach for the Energetic Control of a Building with Non-stationary User Behaviour
    Haddam, Nassim
    Boulakia, Benjamin Cohen
    Barth, Dominique
    2020 THE 4TH INTERNATIONAL CONFERENCE ON SMART GRID AND SMART CITIES (ICSGSC 2020), 2020, : 168 - 177