Model-free Reinforcement Learning with a Non-linear Reconstructor for closed-loop Adaptive Optics control with a pyramid wavefront sensor

Cited by: 7
Authors
Pou, B. [1, 2]
Smith, J. [3 ]
Quinones, E. [1 ]
Martin, M. [2 ]
Gratadour, D. [4 ]
Affiliations
[1] Barcelona Supercomputing Ctr BSC, C Jordi Girona 29, Barcelona 08034, Spain
[2] Univ Politecn Catalunya UPC, Comp Sci Dept, C Jordi Girona 31, Barcelona 08034, Spain
[3] Australian Natl Univ, Sch Comp, Canberra, Australia
[4] Univ PSL, Sorbonne Univ, Univ Paris Diderot, CNRS,LESIA,Observ Paris, Sorbonne Paris Cite,5 Pl Jules Janssen, F-92195 Meudon, France
Source
ADAPTIVE OPTICS SYSTEMS VIII | 2022 | Vol. 12185
Keywords
Reinforcement Learning; AO Control; Machine Learning; Pyramid Wavefront Sensor; NEURAL-NETWORKS;
DOI
10.1117/12.2627849
Chinese Library Classification number
P1 [Astronomy];
Discipline classification code
0704;
Abstract
We present a model-free reinforcement learning (RL) predictive model combined with a supervised-learning non-linear reconstructor for adaptive optics (AO) control with a pyramid wavefront sensor (P-WFS). First, we analyse the additional difficulties of training an RL control method with a P-WFS rather than a Shack-Hartmann WFS. Based on those observations, we propose our solution: model-free RL for prediction combined with a non-linear reconstructor based on neural networks with a U-Net architecture. We test the proposed method in a simulation of closed-loop AO for an 8 m telescope equipped with a 32x32 P-WFS and observe that both the prediction and the non-linear reconstruction provide benefits over an optimised integrator.
Pages: 14
Related papers
50 in total
  • [1] The pyramid wavefront sensor used in the closed-loop adaptive optics system
    Wang, Shengqian
    Wei, Kai
    Zheng, Wenjia
    Rao, Changhui
    ADAPTIVE OPTICS SYSTEMS V, 2016, 9909
  • [2] Wavefront response matrix for closed-loop adaptive optics system based on non-modulation pyramid wavefront sensor
    Wang, Jianxin
    Bai, Fuzhong
    Ning, Yu
    Li, Fei
    Jiang, Wenhan
    OPTICS COMMUNICATIONS, 2012, 285 (12) : 2814 - 2820
  • [3] Laboratory demonstrations on a pyramid wavefront sensor without modulation for closed-loop adaptive optics system
    Wang, Shengqian
    Rao, Changhui
    Xian, Hao
    Zhang, Jianlin
    Wang, Jianxin
    Liu, Zheng
    OPTICS EXPRESS, 2011, 19 (09) : 8135 - 8150
  • [4] Testing the pyramid wavefront sensor without modulation used in the closed-loop adaptive optics system
    Wang, Shengqian
    Rao, Changhui
    Zhang, Ang
    Zhang, Xuejun
    Wei, Kai
    Tian, Yu
    Liao, Zhou
    Zhang, Cheng
    Xian, Hao
    Zhang, Xiaojun
    Wei, Ling
    ADAPTIVE OPTICS SYSTEMS III, 2012, 8447
  • [5] Integrating supervised and reinforcement learning for predictive control with an unmodulated pyramid wavefront sensor for adaptive optics
    Pou, Bartomeu
    Smith, Jeffrey
    Quinones, Eduardo
    Martin, Mario
    Gratadour, Damien
    OPTICS EXPRESS, 2024, 32 (21) : 37011 - 37035
  • [6] Modulation-nonmodulation pyramid wavefront sensor with direct gradient reconstruction algorithm on the closed-loop adaptive optics system
    Wang, Shengqian
    Wei, Kai
    Zheng, Wenjia
    OPTICS EXPRESS, 2018, 26 (16) : 20952 - 20964
  • [7] Model-free closed-loop wind farm control using reinforcement learning with recursive least squares
    Liew, Jaime
    Gocmen, Tuhfe
    Lio, Wai Hou
    Larsen, Gunner Chr.
    WIND ENERGY, 2024, 27 (11) : 1173 - 1187
  • [8] Transformer neural networks for closed-loop adaptive optics using nonmodulated pyramid wavefront sensors
    Weinberger, Camilo
    Tapia, Jorge
    Neichel, Benoit
    Vera, Esteban
    ASTRONOMY & ASTROPHYSICS, 2024, 687
  • [9] Transformer neural networks for closed-loop adaptive optics using nonmodulated pyramid wavefront sensors
    Weinberger, Camilo
    Tapia, Jorge
    Neichel, Benoît
    Vera, Esteban
    Astronomy and Astrophysics, 1600, 687
  • [10] Closed-loop control for adaptive optics wavefront compensation in highly scintillated conditions
    Gerwe, DR
    Stone, JP
    Schall, HB
    LASER SYSTEMS TECHNOLOGY, 2003, 5087 : 87 - 102