Model-free Reinforcement Learning with a Non-linear Reconstructor for closed-loop Adaptive Optics control with a pyramid wavefront sensor

Cited by: 7
Authors
Pou, B. [1, 2]
Smith, J. [3]
Quinones, E. [1]
Martin, M. [2]
Gratadour, D. [4]
Affiliations
[1] Barcelona Supercomputing Ctr BSC, C Jordi Girona 29, Barcelona 08034, Spain
[2] Univ Politecn Catalunya UPC, Comp Sci Dept, C Jordi Girona 31, Barcelona 08034, Spain
[3] Australian Natl Univ, Sch Comp, Canberra, Australia
[4] Univ PSL, Sorbonne Univ, Univ Paris Diderot, CNRS,LESIA,Observ Paris, Sorbonne Paris Cite,5 Pl Jules Janssen, F-92195 Meudon, France
Source
ADAPTIVE OPTICS SYSTEMS VIII | 2022 / Vol. 12185
Keywords
Reinforcement Learning; AO Control; Machine Learning; Pyramid Wavefront Sensor; NEURAL-NETWORKS;
DOI
10.1117/12.2627849
Chinese Library Classification (CLC) number
P1 [Astronomy];
Discipline classification code
0704;
Abstract
We present a model-free reinforcement learning (RL) predictive model with a supervised learning non-linear reconstructor for adaptive optics (AO) control with a pyramid wavefront sensor (P-WFS). First, we analyse the additional problems of training an RL control method with a P-WFS compared to the Shack-Hartmann WFS. From those observations, we propose our solution: a combination of model-free RL for prediction with a non-linear reconstructor based on neural networks with a U-net architecture. We test the proposed method in a simulation of closed-loop AO for an 8 m telescope equipped with a 32x32 P-WFS and observe that both the prediction and the non-linear reconstruction provide additional benefits over an optimised integrator.
Pages: 14