Model-free Reinforcement Learning with a Non-linear Reconstructor for closed-loop Adaptive Optics control with a pyramid wavefront sensor

被引：7

作者：

Pou, B. ^{[1
,2
]}

Smith, J. ^{[3
]}

Quinones, E. ^{[1
]}

Martin, M. ^{[2
]}

Gratadour, D. ^{[4
]}

机构：

[1] Barcelona Supercomputing Ctr BSC, C Jordi Girona 29, Barcelona 08034, Spain

[2] Univ Politecn Catalunya UPC, Comp Sci Dept, C Jordi Girona 31, Barcelona 08034, Spain

[3] Australian Natl Univ, Sch Comp, Canberra, Australia

[4] Univ PSL, Sorbonne Univ, Univ Paris Diderot, CNRS,LESIA,Observ Paris, Sorbonne Paris Cite,5 Pl Jules Janssen, F-92195 Meudon, France

来源：

ADAPTIVE OPTICS SYSTEMS VIII | 2022年 / 12185卷

关键词：

Reinforcement Learning; AO Control; Machine Learning; Pyramid Wavefront Sensor; NEURAL-NETWORKS;

D O I：

10.1117/12.2627849

中图分类号：

P1 [天文学];

学科分类号：

0704 ;

摘要：

We present a model-free reinforcement learning (RL) predictive model with a supervised learning non-linear reconstructor for adaptive optics (AO) control with a pyramid wavefront sensor (P-WFS). First, we analyse the additional problems of training an RL control method with a P-WFS compared to the Shack-Hartmann WFS. From those observations, we propose our solution: a combination of model-free RL for prediction with a non-linear reconstructor based on neural networks with a U-net architecture. We test the proposed method in simulation of closed-loop AO for an 8m telescope equipped with a 32x32 P-WFS and observe that both the predictive and non-linear reconstruction add additional benefits over an optimised integrator.

引用

页数：14

共 19 条

[1] The pyramid wavefront sensor used in the closed-loop adaptive optics system
Wang, Shengqian
Wei, Kai
Zheng, Wenjia
Rao, Changhui
ADAPTIVE OPTICS SYSTEMS V, 2016, 9909
[2] Testing the pyramid wavefront sensor without modulation used in the closed-loop adaptive optics system
Wang, Shengqian
Rao, Changhui
Zhang, Ang
Zhang, Xuejun
Wei, Kai
Tian, Yu
Liao, Zhou
Zhang, Cheng
Xian, Hao
Zhang, Xiaojun
Wei, Ling
ADAPTIVE OPTICS SYSTEMS III, 2012, 8447
[3] Integrating supervised and reinforcement learning for predictive control with an unmodulated pyramid wavefront sensor for adaptive optics
Pou, Bartomeu
Smith, Jeffrey
Quinones, Eduardo
Martin, Mario
Gratadour, Damien
OPTICS EXPRESS, 2024, 32 (21): : 37011 - 37035
[4] Model-free closed-loop wind farm control using reinforcement learning with recursive least squares
Liew, Jaime
Gocmen, Tuhfe
Lio, Wai Hou
Larsen, Gunner Chr.
WIND ENERGY, 2024, 27 (11) : 1173 - 1187
[5] Reinforcement learning based closed-loop reference model adaptive flight control system design
Yuksek, Burak
Inalhan, Gokhan
INTERNATIONAL JOURNAL OF ADAPTIVE CONTROL AND SIGNAL PROCESSING, 2021, 35 (03) : 420 - 440
[6] Model-Based Reinforcement Learning for Closed-Loop Dynamic Control of Soft Robotic Manipulators
Thuruthel, Thomas George
Falotico, Egidio
Renda, Federico
Laschi, Cecilia
IEEE TRANSACTIONS ON ROBOTICS, 2019, 35 (01) : 124 - 134
[7] Adaptive phase shift control of thermoacoustic combustion instabilities using model-free reinforcement learning
Alhazmi, Khalid
Sarathy, S. Mani
COMBUSTION AND FLAME, 2023, 257
[8] Model-free adaptive control design for nonlinear discrete-time processes with reinforcement learning techniques
Liu, Dong
Yang, Guang-Hong
INTERNATIONAL JOURNAL OF SYSTEMS SCIENCE, 2018, 49 (11) : 2298 - 2308
[9] Data-Driven Model-Free Tracking Reinforcement Learning Control with VRFT-based Adaptive Actor-Critic
Radac, Mircea-Bogdan
Precup, Radu-Emil
APPLIED SCIENCES-BASEL, 2019, 9 (09):
[10] A Model-free Reinforcement Learning Approach for the Energetic Control of a Building with Non-stationary User Behaviour
Haddam, Nassim
Boulakia, Benjamin Cohen
Barth, Dominique
2020 THE 4TH INTERNATIONAL CONFERENCE ON SMART GRID AND SMART CITIES (ICSGSC 2020), 2020, : 168 - 177

← 1 2 →