Adaptive optics control with multi-agent model-free reinforcement learning

被引:34
|
作者
Pou, B. [1 ,2 ]
Ferreira, F. [3 ]
Quinones, E. [1 ]
Gratadour, D. [3 ,4 ]
Martin, M. [2 ]
机构
[1] Barcelona Supercotnputing Ctr BSC, C Jordi Girona 29, Barcelona 08034, Spain
[2] Univ Politecn Catalunya UPC, Comp Sci Dept, C Jordi Girona 31, Barcelona 08034, Spain
[3] Univ Paris Diderot, Univ PSL, Sorbonne Paris Cite, Sorbonne Univ,CNRS,Observ Paris,LESIA, 5 Pl Jules Janssen, F-92195 Meudon, France
[4] Australian Natl Univ, Res Sch Astron & Astrophys, Canberra, ACT 2611, Australia
关键词
QUADRATIC GAUSSIAN CONTROL; WAVE-FRONT RECONSTRUCTION;
D O I
10.1364/OE.444099
中图分类号
O43 [光学];
学科分类号
070207 ; 0803 ;
摘要
We present a novel formulation of closed-loop adaptive optics (AO) control as a multi-agent reinforcement learning (MARL) problem in which the controller is able to learn a non-linear policy and does not need a priori information on the dynamics of the atmosphere. We identify the different challenges of applying a reinforcement learning (RL) method to AO and, to solve them, propose the combination of model-free MARL for control with an autoencoder neural network to mitigate the effect of noise. Moreover, we extend current existing methods of error budget analysis to include a RL controller. The experimental results for an 8m telescope equipped with a 40x40 Shack-Hartmann system show a significant increase in performance over the integrator baseline and comparable performance to a model-based predictive approach, a linear quadratic Gaussian controller with perfect knowledge of atmospheric conditions. Finally, the error budget analysis provides evidence that the RL controller is partially compensating for bandwidth error and is helping to mitigate the propagation of aliasing. (C) 2022 Optica Publishing Group under the terms of the Optica Open Access Publishing Agreement
引用
收藏
页码:2991 / 3015
页数:25
相关论文
共 50 条
  • [31] ADAPTIVE STATE REPRESENTATIONS FOR MULTI-AGENT REINFORCEMENT LEARNING
    De Hauwere, Yann-Michael
    Vrancx, Peter
    Nowe, Ann
    ICAART 2011: PROCEEDINGS OF THE 3RD INTERNATIONAL CONFERENCE ON AGENTS AND ARTIFICIAL INTELLIGENCE, VOL 2, 2011, : 181 - 189
  • [32] An adaptive clustering method for model-free reinforcement learning
    Matt, A
    Regensburger, G
    INMIC 2004: 8TH INTERNATIONAL MULTITOPIC CONFERENCE, PROCEEDINGS, 2004, : 362 - 367
  • [33] Adaptive Multi-Agent Deep Mixed Reinforcement Learning for Traffic Light Control
    Li, Lulu
    Zhu, Ruijie
    Wu, Shuning
    Ding, Wenting
    Xu, Mingliang
    Lu, Jiwen
    IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2024, 73 (02) : 1803 - 1816
  • [34] Model-Free Event-Triggered Containment Control of Multi-Agent Systems
    Yang, Yongliang
    Modares, Hamidreza
    Vamvoudakis, Kyriakos G.
    Yin, Yixin
    Wunsch, Donald C., II
    2018 ANNUAL AMERICAN CONTROL CONFERENCE (ACC), 2018, : 877 - 884
  • [35] Quantized model-free adaptive iterative learning bipartite consensus tracking for unknown nonlinear multi-agent systems
    Zhao, Huarong
    Peng, Li
    Yu, Hongnian
    APPLIED MATHEMATICS AND COMPUTATION, 2022, 412
  • [36] Cooperative output regulation of heterogeneous directed multi-agent systems:a fully distributed model-free reinforcement learning framework
    Xiongtao SHI
    Yanjie LI
    Chenglong DU
    Huiping LI
    Chaoyang CHEN
    Weihua GUI
    Science China(Information Sciences), 2025, 68 (02) : 170 - 185
  • [37] Cooperative output regulation of heterogeneous directed multi-agent systems: a fully distributed model-free reinforcement learning framework
    Shi, Xiongtao
    Li, Yanjie
    Du, Chenglong
    Li, Huiping
    Chen, Chaoyang
    Gui, Weihua
    SCIENCE CHINA-INFORMATION SCIENCES, 2025, 68 (02)
  • [38] Event-based model-free adaptive consensus control for multi-agent systems under intermittent attacks
    Xiong, Hongxing
    Chen, Guangdeng
    Ren, Hongru
    Li, Hongyi
    Lu, Renquan
    INTERNATIONAL JOURNAL OF SYSTEMS SCIENCE, 2024, 55 (10) : 2062 - 2076
  • [39] Fully distributed data-driven model-free adaptive control for consensus tracking in multi-agent systems
    Sahafi, Sayed Shahab Aldin
    Farsangi, Malihe Maghfoori
    ISA TRANSACTIONS, 2025, 158 : 122 - 129
  • [40] Model-free learning on robot kinematic chains using a nested multi-agent topology
    Karigiannis, John N.
    Tzafestas, Costas S.
    JOURNAL OF EXPERIMENTAL & THEORETICAL ARTIFICIAL INTELLIGENCE, 2016, 28 (06) : 913 - 954