Reinforcement Learning for Partially Observable Linear Gaussian Systems Using Batch Dynamics of Noisy Observations

被引:2
|
作者
Yaghmaie, Farnaz Adib [1 ]
Modares, Hamidreza [2 ]
Gustafsson, Fredrik [1 ]
机构
[1] Linkoping Univ, Fac Elect Engn, S-58183 Linkoping, Sweden
[2] Michigan State Univ, Coll Engn, E Lansing, MI 48824 USA
基金
瑞典研究理事会; 美国国家科学基金会;
关键词
Costs; History; Noise; Dynamical systems; Noise measurement; Heuristic algorithms; Data models; Linear quadratic Gaussian; partiially observable dynamical systems; reinforcement learning;
D O I
10.1109/TAC.2024.3385680
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Reinforcement learning algorithms are commonly used to control dynamical systems with measurable state variables. If the dynamical system is partially observable, reinforcement learning algorithms are modified to compensate for the effect of partial observability. One common approach is to feed a finite history of input-output data instead of the state variable. In this article, we study and quantify the effect of this approach in linear Gaussian systems with quadratic costs. We coin the concept of L-Extra-Sampled-dynamics to formalize the idea of using a finite history of input-output data instead of state and show that this approach increases the average cost.
引用
收藏
页码:6397 / 6404
页数:8
相关论文
共 50 条
  • [41] Learning-based line impedance estimation for partially observable distribution systems
    Zhu, Yanming
    Xu, Xiaoyuan
    Yan, Zheng
    INTERNATIONAL JOURNAL OF ELECTRICAL POWER & ENERGY SYSTEMS, 2022, 137
  • [42] Linear Quadratic Gaussian using Kalman Network and Reinforcement Learning for Discrete-Time System
    Putri, Adi Novitarini
    Machbub, Carmadi
    Mahayana, Dimitri
    Hidayat, Egi M. Idris
    2022 12TH INTERNATIONAL CONFERENCE ON SYSTEM ENGINEERING AND TECHNOLOGY (ICSET 2022), 2022, : 54 - 60
  • [43] Memory-driven deep-reinforcement learning for autonomous robot navigation in partially observable environments
    Montero, Estrella
    Pico, Nabih
    Ghergherehchi, Mitra
    Song, Ho Seung
    ENGINEERING SCIENCE AND TECHNOLOGY-AN INTERNATIONAL JOURNAL-JESTECH, 2025, 62
  • [44] Gaussian Based Non-linear Function Approximation for Reinforcement Learning
    Haider A.
    Hawe G.
    Wang H.
    Scotney B.
    SN Computer Science, 2021, 2 (3)
  • [45] Output regulation of unknown linear systems using average cost reinforcement learning
    Yaghmaie, Farnaz Adib
    Gunnarsson, Svante
    Lewis, Frank L.
    AUTOMATICA, 2019, 110
  • [46] Reinforcement Learning for Adaptive Optimal Stationary Control of Linear Stochastic Systems
    Pang, Bo
    Jiang, Zhong-Ping
    IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2023, 68 (04) : 2383 - 2390
  • [47] Battery Energy Management in a Microgrid Using Batch Reinforcement Learning
    Mbuwir, Brida V.
    Ruelens, Frederik
    Spiessens, Fred
    Deconinck, Geert
    ENERGIES, 2017, 10 (11):
  • [48] Guided Soft Actor Critic: A Guided Deep Reinforcement Learning Approach for Partially Observable Markov Decision Processes
    Haklidir, Mehmet
    Temeltas, Hakan
    IEEE ACCESS, 2021, 9 : 159672 - 159683
  • [49] SchedInspector: A Batch Job Scheduling Inspector Using Reinforcement Learning
    Zhang, Di
    Dai, Dong
    Xie, Bing
    PROCEEDINGS OF THE 31ST INTERNATIONAL SYMPOSIUM ON HIGH-PERFORMANCE PARALLEL AND DISTRIBUTED COMPUTING, HPDC 2022, 2022, : 97 - 109
  • [50] Model-free reinforcement learning for motion planning of autonomous agents with complex tasks in partially observable environments
    Li, Junchao
    Cai, Mingyu
    Kan, Zhen
    Xiao, Shaoping
    AUTONOMOUS AGENTS AND MULTI-AGENT SYSTEMS, 2024, 38 (01)