Reinforcement Learning for Partially Observable Linear Gaussian Systems Using Batch Dynamics of Noisy Observations

被引：2

作者：

Yaghmaie, Farnaz Adib ^{[1
]}

Modares, Hamidreza ^{[2
]}

Gustafsson, Fredrik ^{[1
]}

机构：

[1] Linkoping Univ, Fac Elect Engn, S-58183 Linkoping, Sweden

[2] Michigan State Univ, Coll Engn, E Lansing, MI 48824 USA

来源：

IEEE TRANSACTIONS ON AUTOMATIC CONTROL | 2024年 / 69卷 / 09期

基金：

瑞典研究理事会; 美国国家科学基金会;

关键词：

Costs; History; Noise; Dynamical systems; Noise measurement; Heuristic algorithms; Data models; Linear quadratic Gaussian; partiially observable dynamical systems; reinforcement learning;

D O I：

10.1109/TAC.2024.3385680

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Reinforcement learning algorithms are commonly used to control dynamical systems with measurable state variables. If the dynamical system is partially observable, reinforcement learning algorithms are modified to compensate for the effect of partial observability. One common approach is to feed a finite history of input-output data instead of the state variable. In this article, we study and quantify the effect of this approach in linear Gaussian systems with quadratic costs. We coin the concept of L-Extra-Sampled-dynamics to formalize the idea of using a finite history of input-output data instead of state and show that this approach increases the average cost.

引用

页码：6397 / 6404

页数：8

共 50 条

[41] Learning-based line impedance estimation for partially observable distribution systems
Zhu, Yanming
Xu, Xiaoyuan
Yan, Zheng
INTERNATIONAL JOURNAL OF ELECTRICAL POWER & ENERGY SYSTEMS, 2022, 137
[42] Linear Quadratic Gaussian using Kalman Network and Reinforcement Learning for Discrete-Time System
Putri, Adi Novitarini
Machbub, Carmadi
Mahayana, Dimitri
Hidayat, Egi M. Idris
2022 12TH INTERNATIONAL CONFERENCE ON SYSTEM ENGINEERING AND TECHNOLOGY (ICSET 2022), 2022, : 54 - 60
[43] Memory-driven deep-reinforcement learning for autonomous robot navigation in partially observable environments
Montero, Estrella
Pico, Nabih
Ghergherehchi, Mitra
Song, Ho Seung
ENGINEERING SCIENCE AND TECHNOLOGY-AN INTERNATIONAL JOURNAL-JESTECH, 2025, 62
[44] Gaussian Based Non-linear Function Approximation for Reinforcement Learning
Haider A.
Hawe G.
Wang H.
Scotney B.
SN Computer Science, 2021, 2 (3)
[45] Output regulation of unknown linear systems using average cost reinforcement learning
Yaghmaie, Farnaz Adib
Gunnarsson, Svante
Lewis, Frank L.
AUTOMATICA, 2019, 110
[46] Reinforcement Learning for Adaptive Optimal Stationary Control of Linear Stochastic Systems
Pang, Bo
Jiang, Zhong-Ping
IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2023, 68 (04) : 2383 - 2390
[47] Battery Energy Management in a Microgrid Using Batch Reinforcement Learning
Mbuwir, Brida V.
Ruelens, Frederik
Spiessens, Fred
Deconinck, Geert
ENERGIES, 2017, 10 (11):
[48] Guided Soft Actor Critic: A Guided Deep Reinforcement Learning Approach for Partially Observable Markov Decision Processes
Haklidir, Mehmet
Temeltas, Hakan
IEEE ACCESS, 2021, 9 : 159672 - 159683
[49] SchedInspector: A Batch Job Scheduling Inspector Using Reinforcement Learning
Zhang, Di
Dai, Dong
Xie, Bing
PROCEEDINGS OF THE 31ST INTERNATIONAL SYMPOSIUM ON HIGH-PERFORMANCE PARALLEL AND DISTRIBUTED COMPUTING, HPDC 2022, 2022, : 97 - 109
[50] Model-free reinforcement learning for motion planning of autonomous agents with complex tasks in partially observable environments
Li, Junchao
Cai, Mingyu
Kan, Zhen
Xiao, Shaoping
AUTONOMOUS AGENTS AND MULTI-AGENT SYSTEMS, 2024, 38 (01)

← 1 2 3 4 5 →