Learning Intelligent Behavior in a Non-stationary and Partially Observable Environment

被引：0

作者：

SelÇuk şenkul

Faruk Polat

机构：

[1] Middle East Technical University,Computer Engineering Department

来源：

Artificial Intelligence Review | 2002年 / 18卷

关键词：

agent learning; multi-agent systems; Q-learning; reinforcement learning;

D O I：

暂无

中图分类号：

学科分类号：

摘要：

Individual learning in an environment where more than one agent exist is a chal-lengingtask. In this paper, a single learning agent situated in an environment where multipleagents exist is modeled based on reinforcement learning. The environment is non-stationaryand partially accessible from an agents' point of view. Therefore, learning activities of anagent is influenced by actions of other cooperative or competitive agents in the environment.A prey-hunter capture game that has the above characteristics is defined and experimentedto simulate the learning process of individual agents. Experimental results show that thereare no strict rules for reinforcement learning. We suggest two new methods to improve theperformance of agents. These methods decrease the number of states while keeping as muchstate as necessary.

引用

页码：97 / 115

页数：18

共 50 条

[41] CHQ: A multi-agent reinforcement learning scheme for partially observable Markov decision processes
Osada, H
Fujita, S
IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2005, E88D (05): : 1004 - 1011
[42] Multi-Agent Combat in Non-Stationary Environments
Li, Shengang
Chi, Haoang
Xie, Tao
2021 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2021,
[43] Disturbance Observable Reinforcement Learning that Compensates for Changes in Environment
Kim, SeongIn
Shibuya, Takeshi
2022 61ST ANNUAL CONFERENCE OF THE SOCIETY OF INSTRUMENT AND CONTROL ENGINEERS (SICE), 2022, : 141 - 145
[44] PARTIALLY OBSERVABLE MODEL-BASED LEARNING FOR ISAC RESOURCE ALLOCATION
Pulkkinee, Petteri
Koivunen, Visa
2024 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2024), 2024, : 12996 - 13000
[45] Partially Observable Reinforcement Learning for Dialog-based Interactive Recommendation
Wu, Yaxiong
Macdonald, Craig
Ounis, Iadh
15TH ACM CONFERENCE ON RECOMMENDER SYSTEMS (RECSYS 2021), 2021, : 241 - 251
[46] Modeling and reinforcement learning in partially observable many-agent systems
He, Keyang
Doshi, Prashant
Banerjee, Bikramjit
AUTONOMOUS AGENTS AND MULTI-AGENT SYSTEMS, 2024, 38 (01)
[47] A Bayesian Approach for Learning and Planning in Partially Observable Markov Decision Processes
Ross, Stephane
Pineau, Joelle
Chaib-draa, Brahim
Kreitmann, Pierre
JOURNAL OF MACHINE LEARNING RESEARCH, 2011, 12 : 1729 - 1770
[48] Policy Reuse for Learning and Planning in Partially Observable Markov Decision Processes
Wu, Bo
Feng, Yanpeng
2017 4TH INTERNATIONAL CONFERENCE ON INFORMATION SCIENCE AND CONTROL ENGINEERING (ICISCE), 2017, : 549 - 552
[49] Multi-task Reinforcement Learning in Partially Observable Stochastic Environments
Li, Hui
Liao, Xuejun
Carin, Lawrence
JOURNAL OF MACHINE LEARNING RESEARCH, 2009, 10 : 1131 - 1186
[50] Learning-Based Traffic Scheduling in Non-Stationary Multipath 5G Non-Terrestrial Networks
Machumilane, Achilles
Gotta, Alberto
Cassara, Pietro
Amato, Giuseppe
Gennaro, Claudio
REMOTE SENSING, 2023, 15 (07)

← 1 2 3 4 5 →