Learning Intelligent Behavior in a Non-stationary and Partially Observable Environment

Cited by: 0

Authors:

Selçuk Şenkul

Faruk Polat

Affiliation:

[1] Middle East Technical University, Computer Engineering Department

Source:

Artificial Intelligence Review | 2002 / Vol. 18

Keywords:

agent learning; multi-agent systems; Q-learning; reinforcement learning

DOI:

Not available

Chinese Library Classification (CLC) number:

Subject classification number:

Abstract:

Individual learning in an environment where more than one agent exists is a challenging task. In this paper, a single learning agent situated in an environment where multiple agents exist is modeled based on reinforcement learning. The environment is non-stationary and partially accessible from an agent's point of view. Therefore, the learning activities of an agent are influenced by the actions of other cooperative or competitive agents in the environment. A prey-hunter capture game that has the above characteristics is defined and used in experiments to simulate the learning process of individual agents. Experimental results show that there are no strict rules for reinforcement learning. We suggest two new methods to improve the performance of agents. These methods decrease the number of states while keeping as much state as necessary.

Pages: 97-115

Page count: 18
