Learning Intelligent Behavior in a Non-stationary and Partially Observable Environment

被引：0

作者：

SelÇuk şenkul

Faruk Polat

机构：

[1] Middle East Technical University,Computer Engineering Department

来源：

Artificial Intelligence Review | 2002年 / 18卷

关键词：

agent learning; multi-agent systems; Q-learning; reinforcement learning;

D O I：

暂无

中图分类号：

学科分类号：

摘要：

Individual learning in an environment where more than one agent exist is a chal-lengingtask. In this paper, a single learning agent situated in an environment where multipleagents exist is modeled based on reinforcement learning. The environment is non-stationaryand partially accessible from an agents' point of view. Therefore, learning activities of anagent is influenced by actions of other cooperative or competitive agents in the environment.A prey-hunter capture game that has the above characteristics is defined and experimentedto simulate the learning process of individual agents. Experimental results show that thereare no strict rules for reinforcement learning. We suggest two new methods to improve theperformance of agents. These methods decrease the number of states while keeping as muchstate as necessary.

引用

页码：97 / 115

页数：18

共 50 条

[1] Learning intelligent behavior in a non-stationary and partially observable environment
Senkul, S
Polat, F
ARTIFICIAL INTELLIGENCE REVIEW, 2002, 18 (02) : 97 - 115
[2] Context Detection and Identification In Multi-Agent Reinforcement Learning With Non-Stationary Environment
Selamet, Ekrem Talha
Tumer, Borahan
2022 30TH SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE, SIU, 2022,
[3] Reinforcement learning algorithm for non-stationary environments
Sindhu Padakandla
Prabuchandran K. J.
Shalabh Bhatnagar
Applied Intelligence, 2020, 50 : 3590 - 3606
[4] Reinforcement learning algorithm for non-stationary environments
Padakandla, Sindhu
Prabuchandran, K. J.
Bhatnagar, Shalabh
APPLIED INTELLIGENCE, 2020, 50 (11) : 3590 - 3606
[5] Towards Reinforcement Learning for Non-stationary Environments
Dal Toe, Sebastian Gregory
Tiddeman, Bernard
Mac Parthalain, Neil
ADVANCES IN COMPUTATIONAL INTELLIGENCE SYSTEMS, UKCI 2023, 2024, 1453 : 41 - 52
[6] A Policy Search and Transfer Approach in the Non-stationary Environment
Zhu F.
Liu Q.
Fu Q.-M.
Chen D.-H.
Wang H.
Fu Y.-C.
Tien Tzu Hsueh Pao/Acta Electronica Sinica, 2017, 45 (02): : 257 - 266
[7] Reinforcement learning with augmented states in partially expectation and action observable environment
Guirnaldo, SA
Watanabe, K
Izumi, K
Kiguchi, K
SICE 2002: PROCEEDINGS OF THE 41ST SICE ANNUAL CONFERENCE, VOLS 1-5, 2002, : 823 - 828
[8] Navigation with memory in a partially observable environment
Montesanto, A
Tascini, G
Puliti, P
Baldassarri, P
ROBOTICS AND AUTONOMOUS SYSTEMS, 2006, 54 (01) : 84 - 94
[9] Online Learning Bipartite Matching with Non-stationary Distributions
Chen, Weirong
Zheng, Jiaqi
Yu, Haoyu
Chen, Guihai
Chen, Yixin
Li, Dongsheng
ACM TRANSACTIONS ON KNOWLEDGE DISCOVERY FROM DATA, 2022, 16 (05)
[10] Partially observable environment estimation with uplift inference for reinforcement learning based recommendation
Shang, Wenjie
Li, Qingyang
Qin, Zhiwei
Yu, Yang
Meng, Yiping
Ye, Jieping
MACHINE LEARNING, 2021, 110 (09) : 2603 - 2640

← 1 2 3 4 5 →