Learning Intelligent Behavior in a Non-stationary and Partially Observable Environment

被引:0
|
作者
SelÇuk şenkul
Faruk Polat
机构
[1] Middle East Technical University,Computer Engineering Department
来源
Artificial Intelligence Review | 2002年 / 18卷
关键词
agent learning; multi-agent systems; Q-learning; reinforcement learning;
D O I
暂无
中图分类号
学科分类号
摘要
Individual learning in an environment where more than one agent exist is a chal-lengingtask. In this paper, a single learning agent situated in an environment where multipleagents exist is modeled based on reinforcement learning. The environment is non-stationaryand partially accessible from an agents' point of view. Therefore, learning activities of anagent is influenced by actions of other cooperative or competitive agents in the environment.A prey-hunter capture game that has the above characteristics is defined and experimentedto simulate the learning process of individual agents. Experimental results show that thereare no strict rules for reinforcement learning. We suggest two new methods to improve theperformance of agents. These methods decrease the number of states while keeping as muchstate as necessary.
引用
收藏
页码:97 / 115
页数:18
相关论文
共 50 条
  • [1] Learning intelligent behavior in a non-stationary and partially observable environment
    Senkul, S
    Polat, F
    ARTIFICIAL INTELLIGENCE REVIEW, 2002, 18 (02) : 97 - 115
  • [2] Context Detection and Identification In Multi-Agent Reinforcement Learning With Non-Stationary Environment
    Selamet, Ekrem Talha
    Tumer, Borahan
    2022 30TH SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE, SIU, 2022,
  • [3] Reinforcement learning algorithm for non-stationary environments
    Sindhu Padakandla
    Prabuchandran K. J.
    Shalabh Bhatnagar
    Applied Intelligence, 2020, 50 : 3590 - 3606
  • [4] Reinforcement learning algorithm for non-stationary environments
    Padakandla, Sindhu
    Prabuchandran, K. J.
    Bhatnagar, Shalabh
    APPLIED INTELLIGENCE, 2020, 50 (11) : 3590 - 3606
  • [5] Towards Reinforcement Learning for Non-stationary Environments
    Dal Toe, Sebastian Gregory
    Tiddeman, Bernard
    Mac Parthalain, Neil
    ADVANCES IN COMPUTATIONAL INTELLIGENCE SYSTEMS, UKCI 2023, 2024, 1453 : 41 - 52
  • [6] A Policy Search and Transfer Approach in the Non-stationary Environment
    Zhu F.
    Liu Q.
    Fu Q.-M.
    Chen D.-H.
    Wang H.
    Fu Y.-C.
    Tien Tzu Hsueh Pao/Acta Electronica Sinica, 2017, 45 (02): : 257 - 266
  • [7] Reinforcement learning with augmented states in partially expectation and action observable environment
    Guirnaldo, SA
    Watanabe, K
    Izumi, K
    Kiguchi, K
    SICE 2002: PROCEEDINGS OF THE 41ST SICE ANNUAL CONFERENCE, VOLS 1-5, 2002, : 823 - 828
  • [8] Navigation with memory in a partially observable environment
    Montesanto, A
    Tascini, G
    Puliti, P
    Baldassarri, P
    ROBOTICS AND AUTONOMOUS SYSTEMS, 2006, 54 (01) : 84 - 94
  • [9] Online Learning Bipartite Matching with Non-stationary Distributions
    Chen, Weirong
    Zheng, Jiaqi
    Yu, Haoyu
    Chen, Guihai
    Chen, Yixin
    Li, Dongsheng
    ACM TRANSACTIONS ON KNOWLEDGE DISCOVERY FROM DATA, 2022, 16 (05)
  • [10] Partially observable environment estimation with uplift inference for reinforcement learning based recommendation
    Shang, Wenjie
    Li, Qingyang
    Qin, Zhiwei
    Yu, Yang
    Meng, Yiping
    Ye, Jieping
    MACHINE LEARNING, 2021, 110 (09) : 2603 - 2640