Learning Intelligent Behavior in a Non-stationary and Partially Observable Environment

被引:0
|
作者
SelÇuk şenkul
Faruk Polat
机构
[1] Middle East Technical University,Computer Engineering Department
来源
Artificial Intelligence Review | 2002年 / 18卷
关键词
agent learning; multi-agent systems; Q-learning; reinforcement learning;
D O I
暂无
中图分类号
学科分类号
摘要
Individual learning in an environment where more than one agent exist is a chal-lengingtask. In this paper, a single learning agent situated in an environment where multipleagents exist is modeled based on reinforcement learning. The environment is non-stationaryand partially accessible from an agents' point of view. Therefore, learning activities of anagent is influenced by actions of other cooperative or competitive agents in the environment.A prey-hunter capture game that has the above characteristics is defined and experimentedto simulate the learning process of individual agents. Experimental results show that thereare no strict rules for reinforcement learning. We suggest two new methods to improve theperformance of agents. These methods decrease the number of states while keeping as muchstate as necessary.
引用
收藏
页码:97 / 115
页数:18
相关论文
共 50 条
  • [21] ENHANCED DEEP REINFORCEMENT LEARNING FOR PARCEL SINGULATION IN NON-STATIONARY ENVIRONMENTS
    Shen, Jiwei
    Lu, Hu
    Zhang, Hao
    Lyu, Shujing
    Lu, Yue
    2024 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, ICASSP 2024, 2024, : 86 - 90
  • [22] Learning partially observable deterministic action models
    Amir, Eyal
    Chang, Allen
    Journal of Artificial Intelligence Research, 2008, 33 : 349 - 402
  • [23] Ad Hoc Teamwork in the Presence of Non-stationary Teammates
    Santos, Pedro M.
    Ribeiro, Joao G.
    Sardinha, Alberto
    Melo, Francisco S.
    PROGRESS IN ARTIFICIAL INTELLIGENCE (EPIA 2021), 2021, 12981 : 648 - 660
  • [24] Learning reward machines: A study in partially observable reinforcement learning 
    Icarte, Rodrigo Toro
    Klassen, Toryn Q.
    Valenzano, Richard
    Castro, Margarita P.
    Waldie, Ethan
    Mcilraith, Sheila A.
    ARTIFICIAL INTELLIGENCE, 2023, 323
  • [25] Predictive reinforcement learning in non-stationary environments using weighted mixture policy
    Pourshamsaei, Hossein
    Nobakhti, Amin
    APPLIED SOFT COMPUTING, 2024, 153
  • [26] Traffic Scheduling in Non-Stationary Multipath Non-Terrestrial Networks: A Reinforcement Learning Approach
    Machumilane, Achilles
    Gotta, Alberto
    Cassara, Pietro
    Gennaro, Claudio
    Amato, Giuseppe
    ICC 2023-IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS, 2023, : 4094 - 4099
  • [27] Learning to Act Optimally in Partially Observable Multiagent Settings
    Ceren, Roi
    AAMAS'16: PROCEEDINGS OF THE 2016 INTERNATIONAL CONFERENCE ON AUTONOMOUS AGENTS & MULTIAGENT SYSTEMS, 2016, : 1532 - 1533
  • [28] Partially Observable Reinforcement Learning for Sustainable Active Surveillance
    Chen, Hechang
    Yang, Bo
    Liu, Jiming
    KNOWLEDGE SCIENCE, ENGINEERING AND MANAGEMENT, KSEM 2018, PT II, 2018, 11062 : 425 - 437
  • [29] P-MARL: Prediction-Based Multi-Agent Reinforcement Learning for Non-Stationary Environments
    Marinescu, Andrei
    Dusparic, Ivana
    Taylor, Adam
    Cahill, Vinny
    Clarke, Siobhan
    PROCEEDINGS OF THE 2015 INTERNATIONAL CONFERENCE ON AUTONOMOUS AGENTS & MULTIAGENT SYSTEMS (AAMAS'15), 2015, : 1897 - 1898
  • [30] Multi-Armed Bandit Learning in IoT Networks: Learning Helps Even in Non-stationary Settings
    Bonnefoi, Remi
    Besson, Lilian
    Moy, Christophe
    Kaufmann, Emilie
    Palicot, Jacques
    COGNITIVE RADIO ORIENTED WIRELESS NETWORKS, 2018, 228 : 173 - 185