Metaheuristics-based Exploration Strategies for Multi-Objective Reinforcement Learning

Cited by: 1
Authors
Felten, Florian [1]
Danoy, Gregoire [1, 2]
Talbi, El-Ghazali [3]
Bouvry, Pascal [1, 2]
Affiliations
[1] Univ Luxembourg, SnT, Esch Sur Alzette, Luxembourg
[2] Univ Luxembourg, FSTM DCS, Esch Sur Alzette, Luxembourg
[3] Univ Lille, Inria Lille, CNRS CRIStAL, Lille, France
Keywords
Reinforcement Learning; Multi-objective; Metaheuristics; Pareto Sets
DOI
10.5220/0010989100003116
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
The fields of Reinforcement Learning (RL) and Optimization both aim at finding an optimal solution to a problem characterized by an objective function. The exploration-exploitation dilemma (EED) is a well-known subject in both fields: a substantial body of literature addresses it and shows that handling it well is important for achieving good performance. Yet many real-life problems involve the optimization of multiple objectives. Multi-Policy Multi-Objective Reinforcement Learning (MPMORL) offers a way to learn various optimised behaviours for the agent in such problems. This work introduces a modular framework for the learning phase of such algorithms, which eases the study of the EED in Inner-Loop MPMORL algorithms. We present three new exploration strategies inspired by the metaheuristics domain. To assess the performance of our methods on various environments, we use a classical benchmark, the Deep Sea Treasure (DST), and propose a harder version of it. Our experiments show that all of the proposed strategies outperform the current state-of-the-art ε-greedy-based methods on the studied benchmarks.
Pages: 662-673
Page count: 12