Efficient Elitist Cooperative Evolutionary Algorithm for Multi-Objective Reinforcement Learning

被引:5
作者
Zhou, Dan [1 ]
Du, Jiqing [1 ]
Arai, Sachiyo [1 ]
机构
[1] Chiba Univ, Grad Sch Sci & Engn, Dept Urban Environm Syst, Div Earth & Environm Sci, Chiba 2638522, Japan
基金
日本学术振兴会;
关键词
Pareto optimization; Statistics; Social factors; Underwater vehicles; Measurement; Q-learning; Uncertainty; Reinforcement learning; Multi-objective reinforcement learning; efficient; cooperative; Pareto front; elite archive; GENETIC ALGORITHM;
D O I
10.1109/ACCESS.2023.3272115
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Sequential decision-making problems with multiple objectives are known as multi-objective reinforcement learning. In these scenarios, decision-makers require a complete Pareto front that consists of Pareto optimal solutions. Such a front enables decision-makers to understand the relationship between objectives and make informed decisions from a broad range of solutions. However, existing methods may be unable to search for solutions in concave regions of the Pareto front or lack global optimization ability, leading to incomplete Pareto fronts. To address this issue, we propose an efficient elitist cooperative evolutionary algorithm that maintains both an evolving population and an elite archive. The elite archive uses cooperative operations with various genetic operators to guide the evolving population, resulting in efficient searches for Pareto optimal solutions. The experimental results on submarine treasure hunting benchmarks demonstrate the effectiveness of the proposed method in solving various multi-objective reinforcement learning problems and providing decision-makers with a set of trade-off solutions between travel time and treasure amount, enabling them to make flexible and informed decisions based on their preferences. Therefore, the proposed method has the potential to be a useful tool for implementing real-world applications.
引用
收藏
页码:43128 / 43139
页数:12
相关论文
共 50 条
  • [31] An Elitist Local Search Based Multi-objective Algorithm for Power Distribution System Reconfiguration
    Leon Ibarra, Marco Antonio
    Leonardo Guardado, Jose
    Rivas-Davalos, Francisco
    Torres Jimenez, Jacinto
    Luis Naredo, Jose
    ELECTRIC POWER COMPONENTS AND SYSTEMS, 2016, 44 (16) : 1839 - 1853
  • [32] Multi-Objective Quantum Evolutionary Algorithm for Discrete Multi-Objective Combinational Problem
    Wei, Xin
    Fujimura, Shigeru
    INTERNATIONAL CONFERENCE ON TECHNOLOGIES AND APPLICATIONS OF ARTIFICIAL INTELLIGENCE (TAAI 2010), 2010, : 39 - 46
  • [33] Multi-Objective Service Composition Using Reinforcement Learning
    Moustafa, Ahmed
    Zhang, Minjie
    SERVICE-ORIENTED COMPUTING, ICSOC 2013, 2013, 8274 : 298 - 312
  • [34] Application of elitist multi-objective genetic algorithm for classification rule generation
    Dehuri, S.
    Patnaik, S.
    Ghosh, A.
    Mall, R.
    APPLIED SOFT COMPUTING, 2008, 8 (01) : 477 - 487
  • [35] Multi-objective Evolutionary Algorithm for Security Enhancement
    Banu, R. Narmatha
    Devaraj, D.
    JOURNAL OF ELECTRICAL SYSTEMS, 2009, 5 (04)
  • [36] A Pareto Front grid guided multi-objective evolutionary algorithm
    Xu, Ying
    Zhang, Huan
    Huang, Lei
    Qu, Rong
    Nojima, Yusuke
    APPLIED SOFT COMPUTING, 2023, 136
  • [37] Multi-Objective Evolutionary Algorithm for PET Image Reconstruction: Concept
    Abouhawwash, Mohamed
    Alessio, Adam M.
    IEEE TRANSACTIONS ON MEDICAL IMAGING, 2021, 40 (08) : 2142 - 2151
  • [38] A multi-objective deep reinforcement learning framework
    Thanh Thi Nguyen
    Ngoc Duy Nguyen
    Vamplew, Peter
    Nahavandi, Saeid
    Dazeley, Richard
    Lim, Chee Peng
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2020, 96
  • [39] EMOCA: An Evolutionary Multi-Objective Crowding Algorithm
    Rajagopalan, Ramesh
    Mohan, Chilukuri
    Mehrotra, Kishan
    Varshney, Pramod
    JOURNAL OF INTELLIGENT SYSTEMS, 2008, 17 (1-3) : 107 - 123
  • [40] A multi-objective evolutionary algorithm for examination timetabling
    Cheong, C. Y.
    Tan, K. C.
    Veeravalli, B.
    JOURNAL OF SCHEDULING, 2009, 12 (02) : 121 - 146