Efficient Elitist Cooperative Evolutionary Algorithm for Multi-Objective Reinforcement Learning

被引：5

作者：

Zhou, Dan ^{[1
]}

Du, Jiqing ^{[1
]}

Arai, Sachiyo ^{[1
]}

机构：

[1] Chiba Univ, Grad Sch Sci & Engn, Dept Urban Environm Syst, Div Earth & Environm Sci, Chiba 2638522, Japan

来源：

IEEE ACCESS | 2023年 / 11卷 / 43128-43139期

基金：

日本学术振兴会;

关键词：

Pareto optimization; Statistics; Social factors; Underwater vehicles; Measurement; Q-learning; Uncertainty; Reinforcement learning; Multi-objective reinforcement learning; efficient; cooperative; Pareto front; elite archive; GENETIC ALGORITHM;

D O I：

10.1109/ACCESS.2023.3272115

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Sequential decision-making problems with multiple objectives are known as multi-objective reinforcement learning. In these scenarios, decision-makers require a complete Pareto front that consists of Pareto optimal solutions. Such a front enables decision-makers to understand the relationship between objectives and make informed decisions from a broad range of solutions. However, existing methods may be unable to search for solutions in concave regions of the Pareto front or lack global optimization ability, leading to incomplete Pareto fronts. To address this issue, we propose an efficient elitist cooperative evolutionary algorithm that maintains both an evolving population and an elite archive. The elite archive uses cooperative operations with various genetic operators to guide the evolving population, resulting in efficient searches for Pareto optimal solutions. The experimental results on submarine treasure hunting benchmarks demonstrate the effectiveness of the proposed method in solving various multi-objective reinforcement learning problems and providing decision-makers with a set of trade-off solutions between travel time and treasure amount, enabling them to make flexible and informed decisions based on their preferences. Therefore, the proposed method has the potential to be a useful tool for implementing real-world applications.

引用

页码：43128 / 43139

页数：12

共 50 条

[31] An Elitist Local Search Based Multi-objective Algorithm for Power Distribution System Reconfiguration
Leon Ibarra, Marco Antonio
Leonardo Guardado, Jose
Rivas-Davalos, Francisco
Torres Jimenez, Jacinto
Luis Naredo, Jose
ELECTRIC POWER COMPONENTS AND SYSTEMS, 2016, 44 (16) : 1839 - 1853
[32] Multi-Objective Quantum Evolutionary Algorithm for Discrete Multi-Objective Combinational Problem
Wei, Xin
Fujimura, Shigeru
INTERNATIONAL CONFERENCE ON TECHNOLOGIES AND APPLICATIONS OF ARTIFICIAL INTELLIGENCE (TAAI 2010), 2010, : 39 - 46
[33] Multi-Objective Service Composition Using Reinforcement Learning
Moustafa, Ahmed
Zhang, Minjie
SERVICE-ORIENTED COMPUTING, ICSOC 2013, 2013, 8274 : 298 - 312
[34] Application of elitist multi-objective genetic algorithm for classification rule generation
Dehuri, S.
Patnaik, S.
Ghosh, A.
Mall, R.
APPLIED SOFT COMPUTING, 2008, 8 (01) : 477 - 487
[35] Multi-objective Evolutionary Algorithm for Security Enhancement
Banu, R. Narmatha
Devaraj, D.
JOURNAL OF ELECTRICAL SYSTEMS, 2009, 5 (04)
[36] A Pareto Front grid guided multi-objective evolutionary algorithm
Xu, Ying
Zhang, Huan
Huang, Lei
Qu, Rong
Nojima, Yusuke
APPLIED SOFT COMPUTING, 2023, 136
[37] Multi-Objective Evolutionary Algorithm for PET Image Reconstruction: Concept
Abouhawwash, Mohamed
Alessio, Adam M.
IEEE TRANSACTIONS ON MEDICAL IMAGING, 2021, 40 (08) : 2142 - 2151
[38] A multi-objective deep reinforcement learning framework
Thanh Thi Nguyen
Ngoc Duy Nguyen
Vamplew, Peter
Nahavandi, Saeid
Dazeley, Richard
Lim, Chee Peng
ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2020, 96
[39] EMOCA: An Evolutionary Multi-Objective Crowding Algorithm
Rajagopalan, Ramesh
Mohan, Chilukuri
Mehrotra, Kishan
Varshney, Pramod
JOURNAL OF INTELLIGENT SYSTEMS, 2008, 17 (1-3) : 107 - 123
[40] A multi-objective evolutionary algorithm for examination timetabling
Cheong, C. Y.
Tan, K. C.
Veeravalli, B.
JOURNAL OF SCHEDULING, 2009, 12 (02) : 121 - 146

← 1 2 3 4 5 →