Efficient Elitist Cooperative Evolutionary Algorithm for Multi-Objective Reinforcement Learning

Cited by: 5

Authors:

Zhou, Dan [1]

Du, Jiqing [1]

Arai, Sachiyo [1]

Affiliations:

[1] Chiba Univ, Grad Sch Sci & Engn, Dept Urban Environm Syst, Div Earth & Environm Sci, Chiba 2638522, Japan

Source:

IEEE ACCESS | 2023 / Vol. 11 / pp. 43128-43139

Keywords:

Pareto optimization; Statistics; Social factors; Underwater vehicles; Measurement; Q-learning; Uncertainty; Reinforcement learning; Multi-objective reinforcement learning; efficient; cooperative; Pareto front; elite archive; GENETIC ALGORITHM;

DOI:

10.1109/ACCESS.2023.3272115

Chinese Library Classification:

TP [Automation technology, computer technology];

Discipline classification code:

0812;

Abstract:

Sequential decision-making problems with multiple objectives are known as multi-objective reinforcement learning. In these scenarios, decision-makers require a complete Pareto front that consists of Pareto optimal solutions. Such a front enables decision-makers to understand the relationship between objectives and make informed decisions from a broad range of solutions. However, existing methods may be unable to search for solutions in concave regions of the Pareto front or lack global optimization ability, leading to incomplete Pareto fronts. To address this issue, we propose an efficient elitist cooperative evolutionary algorithm that maintains both an evolving population and an elite archive. The elite archive uses cooperative operations with various genetic operators to guide the evolving population, resulting in efficient searches for Pareto optimal solutions. The experimental results on submarine treasure hunting benchmarks demonstrate the effectiveness of the proposed method in solving various multi-objective reinforcement learning problems and providing decision-makers with a set of trade-off solutions between travel time and treasure amount, enabling them to make flexible and informed decisions based on their preferences. Therefore, the proposed method has the potential to be a useful tool for implementing real-world applications.
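The abstract does not give the algorithm's details, so the following is only a minimal sketch of the general idea of an elite archive that keeps non-dominated trade-offs between travel time (minimized) and treasure amount (maximized). The names `dominates` and `update_archive` and the sample points are hypothetical and are not taken from the paper; the cooperative genetic operators it describes are not modeled here.

```python
from typing import List, Tuple

# Hypothetical objective vector: (travel_time, treasure).
# Assumption: travel time is minimized and treasure is maximized.
Point = Tuple[float, float]


def dominates(a: Point, b: Point) -> bool:
    """True if a is no worse than b on both objectives and strictly better on one."""
    no_worse = a[0] <= b[0] and a[1] >= b[1]
    strictly_better = a[0] < b[0] or a[1] > b[1]
    return no_worse and strictly_better


def update_archive(archive: List[Point], candidate: Point) -> List[Point]:
    """Offer a candidate to the archive and return the resulting non-dominated set."""
    # Reject duplicates and candidates dominated by an existing member.
    if candidate in archive or any(dominates(p, candidate) for p in archive):
        return archive
    # Drop members the candidate dominates, then admit the candidate.
    return [p for p in archive if not dominates(candidate, p)] + [candidate]


# Illustrative (made-up) evaluations of candidate policies.
archive: List[Point] = []
for point in [(1, 1), (3, 2), (5, 3), (4, 1), (5, 8), (3, 2)]:
    archive = update_archive(archive, point)

print(sorted(archive))
```

In this sketch, `(4, 1)` is rejected because `(1, 1)` reaches the same treasure in less time, and `(5, 3)` is later evicted when `(5, 8)` arrives with more treasure for the same travel time.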


Pages: 43128-43139

Number of pages: 12

Related records (50 in total):

[1] An improved elitist strategy multi-objective evolutionary algorithm
Wang, Lu
Xiong, Sheng-Wu
Yang, Jie
Fan, Ji-Shan
PROCEEDINGS OF 2006 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-7, 2006, : 2315 - +
[2] Decomposition based Multi-Objective Evolutionary Algorithm in XCS for Multi-Objective Reinforcement Learning
Cheng, Xiu
Browne, Will N.
Zhang, Mengjie
2018 IEEE CONGRESS ON EVOLUTIONARY COMPUTATION (CEC), 2018, : 622 - 629
[3] A Study of the Multi-Objective Evolutionary Algorithm Based on Elitist Strategy
Chen WenBin
Liu YiJun
Wang Li
Liu XiaoLing
2009 ASIA-PACIFIC CONFERENCE ON INFORMATION PROCESSING (APCIP 2009), VOL 1, PROCEEDINGS, 2009, : 136 - 139
[4] Multi-strategy multi-objective differential evolutionary algorithm with reinforcement learning
Han, Yupeng
Peng, Hu
Mei, Changrong
Cao, Lianglin
Deng, Changshou
Wang, Hui
Wu, Zhijian
KNOWLEDGE-BASED SYSTEMS, 2023, 277
[5] Efficient Hybrid Multi-Objective Evolutionary Algorithm
Mohammed, Tareq Abed
Bayat, Oguz
Ucan, Osman N.
INTERNATIONAL JOURNAL OF COMPUTER SCIENCE AND NETWORK SECURITY, 2018, 18 (03): : 19 - 26
[6] Evolutionary Reinforcement Learning for Multi-objective SFC Deployment
Zhao, Jialiang
Wang, Ran
Wu, Qiang
Hao, Jie
Xiong, Zehui
2024 IEEE 21ST INTERNATIONAL CONFERENCE ON MOBILE AD-HOC AND SMART SYSTEMS, MASS 2024, 2024, : 212 - 218
[7] An Enhanced Multi-Objective Evolutionary Algorithm with Reinforcement Learning for Energy-Efficient Scheduling in the Flexible Job Shop
Shi, Jinfa
Liu, Wei
Yang, Jie
PROCESSES, 2024, 12 (09)
[8] A novel multi-state reinforcement learning-based multi-objective evolutionary algorithm
Wang, Jing
Zheng, Yuxin
Zhang, Ziyun
Peng, Hu
Wang, Hui
INFORMATION SCIENCES, 2025, 688
[9] An elitist cooperative evolutionary bi-level multi-objective decomposition-based algorithm for sustainable supply chain
Abbassi, Malek
Chaabani, Abir
Absi, Nabil
Ben Said, Lamjed
INTERNATIONAL JOURNAL OF PRODUCTION RESEARCH, 2022, 60 (23) : 7013 - 7032
[10] Cooperative two-engine multi-objective bee foraging algorithm with reinforcement learning
Ma, Lianbo
Cheng, Shi
Wang, Xingwei
Huang, Min
Shen, Hai
He, Xiaoxian
Shi, Yuhui
KNOWLEDGE-BASED SYSTEMS, 2017, 133 : 278 - 293
