Reinforcement Learning Based Whale Optimizer

被引：7

作者：

Becerra-Rozas, Marcelo ^{[1
]}

Lemus-Romani, Jose ^{[4
]}

Crawford, Broderick ^{[1
]}

Soto, Ricardo ^{[1
]}

Cisternas-Caneo, Felipe ^{[1
]}

Embry, Andres Trujillo ^{[1
]}

Molina, Maximo Arnao ^{[1
]}

Tapia, Diego ^{[1
]}

Castillo, Mauricio ^{[1
]}

Misra, Sanjay ^{[2
]}

Rubio, Jose-Miguel ^{[3
]}

机构：

[1] Pontificia Univ Catolica Valparaiso, Valparaiso, Chile

[2] Covenant Univ, Ota, Nigeria

[3] Univ Bernardo OHiggins, Santiago, Chile

[4] Pontificia Univ Catolica Chile, Sch Civil Construct, Santiago, Chile

来源：

COMPUTATIONAL SCIENCE AND ITS APPLICATIONS, ICCSA 2021, PT IX | 2021年 / 12957卷

关键词：

Metaheuristic; SARSA; Q-Learning; Swarm intelligence; Whale optimization algorithm; Combinatorial optimization;

D O I：

10.1007/978-3-030-87013-3_16

中图分类号：

TP39 [计算机的应用];

学科分类号：

081203 ; 0835 ;

摘要：

This work proposes a Reinforcement Learning based optimizer integrating SARSA and Whale Optimization Algorithm. SARSA determines the binarization operator required during the metaheuristic process. The hybrid instance is applied to solve benchmarks of the Set Covering Problem and it is compared with a Q-learning version, showing good results in terms of fitness, specifically, SARSA beats its Q-Learning version in 44 out of 45 instances evaluated. It is worth mentioning that the only instance where it does not win is a tie. Finally, thanks to graphs presented in our results analysis we can observe that not only does it obtain good results, it also obtains a correct exploration and exploitation balance as presented in the referenced literature.

引用

页码：205 / 219

页数：15

共 50 条

[31] Algorithmic Foundations of Reinforcement Learning [J].

Pareigis, Stephan .

ADVANCES IN REAL-TIME AND AUTONOMOUS SYSTEMS, 2023, 2024, 1009 :1-27

[32] A Reinforcement Learning Based Robotic Navigation System [J].

Zuo, Bashan ;

Chen, Jiaxin ;

Wang, Larry ;

Wang, Ying .

2014 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN AND CYBERNETICS (SMC), 2014, :3452-3457

[33] A dynamic checkpointing scheme based on reinforcement learning [J].

Okamura, H ;

Nishimura, Y ;

Dohi, T .

10TH IEEE PACIFIC RIM INTERNATIONAL SYMPOSIUM ON DEPENDABLE COMPUTING, PROCEEDINGS, 2004, :151-158

[34] A Reinforcement Learning based Edge Cloud Collaboration [J].

Kobari, Hiroki ;

Du, Zhaoyang ;

Wu, Celimuge ;

Yoshinaga, Tsutomu ;

Bao, Wugedele .

2021 INTERNATIONAL CONFERENCE ON INFORMATION AND COMMUNICATION TECHNOLOGIES FOR DISASTER MANAGEMENT (ICT-DM), 2021, :26-29

[35] Topical Crawler Technology Based on Reinforcement Learning [J].

Wang Youzeng ;

Wang Jinbao .

PROCEEDINGS OF 2009 CONFERENCE ON COMMUNICATION FACULTY, 2009, :607-+

[36] The Advance of Reinforcement Learning and Deep Reinforcement Learning [J].

Lyu, Le ;

Shen, Yang ;

Zhang, Sicheng .

2022 IEEE INTERNATIONAL CONFERENCE ON ELECTRICAL ENGINEERING, BIG DATA AND ALGORITHMS (EEBDA), 2022, :644-648

[37] Reinforcement Learning Based Stochastic Shortest Path Finding in Wireless Sensor Networks [J].

Xia, Wenwen ;

Di, Chong ;

Guo, Haonan ;

Li, Shenghong .

IEEE ACCESS, 2019, 7 :157807-157817

[38] An adaptive search strategy combination algorithm based on reinforcement learning and neighborhood search [J].

Liu, Xiaotong ;

Xu, Ying ;

Wang, Tianlei ;

Zeng, Zhiqiang ;

Zhou, Zhiheng ;

Zhai, Yikui .

JOURNAL OF COMPUTATIONAL DESIGN AND ENGINEERING, 2025, 12 (02) :177-217

[39] A reinforcement learning-based metaheuristic algorithm for solving global optimization problems [J].

Seyyedabbasi, Amir .

ADVANCES IN ENGINEERING SOFTWARE, 2023, 178

[40] Whale Optimization Algorithm Based on Lamarckian Learning for Global Optimization Problems [J].

Zhang, Qiang ;

Liu, Lijie .

IEEE ACCESS, 2019, 7 :36642-36666

← 1 2 3 4 5 →