Reinforcement Learning Based Whale Optimizer

被引：6

作者：

Becerra-Rozas, Marcelo ^{[1
]}

Lemus-Romani, Jose ^{[4
]}

Crawford, Broderick ^{[1
]}

Soto, Ricardo ^{[1
]}

Cisternas-Caneo, Felipe ^{[1
]}

Embry, Andres Trujillo ^{[1
]}

Molina, Maximo Arnao ^{[1
]}

Tapia, Diego ^{[1
]}

Castillo, Mauricio ^{[1
]}

Misra, Sanjay ^{[2
]}

Rubio, Jose-Miguel ^{[3
]}

机构：

[1] Pontificia Univ Catolica Valparaiso, Valparaiso, Chile

[2] Covenant Univ, Ota, Nigeria

[3] Univ Bernardo OHiggins, Santiago, Chile

[4] Pontificia Univ Catolica Chile, Sch Civil Construct, Santiago, Chile

来源：

COMPUTATIONAL SCIENCE AND ITS APPLICATIONS, ICCSA 2021, PT IX | 2021年 / 12957卷

关键词：

Metaheuristic; SARSA; Q-Learning; Swarm intelligence; Whale optimization algorithm; Combinatorial optimization;

D O I：

10.1007/978-3-030-87013-3_16

中图分类号：

TP39 [计算机的应用];

学科分类号：

081203 ; 0835 ;

摘要：

This work proposes a Reinforcement Learning based optimizer integrating SARSA and Whale Optimization Algorithm. SARSA determines the binarization operator required during the metaheuristic process. The hybrid instance is applied to solve benchmarks of the Set Covering Problem and it is compared with a Q-learning version, showing good results in terms of fitness, specifically, SARSA beats its Q-Learning version in 44 out of 45 instances evaluated. It is worth mentioning that the only instance where it does not win is a tie. Finally, thanks to graphs presented in our results analysis we can observe that not only does it obtain good results, it also obtains a correct exploration and exploitation balance as presented in the referenced literature.

引用

页码：205 / 219

页数：15

共 50 条

[21] Whale Optimization based Deep Residual Learning Network for Early Rice Disease Prediction in IoT
Lakshmi, M. Sri
Kashyap, K. Jayadwaja
Khan, S. Mohammed Fazal
Reddy, N. Jaya Satya Vratha
Achari, V. Bharath Kumar
EAI ENDORSED TRANSACTIONS ON SCALABLE INFORMATION SYSTEMS, 2023, 10 (06)
[22] SeaRank: relevance prediction based on click models in a reinforcement learning framework
Keyhanipour, Amir Hosein
Oroumchian, Farhad
DATA TECHNOLOGIES AND APPLICATIONS, 2023, 57 (04) : 465 - 488
[23] Transition Based Discount Factor for Model Free Algorithms in Reinforcement Learning
Sharma, Abhinav
Gupta, Ruchir
Lakshmanan, K.
Gupta, Atul
SYMMETRY-BASEL, 2021, 13 (07):
[24] Local instance-based transfer learning for reinforcement learning
Li, Xiaoguang
Ji, Wanting
Huang, Jidong
ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2024, 133
[25] Differential Evolution Based Particle Swarm Optimizer for Neural Network Learning
Ning, Dongfang
Zhang, Weiguo
Li, Bin
2008 7TH WORLD CONGRESS ON INTELLIGENT CONTROL AND AUTOMATION, VOLS 1-23, 2008, : 4444 - 4447
[26] Efficient Ranking-Based Whale Optimizer for Parameter Extraction of Three-Diode Photovoltaic Model: Analysis and Validations
Abdel-Basset, Mohamed
Mohamed, Reda
El-Fergany, Attia
Askar, Sameh S.
Abouhawwash, Mohamed
ENERGIES, 2021, 14 (13)
[27] Intradialytic hypotension prediction using covariance matrix-driven whale optimizer with orthogonal structure-assisted extreme learning machine
Li, Yupeng
Zhao, Dong
Liu, Guangjie
Liu, Yi
Bano, Yasmeen
Ibrohimov, Alisherjon
Chen, Huiling
Wu, Chengwen
Chen, Xumin
FRONTIERS IN NEUROINFORMATICS, 2022, 16
[28] Review of reinforcement learning research
Jia, Jingkai
Wang, Wenlin
2020 35TH YOUTH ACADEMIC ANNUAL CONFERENCE OF CHINESE ASSOCIATION OF AUTOMATION (YAC), 2020, : 186 - 191
[29] Algorithmic Foundations of Reinforcement Learning
Pareigis, Stephan
ADVANCES IN REAL-TIME AND AUTONOMOUS SYSTEMS, 2023, 2024, 1009 : 1 - 27
[30] A Reinforcement Learning Based Robotic Navigation System
Zuo, Bashan
Chen, Jiaxin
Wang, Larry
Wang, Ying
2014 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN AND CYBERNETICS (SMC), 2014, : 3452 - 3457

← 1 2 3 4 5 →