Reinforcement Learning Based Whale Optimizer

Cited by: 6
Authors
Becerra-Rozas, Marcelo [1]
Lemus-Romani, Jose [4]
Crawford, Broderick [1]
Soto, Ricardo [1]
Cisternas-Caneo, Felipe [1]
Embry, Andres Trujillo [1]
Molina, Maximo Arnao [1]
Tapia, Diego [1]
Castillo, Mauricio [1]
Misra, Sanjay [2]
Rubio, Jose-Miguel [3]
Affiliations
[1] Pontificia Univ Catolica Valparaiso, Valparaiso, Chile
[2] Covenant Univ, Ota, Nigeria
[3] Univ Bernardo OHiggins, Santiago, Chile
[4] Pontificia Univ Catolica Chile, Sch Civil Construct, Santiago, Chile
Source
COMPUTATIONAL SCIENCE AND ITS APPLICATIONS, ICCSA 2021, PT IX | 2021 / Vol. 12957
Keywords
Metaheuristic; SARSA; Q-Learning; Swarm intelligence; Whale optimization algorithm; Combinatorial optimization;
DOI
10.1007/978-3-030-87013-3_16
Chinese Library Classification
TP39 [Computer applications];
Discipline classification code
081203 ; 0835 ;
Abstract
This work proposes a reinforcement-learning-based optimizer that integrates SARSA with the Whale Optimization Algorithm. SARSA selects the binarization operator used during the metaheuristic process. The hybrid is applied to benchmark instances of the Set Covering Problem and compared with a Q-Learning version. In terms of fitness, SARSA beats the Q-Learning version in 44 of the 45 instances evaluated, and the one remaining instance is a tie. Finally, the graphs presented in the results analysis show that the approach not only obtains good results but also achieves an appropriate balance between exploration and exploitation, as described in the referenced literature.
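The difference between the two learners compared in the abstract can be illustrated with a minimal sketch of their update rules. This is not the authors' implementation: the Q-table layout, the state and action indices, the reward, and the values of alpha, gamma and epsilon are all assumptions made for illustration. Here an action stands for a hypothetical choice of binarization operator.

```python
import random

def epsilon_greedy(q_row, epsilon, rng):
    """Pick an action index: random with probability epsilon, else greedy."""
    if rng.random() < epsilon:
        return rng.randrange(len(q_row))
    return max(range(len(q_row)), key=lambda a: q_row[a])

def sarsa_update(q, s, a, reward, s_next, a_next, alpha=0.1, gamma=0.9):
    """On-policy update: bootstrap from the action actually chosen next."""
    target = reward + gamma * q[s_next][a_next]
    q[s][a] += alpha * (target - q[s][a])

def q_learning_update(q, s, a, reward, s_next, alpha=0.1, gamma=0.9):
    """Off-policy update: bootstrap from the best action in the next state."""
    target = reward + gamma * max(q[s_next])
    q[s][a] += alpha * (target - q[s][a])

if __name__ == "__main__":
    rng = random.Random(0)
    # Hypothetical table: 2 states x 2 candidate binarization operators.
    q = [[0.0, 0.0], [1.0, 2.0]]
    a_next = epsilon_greedy(q[1], epsilon=0.1, rng=rng)
    sarsa_update(q, s=0, a=0, reward=1.0, s_next=1, a_next=a_next)
    print(q[0])
```

The only difference is the bootstrap term: SARSA uses the value of the action the policy actually takes next, while Q-Learning uses the maximum value in the next state.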
Pages: 205 - 219
Number of pages: 15
Related papers
50 in total
  • [21] Whale Optimization based Deep Residual Learning Network for Early Rice Disease Prediction in IoT
    Lakshmi, M. Sri
    Kashyap, K. Jayadwaja
    Khan, S. Mohammed Fazal
    Reddy, N. Jaya Satya Vratha
    Achari, V. Bharath Kumar
    EAI ENDORSED TRANSACTIONS ON SCALABLE INFORMATION SYSTEMS, 2023, 10 (06)
  • [22] SeaRank: relevance prediction based on click models in a reinforcement learning framework
    Keyhanipour, Amir Hosein
    Oroumchian, Farhad
    DATA TECHNOLOGIES AND APPLICATIONS, 2023, 57 (04) : 465 - 488
  • [23] Transition Based Discount Factor for Model Free Algorithms in Reinforcement Learning
    Sharma, Abhinav
    Gupta, Ruchir
    Lakshmanan, K.
    Gupta, Atul
    SYMMETRY-BASEL, 2021, 13 (07):
  • [24] Local instance-based transfer learning for reinforcement learning
    Li, Xiaoguang
    Ji, Wanting
    Huang, Jidong
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2024, 133
  • [25] Differential Evolution Based Particle Swarm Optimizer for Neural Network Learning
    Ning, Dongfang
    Zhang, Weiguo
    Li, Bin
    2008 7TH WORLD CONGRESS ON INTELLIGENT CONTROL AND AUTOMATION, VOLS 1-23, 2008, : 4444 - 4447
  • [26] Efficient Ranking-Based Whale Optimizer for Parameter Extraction of Three-Diode Photovoltaic Model: Analysis and Validations
    Abdel-Basset, Mohamed
    Mohamed, Reda
    El-Fergany, Attia
    Askar, Sameh S.
    Abouhawwash, Mohamed
    ENERGIES, 2021, 14 (13)
  • [27] Intradialytic hypotension prediction using covariance matrix-driven whale optimizer with orthogonal structure-assisted extreme learning machine
    Li, Yupeng
    Zhao, Dong
    Liu, Guangjie
    Liu, Yi
    Bano, Yasmeen
    Ibrohimov, Alisherjon
    Chen, Huiling
    Wu, Chengwen
    Chen, Xumin
    FRONTIERS IN NEUROINFORMATICS, 2022, 16
  • [28] Review of reinforcement learning research
    Jia, Jingkai
    Wang, Wenlin
    2020 35TH YOUTH ACADEMIC ANNUAL CONFERENCE OF CHINESE ASSOCIATION OF AUTOMATION (YAC), 2020, : 186 - 191
  • [29] Algorithmic Foundations of Reinforcement Learning
    Pareigis, Stephan
    ADVANCES IN REAL-TIME AND AUTONOMOUS SYSTEMS, 2023, 2024, 1009 : 1 - 27
  • [30] A Reinforcement Learning Based Robotic Navigation System
    Zuo, Bashan
    Chen, Jiaxin
    Wang, Larry
    Wang, Ying
    2014 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN AND CYBERNETICS (SMC), 2014, : 3452 - 3457