Reinforcement Learning Based Whale Optimizer

被引:6
作者
Becerra-Rozas, Marcelo [1 ]
Lemus-Romani, Jose [4 ]
Crawford, Broderick [1 ]
Soto, Ricardo [1 ]
Cisternas-Caneo, Felipe [1 ]
Embry, Andres Trujillo [1 ]
Molina, Maximo Arnao [1 ]
Tapia, Diego [1 ]
Castillo, Mauricio [1 ]
Misra, Sanjay [2 ]
Rubio, Jose-Miguel [3 ]
机构
[1] Pontificia Univ Catolica Valparaiso, Valparaiso, Chile
[2] Covenant Univ, Ota, Nigeria
[3] Univ Bernardo OHiggins, Santiago, Chile
[4] Pontificia Univ Catolica Chile, Sch Civil Construct, Santiago, Chile
来源
COMPUTATIONAL SCIENCE AND ITS APPLICATIONS, ICCSA 2021, PT IX | 2021年 / 12957卷
关键词
Metaheuristic; SARSA; Q-Learning; Swarm intelligence; Whale optimization algorithm; Combinatorial optimization;
D O I
10.1007/978-3-030-87013-3_16
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
This work proposes a Reinforcement Learning based optimizer integrating SARSA and Whale Optimization Algorithm. SARSA determines the binarization operator required during the metaheuristic process. The hybrid instance is applied to solve benchmarks of the Set Covering Problem and it is compared with a Q-learning version, showing good results in terms of fitness, specifically, SARSA beats its Q-Learning version in 44 out of 45 instances evaluated. It is worth mentioning that the only instance where it does not win is a tie. Finally, thanks to graphs presented in our results analysis we can observe that not only does it obtain good results, it also obtains a correct exploration and exploitation balance as presented in the referenced literature.
引用
收藏
页码:205 / 219
页数:15
相关论文
共 50 条
  • [1] An enhanced associative learning-based exploratory whale optimizer for global optimization
    Ali Asghar Heidari
    Ibrahim Aljarah
    Hossam Faris
    Huiling Chen
    Jie Luo
    Seyedali Mirjalili
    Neural Computing and Applications, 2020, 32 : 5185 - 5211
  • [2] An enhanced associative learning-based exploratory whale optimizer for global optimization
    Heidari, Ali Asghar
    Aljarah, Ibrahim
    Faris, Hossam
    Chen, Huiling
    Luo, Jie
    Mirjalili, Seyedali
    NEURAL COMPUTING & APPLICATIONS, 2020, 32 (09) : 5185 - 5211
  • [3] Embedded Learning Approaches in the Whale Optimizer to Solve Coverage Combinatorial Problems
    Becerra-Rozas, Marcelo
    Cisternas-Caneo, Felipe
    Crawford, Broderick
    Soto, Ricardo
    Garcia, Jose
    Astorga, Gino
    Palma, Wenceslao
    MATHEMATICS, 2022, 10 (23)
  • [4] Whale Optimizer-Based Clustering for Breast Histopathology Image Segmentation
    Ray, Swarnajit
    Das, Arunita
    Dhal, Krishna Gopal
    Galvez, Jorge
    Naskar, Prabir Kumar
    INTERNATIONAL JOURNAL OF SWARM INTELLIGENCE RESEARCH, 2022, 13 (01)
  • [5] An Improved Whale Optimizer with Multiple Strategies for Intelligent Prediction of Talent Stability
    Li, Hong
    Ke, Sicheng
    Rao, Xili
    Li, Caisi
    Chen, Danyan
    Kuang, Fangjun
    Chen, Huiling
    Liang, Guoxi
    Liu, Lei
    ELECTRONICS, 2022, 11 (24)
  • [6] Real-time epileptic seizure recognition using Bayesian genetic whale optimizer and adaptive machine learning
    Anter, Ahmed M.
    Abd Elaziz, Mohamed
    Zhang, Zhiguo
    FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2022, 127 : 426 - 434
  • [7] A Comparison of Reinforcement Learning Based Approaches to Appliance Scheduling
    Chauhan, Namit
    Choudhary, Neha
    George, Koshy
    PROCEEDINGS OF THE 2016 2ND INTERNATIONAL CONFERENCE ON CONTEMPORARY COMPUTING AND INFORMATICS (IC3I), 2016, : 253 - 258
  • [8] Swarm Reinforcement Learning Algorithms Based on Sarsa Method
    Iima, Hitoshi
    Kuroe, Yasuaki
    2008 PROCEEDINGS OF SICE ANNUAL CONFERENCE, VOLS 1-7, 2008, : 1963 - 1967
  • [9] Game Artificial Intelligence Based Using Reinforcement Learning
    Agung, Albertus
    Gaol, Ford Lumban
    INTERNATIONAL CONFERENCE ON ADVANCES SCIENCE AND CONTEMPORARY ENGINEERING 2012, 2012, 50 : 555 - 565
  • [10] Reinforcement learning-based mobile robot navigation
    Altuntas, Nihal
    Imal, Erkan
    Emanet, Nahit
    Ozturk, Ceyda Nur
    TURKISH JOURNAL OF ELECTRICAL ENGINEERING AND COMPUTER SCIENCES, 2016, 24 (03) : 1747 - 1767