A multi-strategy particle swarm optimization framework based on deep reinforcement learning

Cited by: 1

Authors:

Hou, Leyong [1]

Fan, Debin [1]

Cheng, Junjie [1]

Wu, Honglian [2]

Peng, Hu [1]

Deng, Changshou [2]

Affiliations:

[1] JiuJiang Univ, Sch Comp & Big Data Sci, Jiujiang 332005, Peoples R China

[2] JiuJiang Univ, Sch Elect & Informat Engn, Jiujiang 332005, Peoples R China

Source:

2023 15TH INTERNATIONAL CONFERENCE ON ADVANCED COMPUTATIONAL INTELLIGENCE, ICACI | 2023

Funding:

National Natural Science Foundation of China;

Keywords:

particle swarm optimization; deep reinforcement learning; multi-strategy;

DOI:

10.1109/ICACI58115.2023.10146133

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

The particle swarm optimization (PSO) algorithm is a well-known optimization algorithm that has shown good performance in solving engineering problems. However, the performance and convergence speed of the PSO algorithm are easily affected by its parameter settings. In this paper, we propose an adaptive parameter optimization framework (APOF) for the PSO algorithm based on the Deep Deterministic Policy Gradient (DDPG) method of deep reinforcement learning. To achieve a better optimization effect, the strategy group is extracted from the APOF, so that the APOF can be combined with more strategies to improve the search ability of the optimized algorithm. This paper also improves the PSO algorithm and proposes the hybrid cluster PSO algorithm (HCPSO) as the built-in algorithm of the APOF. In the experiments, twenty-one functions are selected and implemented to test the optimization effect of the APOF. The experimental results show that the APOF has a good optimization effect and scalability, and that the built-in HCPSO algorithm also achieves good performance.
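To illustrate why parameter settings matter, the following is a minimal sketch of the standard (textbook) PSO update, not the paper's HCPSO or APOF. The function names `sphere` and `pso`, the default values of the inertia weight `w` and the acceleration coefficients `c1` and `c2`, and the sphere test function are all assumptions chosen for illustration. These three values are the kind of parameters that an adaptive framework would tune during the run instead of fixing in advance.

```python
import random


def sphere(x):
    """Sphere test function: global minimum 0 at the origin."""
    return sum(v * v for v in x)


def pso(objective, dim=5, swarm_size=20, iters=200,
        w=0.7, c1=1.5, c2=1.5, bound=5.0, seed=0):
    """Textbook global-best PSO with fixed parameters (illustrative only).

    w  : inertia weight (assumed value)
    c1 : cognitive coefficient, pull toward each particle's own best
    c2 : social coefficient, pull toward the swarm's best
    """
    rng = random.Random(seed)
    pos = [[rng.uniform(-bound, bound) for _ in range(dim)]
           for _ in range(swarm_size)]
    vel = [[0.0] * dim for _ in range(swarm_size)]
    pbest = [p[:] for p in pos]
    pbest_val = [objective(p) for p in pos]
    g = min(range(swarm_size), key=pbest_val.__getitem__)
    gbest, gbest_val = pbest[g][:], pbest_val[g]
    initial_best = gbest_val

    for _ in range(iters):
        for i in range(swarm_size):
            for d in range(dim):
                r1, r2 = rng.random(), rng.random()
                # Velocity update: inertia + cognitive term + social term.
                vel[i][d] = (w * vel[i][d]
                             + c1 * r1 * (pbest[i][d] - pos[i][d])
                             + c2 * r2 * (gbest[d] - pos[i][d]))
                # Position update, clamped to the search bounds.
                pos[i][d] = max(-bound, min(bound, pos[i][d] + vel[i][d]))
            val = objective(pos[i])
            if val < pbest_val[i]:
                pbest[i], pbest_val[i] = pos[i][:], val
                if val < gbest_val:
                    gbest, gbest_val = pos[i][:], val
    return gbest, gbest_val, initial_best


if __name__ == "__main__":
    best, best_val, start_val = pso(sphere)
    print("initial best:", start_val)
    print("final best:  ", best_val)
```

Rerunning `pso` with different values of `w`, `c1`, and `c2` changes how quickly the best value falls, which is the sensitivity the abstract refers to. A reinforcement learning agent such as DDPG could, in principle, output these values at each iteration; how the APOF actually does this is described in the paper itself and is not reproduced here.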


Pages: 8
