Particle swarm optimization based on temporal-difference learning for solving multi-objective optimization problems

被引：1

作者：

Zhang, Desong ^{[1
]}

Zhu, Guangyu ^{[1
]}

机构：

[1] Fuzhou Univ, Sch Mech Engn & Automat, Qi Shan Campus,2 Xue Yuan Rd, Fuzhou 350108, Fujian, Peoples R China

来源：

COMPUTING | 2023年 / 105卷 / 08期

关键词：

Multi-objective optimization; Particle swarm optimization; Reinforcement learning; Temporal-difference learning; EVOLUTIONARY ALGORITHMS; ENTROPY;

D O I：

10.1007/s00607-023-01166-w

中图分类号：

TP301 [理论、方法];

学科分类号：

081202 ;

摘要：

Multi-objective evolutionary algorithms have become the most important method to deal with multi-objective optimization problems (MOP). To improve the performance of particle swarm optimization (PSO) in addressing MOPs, a multi-objective PSO based on temporal-difference learning (TDLMOPSO) is proposed in this paper. The iteration process of TDLMOPSO is transformed into a Markov decision process, particles are treated as agents, each agent has a personal archive, the states are designed for the connection of actions, the actions of particles contain all necessary behavior of them: basic movement, jump out of local optimum, and local search, and the rewards depend on the relationship between particles' positions and their personal archives. Besides, the external archive deletion strategy and the leader selection strategy are redesigned based on the unsupervised learning algorithm to enhance the diversity of solutions in the external archive. The effectiveness of TDLMOPSO is verified by applying it with other seven advanced multi-objective algorithms in MOP benchmark test suites. Furthermore, the time complexity and parameter sensitivity of TDLMOPSO are analyzed.

引用

页码：1795 / 1820

页数：26

共 30 条

[1]

Ali Meerza SyedIrfan., 2019, IEEE 1st International Conference on Advances in Science, Engineering and Robotics Technology (ICASERT), P1, DOI [DOI 10.1109/GCAT47503.2019.8978315, 10.1109/GCAT47503.2019.8978315]

[2] Combined use of coral reefs optimization and reinforcement learning for improving resource utilization and load balancing in cloud environments [J].

Asghari, Ali ;

Sohrabi, Mohammad Karim .

COMPUTING, 2021, 103 (07) :1545-1567

[3] A benchmark test suite for evolutionary many-objective optimization [J].

Cheng, Ran ;

Li, Miqing ;

Tian, Ye ;

Zhang, Xingyi ;

Yang, Shengxiang ;

Jin, Yaochu ;

Yao, Xin .

COMPLEX & INTELLIGENT SYSTEMS, 2017, 3 (01) :67-81

[4]

Deb K, 2002, IEEE C EVOL COMPUTAT, P825, DOI 10.1109/CEC.2002.1007032

[5] A novel two-archive strategy for evolutionary many-objective optimization algorithm based on reference points [J].

Ding, Rui ;

Dong, Hongbin ;

He, Jun ;

Li, Tao .

APPLIED SOFT COMPUTING, 2019, 78 :447-464

[6] IM-MOEA/D: An Inverse Modeling Multi-Objective Evolutionary Algorithm Based on Decomposition [J].

Farias, Lucas R. C. ;

Araujo, Aluizio F. R. .

2021 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS (SMC), 2021, :462-467

[7] Survey Paper Multi-objective particle swarm optimization with adaptive strategies for feature selection [J].

Han, Fei ;

Chen, Wen-Tao ;

Ling, Qing-Hua ;

Han, Henry .

SWARM AND EVOLUTIONARY COMPUTATION, 2021, 62

[8] Adaptive Offspring Generation for Evolutionary Large-Scale Multiobjective Optimization [J].

He, Cheng ;

Cheng, Ran ;

Yazdani, Danial .

IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2022, 52 (02) :786-798

[9] A fitness landscape ruggedness multiobjective differential evolution algorithm with a reinforcement learning strategy [J].

Huang, Ying ;

Li, Wei ;

Tian, Furong ;

Meng, Xiang .

APPLIED SOFT COMPUTING, 2020, 96

[10] A self-learning bee colony and genetic algorithm hybrid for cloud manufacturing services [J].

Li, Tianhua ;

Yin, Yongcheng ;

Yang, Bo ;

Hou, Jialin ;

Zhou, Kai .

COMPUTING, 2022, 104 (09) :1977-2003

← 1 2 3 →