Solving combined economic and emission dispatch problems using reinforcement learning-based adaptive differential evolution algorithm

被引：12

作者：

Luo, Wenguan ^{[1
,2
]}

Yu, Xiaobing ^{[1
,2
]}

Wei, Yifan ^{[1
,2
]}

机构：

[1] Nanjing Univ Informat Sci & Technol, Collaborat Innovat Ctr Forecast & Evaluat Meteorol, Nanjing 210044, Peoples R China

[2] Nanjing Univ Informat Sci & Technol, Res Inst Risk Governance & Emergency Decis Making, Sch Management Sci & Engn, Nanjing 210044, Peoples R China

来源：

ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE | 2023年 / 126卷

基金：

中国国家自然科学基金;

关键词：

Differential evolution; Reinforcement learning; CEED; Price penalty factors; LINE FLOW; OPTIMIZATION; MODEL;

D O I：

10.1016/j.engappai.2023.107002

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Nowadays, economic and environmental concerns in production have become increasingly significant. To address these issues, the Combined Economic and Emission Dispatch (CEED) problem has been introduced to optimize the power generation process by considering fuel cost and emitted substances. However, due to the nonlinearity and nonconvexity of the objective function, the optimization of CEED remains a challenge. In this paper, we develop a Reinforcement Learning-based Adaptive Differential Evolution (RLADE) algorithm to enhance the optimization performance. The mutation strategy and crossover probability of RLADE are optimized using Reinforcement Learning (RL) to respectively ensure better convergence speed and searchability. Additionally, two modifications of RL, namely the adaptive population size-based state division and fitness-rankingbased reward mechanism, are proposed to improve the accuracy of state division and reward calculation in RL. The experiments conducted in this paper consider two objective formulation methods of CEED problems, namely the quadratic and cubic criterion functions. The mean values and standard deviations of the obtained solutions were utilized to assess the performance of RLADE, as well as other comparative algorithms, namely DE algorithm and two RL-based DE variants. The results clearly demonstrate that RLADE surpasses its counterparts with proportion of 100%, 85.7%, and 100% for the 6-unit and 11-unit quadratic CEED problems, as well as cubic criterion functions, in terms of both search accuracy and convergence ability. Furthermore, the significance of RLADE's superiority is confirmed through the Wilcoxon's signed rank test.

引用

页数：16

共 55 条

[1] Analyze the environmental sustainability factors of China: The role of fossil fuel energy and renewable energy [J].