Learning to Optimize: Reference Vector Reinforcement Learning Adaption to Constrained Many-Objective Optimization of Industrial Copper Burdening System

被引:141
作者
Ma, Lianbo [1 ]
Li, Nan [1 ]
Guo, Yinan [2 ]
Wang, Xingwei [3 ]
Yang, Shengxiang [4 ]
Huang, Min [5 ]
Zhang, Hao [6 ]
机构
[1] Northeastern Univ, Coll Software, Shenyang 110819, Peoples R China
[2] China Univ Min & Technol Beijing, Sch Informat Sci & Engn, Beijing 100083, Peoples R China
[3] Northeastern Univ, Coll Comp Sci, Shenyang 110819, Peoples R China
[4] De Montfort Univ, Sch Comp Sci & Informat, Leicester LE1 9BH, Leics, England
[5] Northeastern Univ, Coll Informat Sci & Engn, State Key Lab Synthet Automat Proc Ind, Shenyang 110819, Peoples R China
[6] Chinese Acad Sci, Shenyang Inst Automat, Shenyang 110016, Peoples R China
基金
中国国家自然科学基金;
关键词
Optimization; Shape; Adaptation models; Copper; Reinforcement learning; Task analysis; Sociology; Copper burdening optimization; many-objective optimization; reference vector reinforcement learning (RVRL); MULTIOBJECTIVE EVOLUTIONARY ALGORITHM; NONDOMINATED SORTING APPROACH; DECOMPOSITION; SEARCH; MOEA/D;
D O I
10.1109/TCYB.2021.3086501
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The performance of decomposition-based algorithms is sensitive to the Pareto front shapes since their reference vectors preset in advance are not always adaptable to various problem characteristics with no a priori knowledge. For this issue, this article proposes an adaptive reference vector reinforcement learning (RVRL) approach to decomposition-based algorithms for industrial copper burdening optimization. The proposed approach involves two main operations, that is: 1) a reinforcement learning (RL) operation and 2) a reference point sampling operation. Given the fact that the states of reference vectors interact with the landscape environment (quite often), the RL operation treats the reference vector adaption process as an RL task, where each reference vector learns from the environmental feedback and selects optimal actions for gradually fitting the problem characteristics. Accordingly, the reference point sampling operation uses estimation-of-distribution learning models to sample new reference points. Finally, the resultant algorithm is applied to handle the proposed industrial copper burdening problem. For this problem, an adaptive penalty function and a soft constraint-based relaxing approach are used to handle complex constraints. Experimental results on both benchmark problems and real-world instances verify the competitiveness and effectiveness of the proposed algorithm.
引用
收藏
页码:12698 / 12711
页数:14
相关论文
共 54 条
[1]   Diversity Management in Evolutionary Many-Objective Optimization [J].
Adra, Salem F. ;
Fleming, Peter J. .
IEEE TRANSACTIONS ON EVOLUTIONARY COMPUTATION, 2011, 15 (02) :183-195
[2]   An Enhanced Decomposition-Based Evolutionary Algorithm With Adaptive Reference Vectors [J].
Asafuddoula, Md ;
Singh, Hemant Kumar ;
Ray, Tapabrata .
IEEE TRANSACTIONS ON CYBERNETICS, 2018, 48 (08) :2321-2334
[3]   An Efficient Algorithm for Computing Hypervolume Contributions [J].
Bringmann, Karl ;
Friedrich, Tobias .
EVOLUTIONARY COMPUTATION, 2010, 18 (03) :383-402
[4]   A Decomposition-Based Many-Objective Evolutionary Algorithm With Two Types of Adjustments for Direction Vectors [J].
Cai, Xinye ;
Mei, Zhiwei ;
Fan, Zhun .
IEEE TRANSACTIONS ON CYBERNETICS, 2018, 48 (08) :2335-2348
[5]   Six-DOF Spacecraft Optimal Trajectory Planning and Real-Time Attitude Control: A Deep Neural Network-Based Approach [J].
Chai, Runqi ;
Tsourdos, Antonios ;
Savvaris, Al ;
Chai, Senchun ;
Xia, Yuanqing ;
Chen, C. L. Philip .
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2020, 31 (11) :5005-5013
[6]   Multiobjective Optimal Parking Maneuver Planning of Autonomous Wheeled Vehicles [J].
Chai, Runqi ;
Tsourdos, Antonios ;
Savvaris, Al ;
Chai, Senchun ;
Xia, Yuanqing ;
Chen, C. L. Philip .
IEEE TRANSACTIONS ON INDUSTRIAL ELECTRONICS, 2020, 67 (12) :10809-10821
[7]   Multiobjective Overtaking Maneuver Planning for Autonomous Ground Vehicles [J].
Chai, Runqi ;
Tsourdos, Antonios ;
Al Savvaris ;
Chai, Senchun ;
Xia, Yuanqing ;
Chen, C. L. Philip .
IEEE TRANSACTIONS ON CYBERNETICS, 2021, 51 (08) :4035-4049
[8]   Solving Multiobjective Constrained Trajectory Optimization Problem by an Extended Evolutionary Algorithm [J].
Chai, Runqi ;
Savvaris, Al ;
Tsourdos, Antonios ;
Xia, Yuanqing ;
Chai, Senchun .
IEEE TRANSACTIONS ON CYBERNETICS, 2020, 50 (04) :1630-1643
[9]  
Cheng R, 2017, CSR1701 U BIRM SCH C
[10]   A Reference Vector Guided Evolutionary Algorithm for Many-Objective Optimization [J].
Cheng, Ran ;
Jin, Yaochu ;
Olhofer, Markus ;
Sendhoff, Bernhard .
IEEE TRANSACTIONS ON EVOLUTIONARY COMPUTATION, 2016, 20 (05) :773-791