Parallel reinforcement learning for weighted multi-criteria model with adaptive margin

Cited by: 0

Authors:

Kazuyuki Hiraoka

Manabu Yoshida

Taketoshi Mishima

Affiliation:

[1] Saitama University, Graduate School of Science and Engineering

Source:

Cognitive Neurodynamics | 2009 / Vol. 3

Keywords:

Reinforcement learning; Multi-criteria; Convex hull; Minkowski sum

DOI:

Not available

Abstract:

This paper describes reinforcement learning (RL) for a linear family of tasks. The key point of our discussion is that the optimal solution is nonlinear even when the task family is linear, so a naive approach cannot obtain the optimal policy. Although an algorithm exists that computes, for every task simultaneously, a result equivalent to Q-learning, it suffers from an explosion of set sizes. We therefore introduce adaptive margins to overcome this difficulty.
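
The nonlinearity mentioned in the abstract can be illustrated with a minimal sketch. This is not the authors' algorithm: it assumes a hypothetical one-state MDP with two self-looping actions, two-dimensional reward vectors, a discount factor of 0.5, and plain value iteration in place of Q-learning. All names (`REWARDS`, `optimal_value`, `naive_value`) are invented for illustration. Each task in the linear family is defined by a weight vector that turns the reward vector into a scalar reward.

```python
GAMMA = 0.5  # assumed discount factor

# Hypothetical one-state MDP: every action loops back to the same state and
# yields a reward vector with one component per criterion.
REWARDS = [(1.0, 0.0),   # action 0 favors criterion 0
           (0.0, 1.0)]   # action 1 favors criterion 1


def optimal_value(weights, iterations=200):
    """Value iteration for the task whose scalar reward is weights . r(a)."""
    scalar_rewards = [sum(w * r for w, r in zip(weights, reward))
                      for reward in REWARDS]
    value = 0.0
    for _ in range(iterations):
        # Bellman optimality update for a single state.
        value = max(scalar_rewards) + GAMMA * value
    return value


def naive_value(weights):
    """Weighted sum of the optimal values of the single-criterion tasks."""
    unit_tasks = [(1.0, 0.0), (0.0, 1.0)]
    per_criterion = [optimal_value(task) for task in unit_tasks]
    return sum(w * v for w, v in zip(weights, per_criterion))


weights = (0.5, 0.5)
true_value = optimal_value(weights)
naive_value_estimate = naive_value(weights)
print(true_value, naive_value_estimate)
```

In this toy setting the optimal value for equal weights converges to 1, whereas the naive weighted combination of the per-criterion optima gives 2. The optimal value is the maximum over actions of functions linear in the weights, so it is piecewise linear and convex in the weights rather than linear.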

Pages: 17 - 24

Page count: 7

50 entries in total

[1] Parallel reinforcement learning for weighted multi-criteria model with adaptive margin
Hiraoka, Kazuyuki
Yoshida, Manabu
Mishima, Taketoshi
NEURAL INFORMATION PROCESSING, PART I, 2008, 4984 : 487 - +
[2] Parallel reinforcement learning for weighted multi-criteria model with adaptive margin
Hiraoka, Kazuyuki
Yoshida, Manabu
Mishima, Taketoshi
COGNITIVE NEURODYNAMICS, 2009, 3 (01) : 17 - 24
[3] The steering approach for multi-criteria reinforcement learning
Mannor, S
Shimkin, N
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 14, VOLS 1 AND 2, 2002, 14 : 1563 - 1570
[4] Multi-criteria Scheduling in Parallel Environment with Learning Effect
Liu, Xinbo
Feng, Yue
Ding, Ning
Li, Rui
Chen, Xin
FOUNDATIONS OF COMPUTING AND DECISION SCIENCES, 2024, 49 (01) : 3 - 20
[5] Adaptive linguistic weighted aggregation operators for multi-criteria decision making
Aggarwal, Manish
APPLIED SOFT COMPUTING, 2017, 58 : 690 - 699
[6] WEIGHTED MULTI-CRITERIA WORKFLOW TASK ASSIGNMENT
Yu Ouyang
Yang, Xiping
CIICT 2008: PROCEEDINGS OF CHINA-IRELAND INTERNATIONAL CONFERENCE ON INFORMATION AND COMMUNICATIONS TECHNOLOGIES 2008, 2008, : 131 - +
[7] A multi-criteria active learning method based on adaptive density clustering
He Z.
Zhu W.
Chen X.
Zhang X.
Yi Qi Yi Biao Xue Bao/Chinese Journal of Scientific Instrument, 2024, 45 (03) : 179 - 187
[8] Parallel strategies for a multi-criteria GRASP algorithm
Vianna, DS
Arroyo, JEC
Vieira, PS
de Azeredo, TR
SCCC 2005: XXV International Conference of the Chilean Computer Science Society, Proceedings, 2005, : 116 - 122
[9] Multi-criteria Design Optimization of Parallel Robots
Unal, Ramazan
Kiziltas, Gullu
Patoglu, Volkan
2008 IEEE CONFERENCE ON ROBOTICS, AUTOMATION, AND MECHATRONICS, VOLS 1 AND 2, 2008, : 577 - 583
[10] Capacity planning for integrated energy system based on reinforcement learning and multi-criteria evaluation
Zhou, Fan
Chen, Long
Zhao, Jun
Wang, Wei
ENERGY SYSTEMS-OPTIMIZATION MODELING SIMULATION AND ECONOMIC ASPECTS, 2023,
