Parallel reinforcement learning for weighted multi-criteria model with adaptive margin

被引:0
|
作者
Kazuyuki Hiraoka
Manabu Yoshida
Taketoshi Mishima
机构
[1] Saitama University,Graduate School of Science and Engineering
来源
Cognitive Neurodynamics | 2009年 / 3卷
关键词
Reinforcement learning; Multi-criteria; Convex hull; Minkowski sum;
D O I
暂无
中图分类号
学科分类号
摘要
Reinforcement learning (RL) for a linear family of tasks is described in this paper. The key of our discussion is nonlinearity of the optimal solution even if the task family is linear; we cannot obtain the optimal policy using a naive approach. Although an algorithm exists for calculating the equivalent result to Q-learning for each task simultaneously, it presents the problem of explosion of set sizes. We therefore introduce adaptive margins to overcome this difficulty.
引用
收藏
页码:17 / 24
页数:7
相关论文
共 50 条
  • [1] Parallel reinforcement learning for weighted multi-criteria model with adaptive margin
    Hiraoka, Kazuyuki
    Yoshida, Manabu
    Mishima, Taketoshi
    NEURAL INFORMATION PROCESSING, PART I, 2008, 4984 : 487 - +
  • [2] Parallel reinforcement learning for weighted multi-criteria model with adaptive margin
    Hiraoka, Kazuyuki
    Yoshida, Manabu
    Mishima, Taketoshi
    COGNITIVE NEURODYNAMICS, 2009, 3 (01) : 17 - 24
  • [3] The steering approach for multi-criteria reinforcement learning
    Mannor, S
    Shimkin, N
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 14, VOLS 1 AND 2, 2002, 14 : 1563 - 1570
  • [4] Multi-criteria Scheduling in Parallel Environment with Learning Effect
    Liu, Xinbo
    Feng, Yue
    Ding, Ning
    Li, Rui
    Chen, Xin
    FOUNDATIONS OF COMPUTING AND DECISION SCIENCES, 2024, 49 (01) : 3 - 20
  • [5] Adaptive linguistic weighted aggregation operators for multi-criteria decision making
    Aggarwal, Manish
    APPLIED SOFT COMPUTING, 2017, 58 : 690 - 699
  • [6] WEIGHTED MULTI-CRITERIA WORKFLOW TASK ASSIGNMENT
    Yu Ouyang
    Yang, Xiping
    CIICT 2008: PROCEEDINGS OF CHINA-IRELAND INTERNATIONAL CONFERENCE ON INFORMATION AND COMMUNICATIONS TECHNOLOGIES 2008, 2008, : 131 - +
  • [7] A multi-criteria active learning method based on adaptive density clustering
    He Z.
    Zhu W.
    Chen X.
    Zhang X.
    Yi Qi Yi Biao Xue Bao/Chinese Journal of Scientific Instrument, 2024, 45 (03): : 179 - 187
  • [8] Parallel strategies for a multi-criteria GRASP algorithm
    Vianna, DS
    Arroyo, JEC
    Vieira, PS
    de Azeredo, TR
    SCCC 2005: XXV International Conference of the Chilean Computer Science Society, Proceedings, 2005, : 116 - 122
  • [9] Multi-criteria Design Optimization of Parallel Robots
    Unal, Ramazan
    Kiziltas, Gullu
    Patoglu, Volkan
    2008 IEEE CONFERENCE ON ROBOTICS, AUTOMATION, AND MECHATRONICS, VOLS 1 AND 2, 2008, : 577 - 583
  • [10] Capacity planning for integrated energy system based on reinforcement learning and multi-criteria evaluation
    Zhou, Fan
    Chen, Long
    Zhao, Jun
    Wang, Wei
    ENERGY SYSTEMS-OPTIMIZATION MODELING SIMULATION AND ECONOMIC ASPECTS, 2023,