A rationally oriented forgettable profit sharing

Cited by: 1
Authors
Koujaku, Sadamori [1]
Watanabe, Kota [1]
Igarashi, Hajima [1]
Affiliations
[1] Hokkaido Univ, Sapporo, Hokkaido 060, Japan
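Keywords
reinforcement learning; profit sharing; Miyazaki rational theorem
DOI
10.1002/ecj.11461
Chinese Library Classification
TM [Electrical Engineering]; TN [Electronics and Communication Technology]
Discipline classification codes
0808; 0809
Abstract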
In this paper, the Rationally Oriented Forgettable Profit Sharing method (RFPS) for reinforcement learning is proposed. Although profit sharing (PS) performs well in real environments, its learning is often slow in long-term tasks because it is difficult to determine an appropriate discount rate that satisfies the Miyazaki rational theorem. Several rationality-relaxed PS methods work well for such tasks. However, these PS methods may result in many irrational loops. The proposed method fulfills rationality by forgetting the reinforced irrational loops. This method can be easily combined with ordinary PS methods and performs well in long-term tasks. Simulation results show that the proposed method can learn more efficiently than conventional PS methods. © 2013 Wiley Periodicals, Inc. Electron Comm Jpn, 96(7): 11-18, 2013; Published online in Wiley Online Library (wileyonlinelibrary.com). DOI 10.1002/ecj.11461
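The credit assignment that the abstract refers to can be illustrated with a minimal sketch of ordinary profit sharing with a geometrically decreasing credit function. This is not the RFPS method, and it is not taken from the paper: the function name `profit_sharing_update`, the toy episode, and the `decay` value of 0.25 are all hypothetical, and the sketch makes no claim that this `decay` satisfies the Miyazaki rational theorem for any particular task.

```python
from collections import defaultdict


def profit_sharing_update(weights, episode, reward, decay):
    """Distribute `reward` backwards along one episode.

    `episode` is a list of (state, action) rules in the order they fired.
    The rule fired k steps before the reward receives reward * decay**k.
    All names and values here are illustrative assumptions.
    """
    credit = reward
    for state, action in reversed(episode):
        weights[(state, action)] += credit
        credit *= decay  # geometric decrease toward the start of the episode
    return weights


# Toy three-step episode ending in a reward of 1.0 (hypothetical data).
weights = defaultdict(float)
episode = [("s0", "right"), ("s1", "right"), ("s2", "up")]
profit_sharing_update(weights, episode, reward=1.0, decay=0.25)

# Credit that would reach a rule fired 20 steps before the reward.
early_credit = 1.0 * 0.25 ** 20
```

With a small `decay`, the credit reaching rules far from the reward shrinks very quickly (here `early_credit` is below 1e-12), which is one way to see why learning can be slow in long-term tasks when the discount rate must be kept small.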
Pages: 11-18
Page count: 8