- [1] Yamamura M., Miyazaki K., Kobayashi S., A survey on learning for agents, Journal of JSAI, 10, 5, pp. 23-29, (1995)
- [2] Watkins C.J.C.H., Dayan P., Technical note: Q-Learning, Machine Learning, 8, pp. 55-68, (1992)
- [3] Arai S., Miyazaki K., Kobayashi S., Methodology in multi-agent reinforcement learning: Approaches by Q-learning and profit sharing, Journal of JSAI, 13, 5, pp. 609-618, (1998)
- [4] Miyazaki K., Yamamura M., Kobayashi S., A theory of profit sharing in reinforcement learning, Journal of JSAI, 9, 4, pp. 580-587, (1994)
- [5] Uemura W., Tatsumi S., About the reinforcement function for profit sharing, Transactions of JSAI, 19, 4, pp. 197-203, (2004)
- [6] Uemura W., Ueno A., Tatsumi S., A profit sharing method for forgetting past experiences effectively, Transactions of JSAI, 21, 1, pp. 81-93, (2006)
- [7] Hasegawa Y., Takada S., Nakano H., Arai S., Miyauchi A., A reinforcement learning method using a dynamic reinforcement function based on action selection probability, The IEICE Transactions on Information and Systems, J89D, 4, pp. 788-796, (2006)
- [8] Nakano H., Miyauchi A., Design of Reinforcement Functions in Profit Sharing Reinforcement Learning, IEICE Technical Report, 106, 574, pp. 1-6, (2007)
- [9] Kawai H., Ueno A., Tatsumi S., The consideration of rationality of Profit Sharing with roulette action selection, The 19th Annual Conference of JSAI, 1D3-03, (2005)
- [10] Matsui T., Ohwada H., Rationality of Profit Sharing Based on Expected Value, The 22nd Annual Conference of JSAI, 3A2-1, (2008)