A multi-stage group decision model based on improved Q-learning

被引:0
|
作者
Zhang F. [1 ,2 ]
Liu L.-Y. [1 ,2 ]
Guo X.-X. [1 ,2 ]
机构
[1] College of Mathematics and Information Science, Hebei University, Baoding
[2] Hebei Key Laboratory of Machine Learning and Computational Intelligence, Baoding
来源
Kongzhi yu Juece/Control and Decision | 2019年 / 34卷 / 09期
关键词
Group consensus; Group decision making; Multi-stage group decision; Q-learning; Reinforcement learning; Uncertainty;
D O I
10.13195/j.kzyjc.2018.0082
中图分类号
学科分类号
摘要
The multi-stage group decision making problem is a typical sequential group decision making problem. It is normally utilized to find the optimal solution to the group decision problems in discrete deterministic environment. However, the real life environments faced by decision-makers are usually full of uncertainty, even unknown environments (with unknown state transition matrix). Therefore, it is essential for the decision-makers to obtain more information by interacting with the environment dynamically to achieve an optimal decision strategy with high consensus degree. Due to the advantage of reinforcement learning in handling the sequential decision-making problems, the classical reinforcement learning algorithm (Q-learning) is improved to discover the optimal solution of multi-stage group decision making problems under uncertain environment. Additionally, a theorem is proposed to show that the optimal group decision obtained by using the improved Q-learning algorithm is the group decision with the highest degree of group consensus. Finally, an illustrative example is presented to verify the rationality and feasibility of the proposed algorithm. © 2019, Editorial Office of Control and Decision. All right reserved.
引用
收藏
页码:1917 / 1922
页数:5
相关论文
共 19 条
  • [11] Zheng S.L., Han J.H., Luo X.F., Research on cooperation and reinforcement learning in multi-agent systems, Pattern Recognition & Artificial Intelligence, 15, 4, pp. 453-456, (2002)
  • [12] Zhou Z.H., Machine Learning, pp. 371-390, (2015)
  • [13] Mitchell T.M., Machine Learning, pp. 270-271, (2014)
  • [14] Hao J., Huang D., Cai Y., The dynamics of reinforcement social learning in networked cooperative multiagent systems, Engineering Applications of Artificial Intelligence, 58, pp. 111-122, (2017)
  • [15] Zhan Y.S., Ammar H.B., Taylor M.E., Scalable lifelong reinforcement learning, Pattern Recognition, 72, pp. 407-418, (2017)
  • [16] Foerster J., Nardelli N., Farquhar G., Et al., Stabilising experience replay for deep multi-agent reinforcement learning, Proc of the 34th Int Conf on Machine Learning, pp. 1146-1155, (2017)
  • [17] Mnih V., Kavukcuoglu K., Silver D., Et al., Playing atari with deep reinforcement learning
  • [18] Mnih V., Kavukcuoglu K., Silver D., Human-level control through deep reinforcement learning, Natur, 518, 7540, pp. 529-533, (2015)
  • [19] Palomares I., Martinez L., A semi-supervised multiagent system model to support consensus-reaching processes, IEEE Trans on Fuzzy Systems, 22, 4, pp. 762-777, (2014)