A multi-stage group decision model based on improved Q-learning

被引:0
|
作者
Zhang F. [1 ,2 ]
Liu L.-Y. [1 ,2 ]
Guo X.-X. [1 ,2 ]
机构
[1] College of Mathematics and Information Science, Hebei University, Baoding
[2] Hebei Key Laboratory of Machine Learning and Computational Intelligence, Baoding
来源
Kongzhi yu Juece/Control and Decision | 2019年 / 34卷 / 09期
关键词
Group consensus; Group decision making; Multi-stage group decision; Q-learning; Reinforcement learning; Uncertainty;
D O I
10.13195/j.kzyjc.2018.0082
中图分类号
学科分类号
摘要
The multi-stage group decision making problem is a typical sequential group decision making problem. It is normally utilized to find the optimal solution to the group decision problems in discrete deterministic environment. However, the real life environments faced by decision-makers are usually full of uncertainty, even unknown environments (with unknown state transition matrix). Therefore, it is essential for the decision-makers to obtain more information by interacting with the environment dynamically to achieve an optimal decision strategy with high consensus degree. Due to the advantage of reinforcement learning in handling the sequential decision-making problems, the classical reinforcement learning algorithm (Q-learning) is improved to discover the optimal solution of multi-stage group decision making problems under uncertain environment. Additionally, a theorem is proposed to show that the optimal group decision obtained by using the improved Q-learning algorithm is the group decision with the highest degree of group consensus. Finally, an illustrative example is presented to verify the rationality and feasibility of the proposed algorithm. © 2019, Editorial Office of Control and Decision. All right reserved.
引用
收藏
页码:1917 / 1922
页数:5
相关论文
共 19 条
  • [1] Hwang C.L., Group Decision Making Under Multiple Criteria, Methods and Applications, pp. 311-317, (1987)
  • [2] Peng Y., Hu Y., The Pareto optimization policies of multi-stage group decision making, J of Sichuan University: Natural Science Edition, 44, 3, pp. 482-484, (2007)
  • [3] Hao J.J., Zhu J.J., Liu Y., Model and algorithm for multi-stage group decision-making concerning different decision groups and dual information, Systems Engineering, 34, 5, pp. 129-134, (2016)
  • [4] Zhang Y.J., Optimal algorithm of multistage group decision-making under discrete determinate status, J of China Three Gorges University, 35, 2, pp. 104-107, (2013)
  • [5] Lu Z.P., Lu C.Y., Fast aggregated model for multi-stage group decision-making probelm based on preference distance method, Int Conf on Management and Service Science, pp. 1-4, (2011)
  • [6] Zhang N., Fang Z.G., Zhu J.J., Multi-stage grey situation group decision-making model based on Orness, Control and Decision, 30, 7, pp. 1227-1232, (2015)
  • [7] Luo D., Li Y.W., Multi-stage and multi-attribute risk group decision-making method based on grey information, Workshop on Grey System Theory and its Applications, pp. 305-310, (2014)
  • [8] Zhou S.H., Research on the large group decision making method of multi-objective and multi-stage within conflict based on fuzzy preference relation, (2013)
  • [9] Ma Y.R., Wang X., Partner selection model based on fuzzy language group decision-making method for dynamic logistics alliance under multiple time periods, Systems Engineering, 26, 6, pp. 32-36, (2008)
  • [10] Kaelbling L.P., Reinforcement learning: A survey, J of Artificial Intelligence Research, 4, 1, pp. 237-285, (1996)