Reinforcement Learning-Based Feedback and Weight-Adjustment Mechanisms for Consensus Reaching in Group Decision Making

Cited by: 23
Authors
Hassani, Hossein [1]
Razavi-Far, Roozbeh [1, 2, 3]
Saif, Mehrdad [1]
Herrera-Viedma, Enrique [4]
Affiliations
[1] Univ Windsor, Dept Elect & Comp Engn, Windsor, ON N9B 3P4, Canada
[2] Univ New Brunswick, Fac Comp Sci, Fredericton, NB E3B 5A3, Canada
[3] Univ New Brunswick, Canadian Inst Cybersecur, Fredericton, NB E3B 5A3, Canada
[4] Univ Granada, Andalusian Res Inst Data Sci & Computat Intelligen, Dept Comp Sci & AI, Granada 18071, Spain
Source
IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS | 2023 / Vol. 53 / No. 04
Funding
Natural Sciences and Engineering Research Council of Canada
Keywords
Consensus models; decision making; decision support systems; deep learning; reinforcement learning; Z-numbers; PREFERENCE RELATIONS; MINIMUM ADJUSTMENT; FUZZY; CONFIDENCE; FRAMEWORK; MODEL; COST
DOI
10.1109/TSMC.2022.3214221
Chinese Library Classification number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
The number of discussion rounds and the harmony degree of decision makers are two crucial efficiency measures to consider when designing the consensus-reaching process for group decision-making problems. Adjusting the feedback parameter and the importance weights of the decision makers in the recommendation mechanism has a great impact on these efficiency measures. This work proposes novel and efficient reinforcement learning-based adjustment mechanisms to address the tradeoff between these measures. To employ these adjustment mechanisms, we propose to extract the dynamics of state transition from consensus models based on distributed trust functions and Z-numbers in order to convert the decision environment into a Markov decision process. Two independent reinforcement learning agents are then trained via a deep deterministic policy gradient algorithm to adjust the feedback parameter and the importance weights of the decision makers. The first agent is trained to reduce the number of discussion rounds while ensuring the highest possible harmony degree among the decision makers. The second agent merely speeds up the consensus-reaching process by adjusting the importance weights of the decision makers. Various experiments are designed to verify the applicability and scalability of the proposed feedback and weight-adjustment mechanisms in different decision environments.
Pages: 2456-2468
Number of pages: 13