Time-critical interactive dynamic influence diagram

被引:2
作者
Pan, Yinghui [1 ]
Zeng, Yifeng [2 ,3 ]
Xiang, Yanping [4 ]
Sun, Le [5 ]
Chen, Xuefeng
机构
[1] Jiangxi Univ Finance & Econ, Sch Informat Technol, Nanchang, Peoples R China
[2] Xiamen Univ, Dept Automat, Xiamen, Peoples R China
[3] Univ Teesside, Sch Comp, Middlesbrough, Cleveland, England
[4] Univ Elect Sci & Technol China, Sch Comp Sci & Engn, Chengdu 610054, Peoples R China
[5] Curtin Univ, Sch Informat Syst, Perth, WA 6845, Australia
关键词
Multiagent time-critical decision making; Interactive dynamic influence diagram; Model expansion; DECISION; FRAMEWORK;
D O I
10.1016/j.ijar.2014.11.004
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Multiagent time-critical dynamic decision making is a challenging task in many real-world applications where a trade-off between solution quality and computational tractability is required. In this paper, we present a formal representation for modelling time-critical multiagent dynamic decision problems based on interactive dynamic influence diagrams (I-DIDs). The new representation called time-critical I-DIDs (TC-IDIDs) represents space-temporal abstraction by providing time-index to nodes and the model is defined in terms of the condensed and deployed forms. The condensed form is a static model of TC-IDIDs and can be expanded into its dynamic version. To facilitate the conversion between the two forms, we exploit the notion of object-orientation design to develop flexible and reusable TC-IDIDs. The difficulty on expanding TC-1DIDs is to select a proper time sequence to index nodes in the condensed form so that the expanded TC-IDIDs can be solved efficiently without compromising the quality of the policy. For this purpose, we propose two methods to build the condensed form of TC-IDIDs. We evaluate the solution quality and time complexity in three well-studied problems and provide results in support. (C) 2014 Elsevier Inc. All rights reserved.
引用
收藏
页码:44 / 63
页数:20
相关论文
共 34 条
[1]  
[Anonymous], 2001, Games and Economic Behavior
[2]   Interactive epistemology II: Probability [J].
Aumann, RJ .
INTERNATIONAL JOURNAL OF GAME THEORY, 1999, 28 (03) :301-314
[3]  
Bangs O., 2003, P 8 SCAND C ART INT, P25
[4]  
Bulitko V., 2002, P AAAI KDD UAI 2002, P37
[5]  
Cohen I., 2008, Decision Analysis, V5, P100
[6]  
Cover T. M., 2006, Elements of information theory, Vsecond, DOI [DOI 10.1002/047174882X, 10.1002/0471200611]
[7]   DECISION-THEORETIC CONTROL OF INFERENCE FOR TIME-CRITICAL APPLICATIONS [J].
DEAN, T .
INTERNATIONAL JOURNAL OF INTELLIGENT SYSTEMS, 1991, 6 (04) :417-441
[8]  
Doshi P., 2005, P AAAI C ART INT, P969
[9]  
Doshi P., 2011, P IEEE WIC ACM INT C, P165
[10]   Graphical models for interactive POMDPs: representations and solutions [J].
Doshi, Prashant ;
Zeng, Yifeng ;
Chen, Qiongyu .
AUTONOMOUS AGENTS AND MULTI-AGENT SYSTEMS, 2009, 18 (03) :376-416