Time-critical interactive dynamic influence diagram

被引：2

作者：

Pan, Yinghui ^{[1
]}

Zeng, Yifeng ^{[2
,3
]}

Xiang, Yanping ^{[4
]}

Sun, Le ^{[5
]}

Chen, Xuefeng

机构：

[1] Jiangxi Univ Finance & Econ, Sch Informat Technol, Nanchang, Peoples R China

[2] Xiamen Univ, Dept Automat, Xiamen, Peoples R China

[3] Univ Teesside, Sch Comp, Middlesbrough, Cleveland, England

[4] Univ Elect Sci & Technol China, Sch Comp Sci & Engn, Chengdu 610054, Peoples R China

[5] Curtin Univ, Sch Informat Syst, Perth, WA 6845, Australia

来源：

INTERNATIONAL JOURNAL OF APPROXIMATE REASONING | 2015年 / 57卷

关键词：

Multiagent time-critical decision making; Interactive dynamic influence diagram; Model expansion; DECISION; FRAMEWORK;

D O I：

10.1016/j.ijar.2014.11.004

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Multiagent time-critical dynamic decision making is a challenging task in many real-world applications where a trade-off between solution quality and computational tractability is required. In this paper, we present a formal representation for modelling time-critical multiagent dynamic decision problems based on interactive dynamic influence diagrams (I-DIDs). The new representation called time-critical I-DIDs (TC-IDIDs) represents space-temporal abstraction by providing time-index to nodes and the model is defined in terms of the condensed and deployed forms. The condensed form is a static model of TC-IDIDs and can be expanded into its dynamic version. To facilitate the conversion between the two forms, we exploit the notion of object-orientation design to develop flexible and reusable TC-IDIDs. The difficulty on expanding TC-1DIDs is to select a proper time sequence to index nodes in the condensed form so that the expanded TC-IDIDs can be solved efficiently without compromising the quality of the policy. For this purpose, we propose two methods to build the condensed form of TC-IDIDs. We evaluate the solution quality and time complexity in three well-studied problems and provide results in support. (C) 2014 Elsevier Inc. All rights reserved.

引用

页码：44 / 63

页数：20

共 34 条

[1]

[Anonymous], 2001, Games and Economic Behavior

[2] Interactive epistemology II: Probability [J].

Aumann, RJ .

INTERNATIONAL JOURNAL OF GAME THEORY, 1999, 28 (03) :301-314

[3]

Bangs O., 2003, P 8 SCAND C ART INT, P25

[4]

Bulitko V., 2002, P AAAI KDD UAI 2002, P37

[5]

Cohen I., 2008, Decision Analysis, V5, P100

[6]

Cover T. M., 2006, Elements of information theory, Vsecond, DOI [DOI 10.1002/047174882X, 10.1002/0471200611]

[7] DECISION-THEORETIC CONTROL OF INFERENCE FOR TIME-CRITICAL APPLICATIONS [J].

DEAN, T .

INTERNATIONAL JOURNAL OF INTELLIGENT SYSTEMS, 1991, 6 (04) :417-441

[8]

Doshi P., 2005, P AAAI C ART INT, P969

[9]

Doshi P., 2011, P IEEE WIC ACM INT C, P165

[10] Graphical models for interactive POMDPs: representations and solutions [J].

Doshi, Prashant ;

Zeng, Yifeng ;

Chen, Qiongyu .

AUTONOMOUS AGENTS AND MULTI-AGENT SYSTEMS, 2009, 18 (03) :376-416

← 1 2 3 4 →