Discovery of Causal Rules Using Partial Association

被引:22
作者
Jin, Zhou [1 ]
Li, Jiuyong [2 ]
Liu, Lin [2 ]
Thuc Duy Le [2 ]
Sun, Bingyu [3 ]
Wang, Rujing [3 ]
机构
[1] Univ Sci & Technol China, Dept Automat, Hefei 230026, Peoples R China
[2] Univ South Australia, Sch Comp & Informat Sci, Mawson Lakes, SA 5095, Australia
[3] Chinese Acad Sci, Inst Machines Intelligence, Hefei 230031, Peoples R China
来源
12TH IEEE INTERNATIONAL CONFERENCE ON DATA MINING (ICDM 2012) | 2012年
关键词
data mining; causality; partial association; causal rule;
D O I
10.1109/ICDM.2012.36
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Discovering causal relationships in large databases of observational data is challenging. The pioneering work in this area was rooted in the theory of Bayesian network (BN) learning, which however, is a NP-complete problem. Hence several constraint-based algorithms have been developed to efficiently discover causations in large databases. These methods usually use the idea of BN learning, directly or indirectly, and are focused on causal relationships with single cause variables. In this paper, we propose an approach to mine causal rules in large databases of binary variables. Our method expands the scope of causality discovery to causal relationships with multiple cause variables, and we utilise partial association tests to exclude noncausal associations, to ensure the high reliability of discovered causal rules. Furthermore an efficient algorithm is designed for the tests in large databases. We assess the method with a set of real-world diagnostic data. The results show that our method can effectively discover interesting causal rules in large databases.
引用
收藏
页码:309 / 318
页数:10
相关论文
共 24 条
[1]  
Aliferis CF, 2010, J MACH LEARN RES, V11, P171
[2]  
[Anonymous], 1956, The Direction of Time
[3]  
[Anonymous], 2002, EXPT QUASIEXPERIMENT
[4]  
[Anonymous], 1991, KR
[5]  
[Anonymous], P VLDB
[6]  
BIRCH MW, 1964, J ROY STAT SOC B, V26, P313
[7]  
Chickering D.M., 1996, TECHNICAL REPORT
[8]  
Cooper G., 1997, TECHNICAL REPORT
[9]   A simple constraint-based algorithm for efficiently mining observational databases for causal relationships [J].
Cooper, GF .
DATA MINING AND KNOWLEDGE DISCOVERY, 1997, 1 (02) :203-224
[10]  
Frank A., 2010, UCI machine learning repository, V213