Mining Causal Association Rules

被引:30
作者
Li, Jiuyong [1 ]
Thuc Duy Le [1 ]
Liu, Lin [1 ]
Liu, Jixue [1 ]
Jin, Zhou [2 ,3 ]
Sun, Bingyu [2 ]
机构
[1] Univ S Australia, Sch Informat Technol & Math Sci, Mawson Lakes, SA 5095, Australia
[2] Chinese Acad Sci, Inst Machine Intelligence, Hefei 230031, Peoples R China
[3] Univ Sci & Technol China, Dept Automat, Anhua 230026, Peoples R China
来源
2013 IEEE 13TH INTERNATIONAL CONFERENCE ON DATA MINING WORKSHOPS (ICDMW) | 2013年
关键词
causal discovery; association rules; cohort study; odds ratio; BAYESIAN NETWORKS; CLASSIFICATION;
D O I
10.1109/ICDMW.2013.88
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Discovering causal relationships is the ultimate goal of many scientific explorations. Causal relationships can be identified with controlled experiments, but such experiments are often very expensive and sometimes impossible to conduct. On the other hand, the collection of observational data has increased dramatically in recent decades. Therefore it is desirable to find causal relationships from the data directly. Significant progress has been made in the field of discovering causal relationships using the Causal Bayesian Network (CBN) theory. The applications of CBNs, however, are greatly limited due to the high computational complexity. In another direction, association rule mining has been shown to be an efficient data mining means for relationship discovery. However, although causal relationships imply associations, the reverse does not always hold. In this paper we study how to use an efficient association mining approach to discover potential causal rules in observational data. We make use of the idea of retrospective cohort studies, a widely used approach in medical and social research, to detect causal association rules. In comparison with the constraint-based methods within the CBN paradigm, the proposed approach is faster and is capable of finding a cause consisting of combined variables.
引用
收藏
页码:114 / 123
页数:10
相关论文
共 36 条
[1]  
Agrawal R., 1993, SIGMOD Record, V22, P207, DOI 10.1145/170036.170072
[2]  
Agrawal R., 1996, ADV KNOWLEDGE DISCOV, V12, P307, DOI DOI 10.1007/978-3-319-31750-2.
[3]  
Aliferis CF, 2010, J MACH LEARN RES, V11, P171
[4]  
[Anonymous], 1956, The Direction of Time
[5]  
[Anonymous], 2005, Data Mining: Concepts and Techniques
[6]  
[Anonymous], 2003, Statistical Methods for Rates and Proportions
[7]  
[Anonymous], 1991, KR
[8]  
[Anonymous], 2007, Uci machine learning repository
[9]   Classification of human lung carcinomas by mRNA expression profiling reveals distinct adenocarcinoma subclasses [J].
Bhattacharjee, A ;
Richards, WG ;
Staunton, J ;
Li, C ;
Monti, S ;
Vasa, P ;
Ladd, C ;
Beheshti, J ;
Bueno, R ;
Gillette, M ;
Loda, M ;
Weber, G ;
Mark, EJ ;
Lander, ES ;
Wong, W ;
Johnson, BE ;
Golub, TR ;
Sugarbaker, DJ ;
Meyerson, M .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2001, 98 (24) :13790-13795
[10]  
Borgelt C., 2003, IEEE ICDM WORKSH FRE, P24