From data to global generalized knowledge

被引:9
作者
Chen, Yen-Liang [1 ]
Wu, Yu-Ying [2 ]
Chang, Ray-I [3 ]
机构
[1] Natl Cent Univ, Dept Informat Management, Jhongli, Taiwan
[2] Nanya Inst Technol, Dept Informat Management, Jhongli, Taiwan
[3] Natl Taiwan Univ, Dept Engn Sci & Ocean Engn, Taipei 10764, Taiwan
关键词
Attribute-oriented induction; Data mining; Multiple-level mining; Generalized knowledge; ATTRIBUTE-ORIENTED INDUCTION; ASSOCIATION RULES; DISCOVERY; DATABASES; ALGORITHM;
D O I
10.1016/j.dss.2011.08.005
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The attribute-oriented induction (AOI) is a useful data mining method that extracts generalized knowledge from relational data and user's background knowledge. The method uses two thresholds, the relation threshold and attribute threshold, to guide the generalization process, and output generalized knowledge, a set of generalized tuples which describes the major characteristics of the target relation. Although AOI has been widely used in various applications, a potential weakness of this method is that it only provides a snapshot of the generalized knowledge, not a global picture. When thresholds are different, we would obtain different sets of generalized tuples, which also describe the major characteristics of the target relation. If a user wants to ascertain a global picture of induction, he or she must try different thresholds repeatedly. That is time-consuming and tedious. In this study, we propose a global AOI (GAOI) method, which employs the multiple-level mining technique with multiple minimum supports to generate all interesting generalized knowledge at one time. Experiment results on real-life dataset show that the proposed method is effective in finding global generalized knowledge. (C) 2011 Elsevier B.V. All rights reserved.
引用
收藏
页码:295 / 307
页数:13
相关论文
共 34 条
[1]   Use of data mining techniques to model crime scene investigator performance [J].
Adderley, Richard ;
Townsley, Michael ;
Bond, John .
KNOWLEDGE-BASED SYSTEMS, 2007, 20 (02) :170-176
[2]  
Agrawal R., 1994, P 20 INT C VER LARG, P487, DOI DOI 10.5555/645920.672836
[3]  
Angryk RA, 2005, IEEE INT CONF FUZZY, P785
[4]  
[Anonymous], CREDIT CARD DATASET
[5]  
[Anonymous], 2011, Pei. data mining concepts and techniques
[6]  
Beaubouef T., 2007, P 20 INT FLOR ART IN, P507
[7]   Data mining for credit card fraud: A comparative study [J].
Bhattacharyya, Siddhartha ;
Jha, Sanjeev ;
Tharakunnel, Kurian ;
Westland, J. Christopher .
DECISION SUPPORT SYSTEMS, 2011, 50 (03) :602-613
[8]  
CAI YD, 1990, PROCEEDINGS : 6TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING, P281
[9]   Efficient attribute-oriented generalization for knowledge discovery from large databases [J].
Carter, CL ;
Hamilton, HJ .
IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 1998, 10 (02) :193-208
[10]   Data mining: An overview from a database perspective [J].
Chen, MS ;
Han, JW ;
Yu, PS .
IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 1996, 8 (06) :866-883