Maximal association rules: A tool for mining associations in text

被引:21
作者
Amir, A [1 ]
Aumann, Y [1 ]
Feldman, R [1 ]
Fresko, M [1 ]
机构
[1] Bar Ilan Univ, Dept Comp Sci, IL-52900 Ramat Gan, Israel
关键词
text mining; association rules; data mining;
D O I
10.1007/s10844-005-0196-9
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We describe a new tool for mining association rules, which is of special value in text mining. The new tool, called maximal associations, is geared toward discovering associations that are frequently lost when using regular association rules. Intuitively, a maximal association rule X double right arrow(max) Y says that whenever X is the only item of its type in a transaction, than Y also appears, with some confidence. Maximal associations allow the discovery of associations pertaining to items that most often do not appear alone, but rather together with closely related items, and hence associations relevant only to these items tend to obtain low confidence. We provide a formal description of maximal association rules and efficient algorithms for discovering all such associations. We present the results of applying maximal association rules to two text corpora.
引用
收藏
页码:333 / 345
页数:13
相关论文
共 28 条
[1]  
Agrawal R., 1993, SIGMOD Record, V22, P207, DOI 10.1145/170036.170072
[2]  
Ahonen H., 1997, C199723 U HELS
[3]   An experiment in discovering association rules in the legal domain [J].
Bench-Capon, T ;
Coenen, F ;
Leng, P .
11TH INTERNATIONAL WORKSHOP ON DATABASE AND EXPERT SYSTEMS APPLICATION, PROCEEDINGS, 2000, :1056-1060
[4]  
Brijs T., 1999, P 5 ACM SIGKDD INT C, P254, DOI 10.1145/312129.312241
[5]  
Brin S., 1997, P 1997 ACM SIGMOD IN, P265, DOI DOI 10.1145/253262.253327
[6]   Mining association rules with weighted items [J].
Cai, CH ;
Fu, AWC ;
Cheng, CH ;
Kwong, WW .
IDEAS 98 - INTERNATIONAL DATABASE ENGINEERING AND APPLICATIONS SYMPOSIUM, PROCEEDINGS, 1998, :68-77
[7]  
DONG JN, 2000, SAC, P340
[8]  
Fayyad U., 1996, KDD-96 Proceedings. Second International Conference on Knowledge Discovery and Data Mining, P82
[9]   Mining text using keyword distributions [J].
Feldman, R ;
Dagan, I ;
Hirsh, H .
JOURNAL OF INTELLIGENT INFORMATION SYSTEMS, 1998, 10 (03) :281-300
[10]  
Feldman R., 1996, KDD-96 Proceedings. Second International Conference on Knowledge Discovery and Data Mining, P343