COST-tree: Identification of Cosine Interesting Patterns Based on FP-tree

被引:0
作者
Huang, Xiaojing [1 ]
Wu, Junjie [1 ]
Zhu, Shiwei [1 ]
Xiong, Hui [2 ]
机构
[1] Beihang Univ, Sch Econ & Management, Beijing, Peoples R China
[2] Rutgers State Univ, Rutgers Business Sch, New Brunswick, NJ 08901 USA
来源
ELECTRONIC-BUSINESS INTELLIGENCE: FOR CORPORATE COMPETITIVE ADVANTAGES IN THE AGE OF EMERGING TECHNOLOGIES & GLOBALIZATION | 2010年 / 14卷
关键词
Interestingness Measure; Cosine Measure; Conditional Anti-Monotone Property; FP-tree;
D O I
暂无
中图分类号
F [经济];
学科分类号
02 ;
摘要
The cosine similarity, also known as un-centered Pearson Correlation, has been widely used for mining association patterns, which contain objects strongly related to each other. However, it is often used as a post-evaluation measure and is computationally prohibitive for large data. To this end, we develop an FP-tree like algorithm, named COST-tree, for finding association patterns based on the cosine measure. A key idea is to combine the strength of the FP-tree structure and the Conditional Anti-Monotone Property of the cosine measure. Experimental results on real-world data demonstrate the effectiveness of COST-tree, in particular for finding rare but interesting patterns at extremely low support levels.
引用
收藏
页码:456 / +
页数:2
相关论文
共 9 条
[1]  
[Anonymous], 2005, Data Mining: Concepts and Techniques
[2]   Mining frequent patterns without candidate generation: A frequent-pattern tree approach [J].
Han, JW ;
Pei, J ;
Yin, YW ;
Mao, RY .
DATA MINING AND KNOWLEDGE DISCOVERY, 2004, 8 (01) :53-87
[3]  
RYMON R, KR 92
[4]  
Tan P-N, 2016, Introduction to data mining
[5]  
Tan P-N., KDD 02
[6]  
Wu T., PKDD 07
[7]  
Xiong H, 2003, THIRD IEEE INTERNATIONAL CONFERENCE ON DATA MINING, PROCEEDINGS, P387
[8]  
Zhu S., 2009, COSINE INTERESTING P
[9]  
Zhu S., 2010, DKE IN PRESS