Enhanced Frequent Itemsets Based on Topic Modeling in Information Filtering

被引:0
作者
Than Than Wai [1 ]
Aung, Sint Sint [1 ]
机构
[1] Univ Comp Studies, Mandalay, Myanmar
来源
2017 16TH IEEE/ACIS INTERNATIONAL CONFERENCE ON COMPUTER AND INFORMATION SCIENCE (ICIS 2017) | 2017年
关键词
Topic Modeling; Pattern Mining; Information Filtering; User Interest Modeling; Text Documents;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In order to generate user's information needs from a collection of documents, many term-based and pattern-based approaches have been used in Information Filtering. In these approaches, the documents in the collection are all about one topic. However, user's interests can be diverse and the documents in the collection often involve multiple topics. Topic modeling is useful for the area of machine learning and text mining. It generates models to discover the hidden multiple topics in a collection of documents and each of these topics are presented by distribution of words. But its effectiveness in information filtering has not been so well explored. Patterns are always thought to be more discriminative than single terms for describing documents. The major challenge found in frequent pattern mining is a large number of result patterns. As the minimum threshold becomes lower, an exponentially large number of patterns are generated. To deal with the above mentioned limitations and problems, in this paper, a novel information filtering model, EFITM (Enhanced Frequent Itemsets based on Topic Model) model is proposed. Experimental results using the CRANFIELD dataset for the task of information filtering show that the proposed model outperforms over state-of-the-art models.
引用
收藏
页码:155 / 160
页数:6
相关论文
共 21 条
[1]  
[Anonymous], 2002, P 8 ACM SIGKDD INT C, DOI DOI 10.1145/775047.775110
[2]  
Azzopardi L, 2004, IEEE IJCNN, P3281
[3]  
Bastide Y., 2000, SIGKDD EXPLORATIONS, V2, P66, DOI DOI 10.1145/380995.381017
[4]   Latent Dirichlet allocation [J].
Blei, DM ;
Ng, AY ;
Jordan, MI .
JOURNAL OF MACHINE LEARNING RESEARCH, 2003, 3 (4-5) :993-1022
[5]  
Chauhan Sapna, 2013, INT J ENG SCI INVENT, V2
[6]  
Cheng H, 2007, PROC INT CONF DATA, P691
[7]  
Christopher H. S., 2009, INTRO INFORM RETRIEV
[8]  
Funkranz J., 1998, AUSTR RES I ARTIF IN, V3
[9]  
Gao Yang, 2015, IEEE TRANSACTION KNO
[10]   Frequent pattern mining: current status and future directions [J].
Han, Jiawei ;
Cheng, Hong ;
Xin, Dong ;
Yan, Xifeng .
DATA MINING AND KNOWLEDGE DISCOVERY, 2007, 15 (01) :55-86