Pattern-based Topic Models for Information Filtering

被引:8
|
作者
Gao, Yang [1 ]
Xu, Yue [1 ]
Li, Yuefeng [1 ]
机构
[1] QUT, Fac Sci & Engn, Brisbane, Qld, Australia
来源
2013 IEEE 13TH INTERNATIONAL CONFERENCE ON DATA MINING WORKSHOPS (ICDMW) | 2013年
关键词
Topic models; user modelling; pattern mining; closed pattern; information filtering;
D O I
10.1109/ICDMW.2013.30
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Topic modelling, such as Latent Dirichlet Allocation (LDA), was proposed to generate statistical models to represent multiple topics in a collection of documents, which has been widely utilized in the fields of machine learning and information retrieval, etc. But its effectiveness in information filtering is rarely known. Patterns are always thought to be more representative than single terms for representing documents. In this paper, a novel information filtering model, Pattern-based Topic Model (PBTM), is proposed to represent the text documents not only using the topic distributions at general level but also using semantic pattern representations at detailed specific level, both of which contribute to the accurate document representation and document relevance ranking. Extensive experiments are conducted to evaluate the effectiveness of PBTM by using the TREC data collection Reuters Corpus Volume 1. The results show that the proposed model achieves outstanding performance.
引用
收藏
页码:921 / 928
页数:8
相关论文
共 50 条
  • [11] BicPAM: Pattern-based biclustering for biomedical data analysis
    Henriques, Rui
    Madeira, Sara C.
    ALGORITHMS FOR MOLECULAR BIOLOGY, 2014, 9
  • [12] Behaviour Pattern-Based Model Generation for Model-Based Testing
    Kanstren, Teemu
    2009 COMPUTATION WORLD: FUTURE COMPUTING, SERVICE COMPUTATION, COGNITIVE, ADAPTIVE, CONTENT, PATTERNS, 2009, : 233 - 241
  • [13] Pattern Based Topic Model for Data Mining
    Jadhav, B. S.
    Bhosale, D. S.
    Jadhav, D. S.
    2016 INTERNATIONAL CONFERENCE ON INVENTIVE COMPUTATION TECHNOLOGIES (ICICT), VOL 2, 2016, : 382 - 387
  • [14] BicPAMS: software for biological data analysis with pattern-based biclustering
    Rui Henriques
    Francisco L. Ferreira
    Sara C. Madeira
    BMC Bioinformatics, 18
  • [15] Pattern-based Fall Prediction using Hospital Clinical Notes
    Wijesinghe, Yashodhya, V
    Xu, Yue
    Li, Yuefeng
    Zhang, Qing
    2020 IEEE/WIC/ACM INTERNATIONAL JOINT CONFERENCE ON WEB INTELLIGENCE AND INTELLIGENT AGENT TECHNOLOGY (WI-IAT 2020), 2020, : 433 - 437
  • [16] Information filtering based on personalized topology information
    Chen, Bolun
    Chen, Ling
    2015 THIRD INTERNATIONAL CONFERENCE ON ADVANCED CLOUD AND BIG DATA, 2015, : 184 - 189
  • [17] Multi-labeling with topic models for searching security information
    Osada, Yuki
    Nagasawa, Ryusei
    Shiraishi, Yoshiaki
    Takita, Makoto
    Furumoto, Keisuke
    Takahashi, Takeshi
    Mohri, Masami
    Morii, Masakatu
    ANNALS OF TELECOMMUNICATIONS, 2022, 77 (11-12) : 777 - 788
  • [18] Multi-labeling with topic models for searching security information
    Yuki Osada
    Ryusei Nagasawa
    Yoshiaki Shiraishi
    Makoto Takita
    Keisuke Furumoto
    Takeshi Takahashi
    Masami Mohri
    Masakatu Morii
    Annals of Telecommunications, 2022, 77 : 777 - 788
  • [19] Topic distillation and spectral filtering
    Chakrabarti, S
    Dom, BE
    Gibson, D
    Kumar, R
    Raghavan, P
    Rajagopalan, S
    Tomkins, A
    ARTIFICIAL INTELLIGENCE REVIEW, 1999, 13 (5-6) : 409 - 435
  • [20] Topic Distillation and Spectral Filtering
    Soumen Chakrabarti
    Byron E. Dom
    David Gibson
    Ravi Kumar
    Prabhakar Raghavan
    Sridhar Rajagopalan
    Andrew Tomkins
    Artificial Intelligence Review, 1999, 13 : 409 - 435