Lifelong Hierarchical Topic Modeling via Non-negative Matrix Factorization

Cited by: 0
Authors
Lin, Zhicheng [1]
Yan, Jiaxing [1]
Lei, Zhiqi [1]
Rao, Yanghui [1]
Affiliations
[1] Sun Yat Sen Univ, Sch Data & Comp Sci, Guangzhou, Peoples R China
Source
WEB AND BIG DATA, PT IV, APWEB-WAIM 2023 | 2024 / Vol. 14334
Funding
National Natural Science Foundation of China;
Keywords
Hierarchical topic model; Semantic knowledge graph; Non-negative matrix factorization;
DOI
10.1007/978-981-97-2421-5_11
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Subject classification codes
081104; 0812; 0835; 1405;
Abstract
Hierarchical topic modeling is widely used to mine the latent topic hierarchy of documents. However, most such models are limited to a one-shot scenario because they do not use previously identified topic information to guide subsequent topic mining. By storing and exploiting previous knowledge, we propose a lifelong hierarchical topic model based on Non-negative Matrix Factorization (NMF) that boosts topic quality over a text stream. In particular, we construct a knowledge graph from the accumulated topic hierarchy information and use it to guide the training of our model on future documents. Moreover, the structure information in the knowledge graph is completed by supervised learning. Experiments on real-world corpora validate the effectiveness of our approach under lifelong learning paradigms.
Pages: 155-170
Number of pages: 16
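
The abstract builds on Non-negative Matrix Factorization for topic modeling. As a rough illustration of that building block only, the sketch below factorizes a small document-term matrix with the standard multiplicative-update rule. It is not the authors' lifelong hierarchical model: it has no hierarchy, no knowledge graph, and no text stream. The function name `nmf_topics`, the toy matrix `X`, and the vocabulary are all assumptions made for this example.

```python
import numpy as np

def nmf_topics(X, n_topics, n_iter=200, seed=0, eps=1e-9):
    """Approximate a non-negative matrix X (docs x terms) as W @ H.

    W (docs x topics) holds document-topic weights and
    H (topics x terms) holds topic-term weights.
    Uses plain multiplicative updates for the Frobenius loss;
    this is generic NMF, not the method of the paper.
    """
    rng = np.random.default_rng(seed)
    n_docs, n_terms = X.shape
    # Strictly positive random initialization keeps the updates well defined.
    W = rng.random((n_docs, n_topics)) + eps
    H = rng.random((n_topics, n_terms)) + eps
    for _ in range(n_iter):
        # eps in the denominators guards against division by zero.
        H *= (W.T @ X) / (W.T @ W @ H + eps)
        W *= (X @ H.T) / (W @ H @ H.T + eps)
    return W, H

# Hypothetical toy corpus: two documents about sports, two about finance.
vocab = ["ball", "team", "goal", "bank", "loan", "rate"]
X = np.array([
    [3, 2, 1, 0, 0, 0],
    [2, 3, 2, 0, 0, 0],
    [0, 0, 0, 3, 1, 2],
    [0, 0, 0, 2, 2, 3],
], dtype=float)

W, H = nmf_topics(X, n_topics=2)
error = np.linalg.norm(X - W @ H)

# The highest-weighted terms in each row of H describe one topic.
top_terms = [[vocab[j] for j in np.argsort(-H[k])[:3]] for k in range(H.shape[0])]
print(top_terms, round(error, 3))
```

A lifelong variant in the spirit of the abstract would presumably add a regularization term that ties `H` to knowledge stored from earlier batches, but the exact form used in the paper is not given in this record.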