Lifelong Hierarchical Topic Modeling via Non-negative Matrix Factorization

被引:0
作者
Lin, Zhicheng [1 ]
Yan, Jiaxing [1 ]
Lei, Zhiqi [1 ]
Rao, Yanghui [1 ]
机构
[1] Sun Yat Sen Univ, Sch Data & Comp Sci, Guangzhou, Peoples R China
来源
WEB AND BIG DATA, PT IV, APWEB-WAIM 2023 | 2024年 / 14334卷
基金
中国国家自然科学基金;
关键词
Hierarchical topic model; Semantic knowledge graph; Non-negative matrix factorization;
D O I
10.1007/978-981-97-2421-5_11
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Hierarchical topic modeling has been widely used in mining the latent topic hierarchy of documents. However, most of such models are limited to a one-shot scenario since they do not use the identified topic information to guide the subsequent mining of topics. By storing and exploiting the previous knowledge, we propose a lifelong hierarchical topic model based on Non-negative Matrix Factorization (NMF) for boosting the topic quality over a text stream. In particular, we construct a knowledge graph by the accumulated topic hierarchy information and use the knowledge graph to guide the training of our model on future documents. Moreover, the structure information in the knowledge graph is completed by supervised learning. Experiments on real-world corpora validate the effectiveness of our approach on lifelong learning paradigms.
引用
收藏
页码:155 / 170
页数:16
相关论文
共 44 条
  • [21] Kim Joon Hee, 2012, CIKM, P783, DOI DOI 10.1145/2396761.2396861
  • [22] Kingma D. P., ADAM METHOD STOCHAST
  • [23] Overcoming catastrophic forgetting in neural networks
    Kirkpatricka, James
    Pascanu, Razvan
    Rabinowitz, Neil
    Veness, Joel
    Desjardins, Guillaume
    Rusu, Andrei A.
    Milan, Kieran
    Quan, John
    Ramalho, Tiago
    Grabska-Barwinska, Agnieszka
    Hassabis, Demis
    Clopath, Claudia
    Kumaran, Dharshan
    Hadsell, Raia
    [J]. PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2017, 114 (13) : 3521 - 3526
  • [24] Lau J.H., 2014, P 14 C EUROPEAN CHAP, P530, DOI [10.3115/v1/E14-1056, DOI 10.3115/V1/E14-1056]
  • [25] Lee DD, 2001, ADV NEUR IN, V13, P556
  • [26] Sparsemax and Relaxed Wasserstein for Topic Sparsity
    Lin, Tianyi
    Hu, Zhiyue
    Guo, Xin
    [J]. PROCEEDINGS OF THE TWELFTH ACM INTERNATIONAL CONFERENCE ON WEB SEARCH AND DATA MINING (WSDM'19), 2019, : 141 - 149
  • [27] Topic Splitting: A Hierarchical Topic Model Based on Non-Negative Matrix Factorization
    Liu, Rui
    Wang, Xingguang
    Wang, Deqing
    Zuo, Yuan
    Zhang, He
    Zheng, Xianzhu
    [J]. JOURNAL OF SYSTEMS SCIENCE AND SYSTEMS ENGINEERING, 2018, 27 (04) : 479 - 496
  • [28] Miao YS, 2017, PR MACH LEARN RES, V70
  • [29] Mimno David., 2007, Proceedings of the 24th international conference on Machine learning, P633, DOI 10.1145/1273496.1273576
  • [30] Ming ZY, 2010, SIGIR 2010: PROCEEDINGS OF THE 33RD ANNUAL INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH DEVELOPMENT IN INFORMATION RETRIEVAL, P2