Nonparametric Topic Modeling with Neural Inference

被引:8
|
作者
Ning, Xuefei [1 ]
Zheng, Yin [2 ]
Jiang, Zhuxi [3 ]
Wang, Yu [1 ]
Yang, Huazhong [1 ]
Huang, Junzhou [4 ]
Zhao, Peilin [4 ]
机构
[1] Tsinghua Univ, Beijing, Peoples R China
[2] Tencent, Weixin Grp, Shenzhen, Peoples R China
[3] Momenta, Beijing, Peoples R China
[4] Tencent AI Lab, Shenzhen, Peoples R China
基金
中国国家自然科学基金;
关键词
D O I
10.1016/j.neucom.2019.12.128
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This work focuses on combining nonparametric topic models with Auto-Encoding Variational Bayes (AEVB). Specifically, we first propose iTM-VAE, where the topics are treated as trainable parameters and the document-specific topic proportions are obtained by a stick-breaking construction. The inference of iTM-VAE is modeled by neural networks such that it can be computed in a simple feed-forward manner. We also describe how to introduce a hyper-prior into iTM-VAE so as to model the uncertainty of the prior parameter. Actually, the hyper-prior technique is quite general and we show that it can be applied to other AEVB based models to alleviate the collapse-to-prior problem elegantly. Moreover, we also propose HiTM-VAE, where the document-specific topic distributions are generated in a hierarchical manner. HiTM-VAE is even more flexible and can generate topic representations with better variability and sparsity. Experimental results on 20News and Reuters RCV1-V2 datasets show that the proposed models outperform the state-of-the-art baselines significantly. The advantages of the hyper-prior technique and the hierarchical model construction are also confirmed by experiments. (c) 2020 Elsevier B.V. All rights reserved.
引用
收藏
页码:296 / 306
页数:11
相关论文
共 50 条
  • [31] Neural Topic Modeling with Bidirectional Adversarial Training
    Wang, Rui
    Hu, Xuemeng
    Zhou, Deyu
    He, Yulan
    Xiong, Yuxuan
    Ye, Chenchen
    Xu, Haiyang
    58TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2020), 2020, : 340 - 350
  • [32] Leveraging spiking neural networks for topic modeling
    Bialas, Marcin
    Mironczuk, Marcin Michal
    Mandziuk, Jacek
    NEURAL NETWORKS, 2024, 178
  • [33] Neural Topic Modeling with Continual Lifelong Learning
    Gupta, Pankaj
    Chaudhary, Yatin
    Runkler, Thomas
    Schuetze, Hinrich
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 119, 2020, 119
  • [34] Neural Embedded Dirichlet Processes for Topic Modeling
    Palencia-Olivar, Miguel
    Bonnevay, Stephane
    Aussem, Alexandre
    Canitia, Bruno
    MODELING DECISIONS FOR ARTIFICIAL INTELLIGENCE (MDAI 2021), 2021, 12898 : 299 - 310
  • [35] Hierarchical neural topic modeling with manifold regularization
    Chen, Ziye
    Ding, Cheng
    Rao, Yanghui
    Xie, Haoran
    Tao, Xiaohui
    Cheng, Gary
    Wang, Fu Lee
    WORLD WIDE WEB-INTERNET AND WEB INFORMATION SYSTEMS, 2021, 24 (06): : 2139 - 2160
  • [36] Improving Context Modeling in Neural Topic Segmentation
    Xing, Linzi
    Hackinen, Brad
    Carenini, Giuseppe
    Trebbi, Francesco
    1ST CONFERENCE OF THE ASIA-PACIFIC CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS AND THE 10TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING (AACL-IJCNLP 2020), 2020, : 626 - 636
  • [37] A geometry-driven neural topic model for trip purpose inference
    Jiaqi Zhang
    Zipei Fan
    Xuan Song
    Ryosuke Shibasaki
    GeoInformatica, 2024, 28 : 313 - 333
  • [38] A geometry-driven neural topic model for trip purpose inference
    Zhang, Jiaqi
    Fan, Zipei
    Song, Xuan
    Shibasaki, Ryosuke
    GEOINFORMATICA, 2024, 28 (02) : 313 - 333
  • [39] Effective interrelation of Bayesian nonparametric document clustering and embedded-topic modeling
    Costa, Gianni
    Ortale, Riccardo
    KNOWLEDGE-BASED SYSTEMS, 2021, 234
  • [40] Bayesian Nonparametric Modeling of Categorical Data for Information Fusion and Causal Inference
    Xiong, Sihan
    Fu, Yiwei
    Ray, Asok
    ENTROPY, 2018, 20 (06)