Nonparametric Topic Modeling with Neural Inference

Cited by: 8

Authors:

Ning, Xuefei [1]

Zheng, Yin [2]

Jiang, Zhuxi [3]

Wang, Yu [1]

Yang, Huazhong [1]

Huang, Junzhou [4]

Zhao, Peilin [4]

Affiliations:

[1] Tsinghua Univ, Beijing, Peoples R China

[2] Tencent, Weixin Grp, Shenzhen, Peoples R China

[3] Momenta, Beijing, Peoples R China

[4] Tencent AI Lab, Shenzhen, Peoples R China

Source:

NEUROCOMPUTING | 2020 / Volume 399 / Issue 399

Funding:

National Natural Science Foundation of China;

Keywords:

DOI:

10.1016/j.neucom.2019.12.128

Chinese Library Classification number:

TP18 [Artificial intelligence theory];

Subject classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

This work focuses on combining nonparametric topic models with Auto-Encoding Variational Bayes (AEVB). Specifically, we first propose iTM-VAE, where the topics are treated as trainable parameters and the document-specific topic proportions are obtained by a stick-breaking construction. The inference of iTM-VAE is modeled by neural networks such that it can be computed in a simple feed-forward manner. We also describe how to introduce a hyper-prior into iTM-VAE so as to model the uncertainty of the prior parameter. Actually, the hyper-prior technique is quite general and we show that it can be applied to other AEVB based models to alleviate the collapse-to-prior problem elegantly. Moreover, we also propose HiTM-VAE, where the document-specific topic distributions are generated in a hierarchical manner. HiTM-VAE is even more flexible and can generate topic representations with better variability and sparsity. Experimental results on 20News and Reuters RCV1-V2 datasets show that the proposed models outperform the state-of-the-art baselines significantly. The advantages of the hyper-prior technique and the hierarchical model construction are also confirmed by experiments. (c) 2020 Elsevier B.V. All rights reserved.
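The stick-breaking construction mentioned in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: it assumes a truncated construction in which break fractions are drawn from Beta(1, alpha), whereas in the described model the fractions would come from a neural inference network. The names `stick_breaking`, `sample_truncated_prior`, `alpha`, and `truncation` are hypothetical and chosen only for illustration.

```python
import random


def stick_breaking(fractions):
    """Map break fractions v_1..v_{K-1}, each in (0, 1), to K proportions.

    pi_k = v_k * prod_{j<k} (1 - v_j); the final proportion takes whatever
    stick length is left, so the result always sums to 1.
    """
    proportions = []
    remaining = 1.0  # length of the stick not yet assigned to a topic
    for v in fractions:
        proportions.append(v * remaining)
        remaining *= 1.0 - v
    proportions.append(remaining)  # truncation: last topic gets the remainder
    return proportions


def sample_truncated_prior(alpha, truncation, rng):
    """Assumed prior for illustration: v_k ~ Beta(1, alpha), truncated at K topics."""
    fractions = [rng.betavariate(1.0, alpha) for _ in range(truncation - 1)]
    return stick_breaking(fractions)


rng = random.Random(0)
pi = sample_truncated_prior(alpha=5.0, truncation=10, rng=rng)
```

Under this assumption, a smaller `alpha` tends to put most of the mass on the first few topics, while a larger `alpha` spreads it over more of them.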

Pages: 296-306

Page count: 11

50 items in total

[31] Neural Topic Modeling with Bidirectional Adversarial Training
Wang, Rui
Hu, Xuemeng
Zhou, Deyu
He, Yulan
Xiong, Yuxuan
Ye, Chenchen
Xu, Haiyang
58TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2020), 2020: 340 - 350
[32] Leveraging spiking neural networks for topic modeling
Bialas, Marcin
Mironczuk, Marcin Michal
Mandziuk, Jacek
NEURAL NETWORKS, 2024, 178
[33] Neural Topic Modeling with Continual Lifelong Learning
Gupta, Pankaj
Chaudhary, Yatin
Runkler, Thomas
Schuetze, Hinrich
INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 119, 2020, 119
[34] Neural Embedded Dirichlet Processes for Topic Modeling
Palencia-Olivar, Miguel
Bonnevay, Stephane
Aussem, Alexandre
Canitia, Bruno
MODELING DECISIONS FOR ARTIFICIAL INTELLIGENCE (MDAI 2021), 2021, 12898 : 299 - 310
[35] Hierarchical neural topic modeling with manifold regularization
Chen, Ziye
Ding, Cheng
Rao, Yanghui
Xie, Haoran
Tao, Xiaohui
Cheng, Gary
Wang, Fu Lee
WORLD WIDE WEB-INTERNET AND WEB INFORMATION SYSTEMS, 2021, 24 (06): 2139 - 2160
[36] Improving Context Modeling in Neural Topic Segmentation
Xing, Linzi
Hackinen, Brad
Carenini, Giuseppe
Trebbi, Francesco
1ST CONFERENCE OF THE ASIA-PACIFIC CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS AND THE 10TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING (AACL-IJCNLP 2020), 2020: 626 - 636
[37] A geometry-driven neural topic model for trip purpose inference
Jiaqi Zhang
Zipei Fan
Xuan Song
Ryosuke Shibasaki
GeoInformatica, 2024, 28 : 313 - 333
[38] A geometry-driven neural topic model for trip purpose inference
Zhang, Jiaqi
Fan, Zipei
Song, Xuan
Shibasaki, Ryosuke
GEOINFORMATICA, 2024, 28 (02) : 313 - 333
[39] Effective interrelation of Bayesian nonparametric document clustering and embedded-topic modeling
Costa, Gianni
Ortale, Riccardo
KNOWLEDGE-BASED SYSTEMS, 2021, 234
[40] Bayesian Nonparametric Modeling of Categorical Data for Information Fusion and Causal Inference
Xiong, Sihan
Fu, Yiwei
Ray, Asok
ENTROPY, 2018, 20 (06)
