Learning Topic Models by Belief Propagation

Cited by: 34
Authors
Zeng, Jia [1]
Cheung, William K. [2]
Liu, Jiming [2]
Affiliations
[1] Soochow Univ, Sch Comp Sci & Technol, Suzhou 215006, Peoples R China
[2] Hong Kong Baptist Univ, Dept Comp Sci, Kowloon Tong, Hong Kong, Peoples R China
Keywords
Latent Dirichlet allocation; topic models; belief propagation; message passing; factor graph; Bayesian networks; Markov random fields; hierarchical Bayesian models; Gibbs sampling; variational Bayes; EM
DOI
10.1109/TPAMI.2012.185
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Latent Dirichlet allocation (LDA) is an important hierarchical Bayesian model for probabilistic topic modeling that has attracted worldwide interest and has many important applications in text mining, computer vision, and computational biology. This paper represents the collapsed LDA as a factor graph, which enables the classic loopy belief propagation (BP) algorithm for approximate inference and parameter estimation. Although two commonly used approximate inference methods, variational Bayes (VB) and collapsed Gibbs sampling (GS), have achieved great success in learning LDA, the proposed BP is competitive in both speed and accuracy, as validated by encouraging experimental results on four large-scale document datasets. Furthermore, the BP algorithm has the potential to become a generic scheme for learning variants of LDA-based topic models in the collapsed space. To this end, we show how to learn two typical variants of LDA-based topic models, author-topic models (ATM) and relational topic models (RTM), using BP based on the factor graph representations.
Pages: 1121-1134
Number of pages: 14