Nested Hierarchical Dirichlet Processes

被引:124
|
作者
Paisley, John [1 ]
Wang, Chong [2 ]
Blei, David M. [3 ]
Jordan, Michael I. [4 ,5 ]
机构
[1] Columbia Univ, Dept Elect Engn, New York, NY 10027 USA
[2] Voleon Capital Management, Berkeley, CA USA
[3] Princeton Univ, Dept Comp Sci, Princeton, NJ 08544 USA
[4] Univ Calif Berkeley, Dept EECS, Berkeley, CA USA
[5] Univ Calif Berkeley, Dept Stat, Berkeley, CA USA
关键词
Bayesian nonparametrics; Dirichlet process; topic modeling; stochastic optimization;
D O I
10.1109/TPAMI.2014.2318728
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We develop a nested hierarchical Dirichlet process (nHDP) for hierarchical topic modeling. The nHDP generalizes the nested Chinese restaurant process (nCRP) to allow each word to follow its own path to a topic node according to a per-document distribution over the paths on a shared tree. This alleviates the rigid, single-path formulation assumed by the nCRP, allowing documents to easily express complex thematic borrowings. We derive a stochastic variational inference algorithm for the model, which enables efficient inference for massive collections of text documents. We demonstrate our algorithm on 1.8 million documents from The New York Times and 2.7 million documents from Wikipedia.
引用
收藏
页码:256 / 270
页数:15
相关论文
共 50 条
  • [1] Hierarchical topic modeling with nested hierarchical Dirichlet process
    Ding, Yi-qun
    Li, Shan-ping
    Zhang, Zhen
    Shen, Bin
    JOURNAL OF ZHEJIANG UNIVERSITY-SCIENCE A, 2009, 10 (06): : 858 - 867
  • [2] Hierarchical topic modeling with nested hierarchical Dirichlet process
    Yi-qun Ding
    Shan-ping Li
    Zhen Zhang
    Bin Shen
    Journal of Zhejiang University-SCIENCE A, 2009, 10 : 858 - 867
  • [3] Hierarchical Dirichlet processes
    Teh, Yee Whye
    Jordan, Michael I.
    Beal, Matthew J.
    Blei, David M.
    JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2006, 101 (476) : 1566 - 1581
  • [4] Hierarchical topic modeling with nested hierarchical Dirichlet process附视频
    Yiqun DING Shanping LI Zhen ZHANG Bin SHEN School of Computer Science and Technology Zhejiang University Hangzhou China State Street Hangzhou Hangzhou China
    Journal of Zhejiang University(Science A:An International Applied Physics & Engineering Journal), 2009, (06) : 858 - 867
  • [5] Hierarchical Dirichlet processes and their applications: a survey
    Zhou J.-Y.
    Wang F.-Y.
    Zeng D.-J.
    Zidonghua Xuebao/Acta Automatica Sinica, 2011, 37 (04): : 389 - 407
  • [6] Hierarchical Dirichlet Processes with Social Influence
    Qian, Jin
    Gong, Yeyun
    Zhang, Qi
    Huang, Xuanjing
    NATURAL LANGUAGE PROCESSING AND CHINESE COMPUTING, NLPCC 2017, 2018, 10619 : 490 - 502
  • [7] Blocked Gibbs Sampler for Hierarchical Dirichlet Processes
    Das, Snigdha
    Niu, Yabo
    Ni, Yang
    Mallick, Bani K.
    Pati, Debdeep
    JOURNAL OF COMPUTATIONAL AND GRAPHICAL STATISTICS, 2024,
  • [8] Supervised Hierarchical Dirichlet Processes with Variational Inference
    Zhang, Cheng
    Ek, Carl Henrik
    Gratal, Xavi
    Pokorny, Florian T.
    Kjellstrom, Hedvig
    2013 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW), 2013, : 254 - 261
  • [9] Hybrid Parallel Inference for Hierarchical Dirichlet Processes
    Omoto, Tsukasa
    Eguchi, Koji
    Tora, Shotaro
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2014, E97D (04): : 815 - 820
  • [10] Hierarchical Dirichlet processes for tracking maneuvering targets
    Fox, Emily B.
    Sudderth, Erik B.
    Willsky, Alan S.
    2007 PROCEEDINGS OF THE 10TH INTERNATIONAL CONFERENCE ON INFORMATION FUSION, VOLS 1-4, 2007, : 1415 - +