WINDOW-BASED TOPIC MODEL FOR HDP

Authors
Liu, Di [1]
Zeng, Ye [1]
Luo, Yu [1]
Pang, Hong [1]
Wu, Xiao-Hua [1]
Affiliation
[1] University of Electronic Science and Technology of China, School of Information and Software Engineering, Chengdu 610054, People's Republic of China
Source
2019 16TH INTERNATIONAL COMPUTER CONFERENCE ON WAVELET ACTIVE MEDIA TECHNOLOGY AND INFORMATION PROCESSING (ICWAMTIP), 2019
Keywords
Hierarchical Dirichlet process; Topic model; Window; Belief propagation
DOI
10.1109/iccwamtip47768.2019.9067737
Chinese Library Classification number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
The hierarchical Dirichlet process (HDP) is a nonparametric Bayesian model that has been widely applied to topic modeling. However, the model rests on the "bag of words" assumption, which ignores the order of words in a document and therefore loses the semantic context of each word. To address this, this paper proposes a window-based hierarchical Dirichlet process model (WHDP). The model uses windows to divide documents into smaller fragments and preserves the order of words as the windows move, which reduces semantic confusion in the text. We applied our method to a real dataset and compared it with existing methods, such as the sampling belief propagation algorithm for HDP, the LDA model, and a sliding-window-based topic model. The results show that the proposed method is superior in convergence rate, perplexity, and generalization ability.
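The abstract does not state the window size, the step between windows, or how fragments are passed to HDP inference, so the following is only a minimal sketch of the fragmentation idea, not the authors' WHDP procedure. The function name `sliding_windows` and the parameters `size` and `stride` are hypothetical, chosen for illustration.

```python
from typing import List


def sliding_windows(tokens: List[str], size: int, stride: int = 1) -> List[List[str]]:
    """Split an ordered token list into window fragments.

    Assumption (not from the paper): fixed-size windows moved by a fixed
    stride. Word order inside each fragment is left unchanged.
    """
    if size <= 0 or stride <= 0:
        raise ValueError("size and stride must be positive")
    if not tokens:
        return []
    # A document no longer than one window yields a single fragment.
    if len(tokens) <= size:
        return [list(tokens)]

    last_start = len(tokens) - size
    fragments = [tokens[start:start + size]
                 for start in range(0, last_start + 1, stride)]
    # If the stride stepped over the tail, add one window aligned to the end
    # so that no trailing words are dropped.
    if last_start % stride != 0:
        fragments.append(tokens[last_start:])
    return fragments
```

Each fragment could then be treated as a small pseudo-document by a topic model, so that only words occurring near each other share a context; whether WHDP consumes fragments in this way is an assumption here.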
Pages: 70-75
Number of pages: 6