A Phrase Topic Model Based on Distributed Representation

被引:3
作者
Ma, Jialin [1 ]
Cheng, Jieyi [1 ]
Zhang, Lin [1 ]
Zhou, Lei [1 ]
Chen, Bolun [1 ,2 ]
机构
[1] Huaiyin Inst Technol, Jiangsu Internet Things & Moblie Internet Technol, Huaian 223003, Peoples R China
[2] Univ Fribourg, CH-1700 Fribourg, Switzerland
来源
CMC-COMPUTERS MATERIALS & CONTINUA | 2020年 / 64卷 / 01期
关键词
Phrase; topic model; LDA; distributed representation; Gibbs sampling;
D O I
10.32604/cmc.2020.09780
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Traditional topic models have been widely used for analyzing semantic topics from electronic documents. However, the obvious defects of topic words acquired by them are poor in readability and consistency. Only the domain experts are possible to guess their meaning. In fact, phrases are the main unit for people to express semantics. This paper presents a Distributed Representation-Phrase Latent Dirichlet Allocation (DR-Phrase LDA) which is a phrase topic model. Specifically, we reasonably enhance the semantic information of phrases via distributed representation in this model. The experimental results show the topics quality acquired by our model is more readable and consistent than other similar topic models.
引用
收藏
页码:455 / 469
页数:15
相关论文
共 21 条
[1]  
[Anonymous], 2013, NIPS
[2]  
[Anonymous], 2005, PARAMETER ESTIMATION
[3]  
Bing Li, 2016, Database Systems for Advanced Applications. 21st International Conference, DASFAA 2016. Proceedings: LNCS 9642, P197, DOI 10.1007/978-3-319-32025-0_13
[4]  
Blei D.M., 2009, ARXIV PREPRINT ARXIV
[5]   A CORRELATED TOPIC MODEL OF SCIENCE [J].
Blei, David M. ;
Lafferty, John D. .
ANNALS OF APPLIED STATISTICS, 2007, 1 (01) :17-35
[6]   Latent Dirichlet allocation [J].
Blei, DM ;
Ng, AY ;
Jordan, MI .
JOURNAL OF MACHINE LEARNING RESEARCH, 2003, 3 (4-5) :993-1022
[7]   Discovering Coherent Topics Using General Knowledge [J].
Chen, Zhiyuan ;
Mukherjee, Arjun ;
Liu, Bing ;
Hsu, Meichun ;
Castellanos, Malu ;
Ghosh, Riddhiman .
PROCEEDINGS OF THE 22ND ACM INTERNATIONAL CONFERENCE ON INFORMATION & KNOWLEDGE MANAGEMENT (CIKM'13), 2013, :209-218
[8]   Scalable Topical Phrase Mining from Text Corpora [J].
El-Kishky, Ahmed ;
Song, Yanglei ;
Wang, Chi ;
Voss, Clare R. ;
Han, Jiawei .
PROCEEDINGS OF THE VLDB ENDOWMENT, 2014, 8 (03) :305-316
[9]  
Fei Geli., 2014, COLING, P667
[10]   Opinion-based entity ranking [J].
Ganesan, Kavita ;
Zhai, ChengXiang .
INFORMATION RETRIEVAL, 2012, 15 (02) :116-150