Entity Ranking from Annotated Text Collections Using Multitype Topic Models

被引:0
作者
Shiozaki, Hitohiro [1 ]
Eguchi, Koji [2 ]
机构
[1] Kobe Univ, Grad Sch Sci & Technol, 1-1 Rokkoudai, Kobe, Hyogo 6578501, Japan
[2] Kobe Univ, Grad Sch Engn, 1-1 Rokkoudai, Kobe, Hyogo 6578501, Japan
来源
FOCUSED ACCESS TO XML DOCUMENTS | 2008年 / 4862卷
关键词
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Very recently, topic model-based retrieval methods have produced good results using Latent Dirichlet Allocation (LDA) model or its variants in language modeling framework. However, for the task of retrieving annotated documents when using the LDA-based methods, some post-processing is required outside the model in order to make use of multiple word types that are specified by the annotations. In this paper, we explore new retrieval methods using a 'multitype topic model' that can directly handle multiple word types, such as annotated entities, category labels and other words that are typically used in Wikipedia. We investigate how to effectively apply the multitype topic model to retrieve documents from an annotated collection, and show the effectiveness of our methods through experiments on entity ranking using a Wikipedia collection.
引用
收藏
页码:279 / +
页数:3
相关论文
共 19 条
[1]  
[Anonymous], ADV NEURAL INFORM PR
[2]  
[Anonymous], MODERN INFORM RETRIE
[3]  
[Anonymous], 2006, P 29 ANN INT ACM SIG, DOI DOI 10.1145/1148170.1148204
[4]  
[Anonymous], P 27 INT ACM SIGIR C
[5]  
[Anonymous], P 24 ANN INT ACM SIG, DOI DOI 10.1145/383952.384019
[6]   Latent Dirichlet allocation [J].
Blei, DM ;
Ng, AY ;
Jordan, MI .
JOURNAL OF MACHINE LEARNING RESEARCH, 2003, 3 (4-5) :993-1022
[7]  
Callan J. P., 1992, DEXA 92. Database and Expert Systems Applications. Proceedings of the International Conference, P78
[8]  
Gallinari P., 2006, ACM SIGIR FORUM, V40, P1, DOI [10.1145/1147197.1147210, DOI 10.1145/1147197.1147210]
[9]   Finding scientific topics [J].
Griffiths, TL ;
Steyvers, M .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2004, 101 :5228-5235
[10]  
HIEMSTRA D, 1998, LNCS, V1513, P569, DOI DOI 10.1007/3-540-49653-X_34