Dynamic Topic Detection and Tracking: A Comparison of HDP, C-Word, and Cocitation Methods

被引:65
作者
Ding, Wanying [1 ]
Chen, Chaomei [1 ]
机构
[1] Drexel Univ, Coll Comp & Informat, Philadelphia, PA 19104 USA
关键词
text mining; computer graphics; knowledge modeling; NETWORKS;
D O I
10.1002/asi.23134
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Cocitation and co-word methods have long been used to detect and track emerging topics in scientific literature, but both have weaknesses. Recently, while many researchers have adopted generative probabilistic models for topic detection and tracking, few have compared generative probabilistic models with traditional cocitation and co-word methods in terms of their overall performance. In this article, we compare the performance of hierarchical Dirichlet process (HDP), a promising generative probabilistic model, with that of the 2 traditional topic detecting and tracking methodscocitation analysis and co-word analysis. We visualize and explore the relationships between topics identified by the 3 methods in hierarchical edge bundling graphs and time flow graphs. Our result shows that HDP is more sensitive and reliable than the other 2 methods in both detecting and tracking emerging topics. Furthermore, we demonstrate the important topics and topic evolution trends in the literature of terrorism research with the HDP method.
引用
收藏
页码:2084 / 2097
页数:14
相关论文
共 21 条
[1]  
[Anonymous], 2012, PLOS ONE
[2]  
[Anonymous], 1974, Essays of an Information Scientist
[3]   Dynamics of the evolution of the strategy concept 1962-2008: a co-word analysis [J].
Armando Ronda-Pupo, Guillermo ;
Angel Guerras-Martin, Luis .
STRATEGIC MANAGEMENT JOURNAL, 2012, 33 (02) :162-188
[4]  
Blei D., 2007, LECT NOTES PRINCETON
[5]   Latent Dirichlet allocation [J].
Blei, DM ;
Ng, AY ;
Jordan, MI .
JOURNAL OF MACHINE LEARNING RESEARCH, 2003, 3 (4-5) :993-1022
[6]   FROM TRANSLATIONS TO PROBLEMATIC NETWORKS - AN INTRODUCTION TO CO-WORD ANALYSIS [J].
CALLON, M ;
COURTIAL, JP ;
TURNER, WA ;
BAUIN, S .
SOCIAL SCIENCE INFORMATION SUR LES SCIENCES SOCIALES, 1983, 22 (02) :191-235
[7]   CiteSpace II: Detecting and visualizing emerging trends and transient patterns in scientific literature [J].
Chen, CM .
JOURNAL OF THE AMERICAN SOCIETY FOR INFORMATION SCIENCE AND TECHNOLOGY, 2006, 57 (03) :359-377
[8]  
Ding Y., 2011, J AM SOC INFORM SCI, V62, P187
[9]   PageRank for Ranking Authors in Co-citation Networks [J].
Ding, Ying ;
Yan, Erjia ;
Frazho, Arthur ;
Caverlee, James .
JOURNAL OF THE AMERICAN SOCIETY FOR INFORMATION SCIENCE AND TECHNOLOGY, 2009, 60 (11) :2229-2243
[10]   BAYESIAN ANALYSIS OF SOME NONPARAMETRIC PROBLEMS [J].
FERGUSON, TS .
ANNALS OF STATISTICS, 1973, 1 (02) :209-230