Detecting emerging topics by exploiting probability burst and association rule mining: A case study of Library and Information Science

被引:4
作者
Xu, Min [1 ]
Li, Guangjian [1 ]
Wang, Xiaodi [1 ]
机构
[1] Peking Univ, Dept Informat Management, Beijing 100871, Peoples R China
关键词
Latent Dirichlet Allocation; Emerging topic detection; Probability burst; Association rule mining; Library and Information Science research; RESEARCH FRONTS; LDA; EVOLUTION; CITATION;
D O I
10.22452/mjlis.vol25no1.3
中图分类号
G25 [图书馆学、图书馆事业]; G35 [情报学、情报工作];
学科分类号
1205 ; 120501 ;
摘要
The primary reason for detecting emerging topics is to reduce researchers' time in finding current related topic while maintaining awareness of current trends in a particular field. Nowadays, the amount of information is growing rapidly, but tracking the development of a research field by manually reading the literature is challenging. This study takes Library and Information Science (LIS) as a case study to present a new method for detecting emerging topics. This novel method could be applied to analyse various types of documents and detect emerging topics automatically. This method utilizes a Latent Dirichlet Allocation (LDA) model to generate topics and calculate probabilities. It discovers emerging topics by detecting probability burst in consecutive time spans. Association rule mining and lexical similarity computation are adopted to represent the topics. This work tests the method by comparing the results of emerging topics from the LIS data in the baseline paper. The validation demonstrates that the proposed approach is feasible.
引用
收藏
页码:47 / 66
页数:20
相关论文
共 37 条
[21]   Bursty and hierarchical structure in streams [J].
Kleinberg, J .
DATA MINING AND KNOWLEDGE DISCOVERY, 2003, 7 (04) :373-397
[22]   Explore the research front of a specific research theme based on a novel technique of enhanced co-word analysis [J].
Li, Munan ;
Chu, Yanqun .
JOURNAL OF INFORMATION SCIENCE, 2017, 43 (06) :725-741
[23]   Cost-Effective Online Trending Topic Detection and Popularity Prediction in Microblogging [J].
Miao, Zhongchen ;
Chen, Kai ;
Fang, Yi ;
He, Jianhua ;
Zhou, Yi ;
Zhang, Wenjun ;
Zha, Hongyuan .
ACM TRANSACTIONS ON INFORMATION SYSTEMS, 2017, 35 (03)
[24]   Time line visualization of research fronts [J].
Morris, SA ;
Yen, G ;
Wu, Z ;
Asnake, B .
JOURNAL OF THE AMERICAN SOCIETY FOR INFORMATION SCIENCE AND TECHNOLOGY, 2003, 54 (05) :413-422
[25]  
National Institute of Standards and Technology (NIST), 2004, 2004 TOP DET TRACK T
[26]  
PERSSON O, 1994, J AM SOC INFORM SCI, V45, P31, DOI 10.1002/(SICI)1097-4571(199401)45:1<31::AID-ASI4>3.0.CO
[27]  
2-G
[28]  
Rosen-Zvi Michal., 2004, UAI
[29]   Detecting emerging research fronts based on topological measures in citation networks of scientific publications [J].
Shibata, Naoki ;
Kajikawa, Yuya ;
Takeda, Yoshiyuki ;
Matsushima, Katsumori .
TECHNOVATION, 2008, 28 (11) :758-775
[30]   Comparative Study on Methods of Detecting Research Fronts Using Different Types of Citation [J].
Shibata, Naoki ;
Kajikawa, Yuya ;
Takeda, Yoshiyuki ;
Matsushima, Katsumori .
JOURNAL OF THE AMERICAN SOCIETY FOR INFORMATION SCIENCE AND TECHNOLOGY, 2009, 60 (03) :571-580