Topic Diffusion Discovery based on Sparseness-constrained Non-negative Matrix Factorization

被引:1
作者
Kang, Yihuang [1 ]
Lin, Keng-Pei [1 ]
Cheng, I-Ling [1 ]
机构
[1] Natl Sun Yat Sen Univ, Kaohsiung, Taiwan
来源
2018 IEEE INTERNATIONAL CONFERENCE ON INFORMATION REUSE AND INTEGRATION (IRI) | 2018年
关键词
Topic Modeling; Topic Diffusion; Topic Detection and Tracking; Non-Negative Matrix Factorization; Information Divergence;
D O I
10.1109/IRI.2018.00021
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Due to recent explosion of text data, researchers have been overwhelmed by ever-increasing volume of articles produced by different research communities. Various scholarly search websites, citation recommendation engines, and research databases have been created to simplify the text search tasks. However, it is still difficult for researchers to be able to identify potential research topics without doing intensive reviews on a tremendous number of articles published by journals, conferences, meetings, and workshops. In this paper, we consider a novel topic diffusion discovery technique that incorporates sparseness-constrained Non-negative Matrix Factorization with generalized Jensen-Shannon divergence to help understand term-topic evolutions and identify topic diffusions. Our experimental result shows that this approach can extract more prominent topics from large article databases, visualize relationships between terms of interest and abstract topics, and further help researchers understand whether given terms/topics have been widely explored or whether new topics are emerging from literature.
引用
收藏
页码:94 / 101
页数:8
相关论文
共 27 条
[1]  
Aggarwal Charu C, 2012, Mining text data, P163, DOI [DOI 10.1007/978-1-4614-3223-46, DOI 10.1007/978-1-4614-3223-4, 10.1007/978-1-4614-3223-4]
[2]   On-Line LDA: Adaptive Topic Models for Mining Text Streams with Applications to Topic Detection and Tracking [J].
AlSumait, Loulwah ;
Barbara, Daniel ;
Domeniconi, Carlotta .
ICDM 2008: EIGHTH IEEE INTERNATIONAL CONFERENCE ON DATA MINING, PROCEEDINGS, 2008, :3-12
[3]  
[Anonymous], 2012, Proceedings of the fifth ACM International Conference on Web Search and Data Mining
[4]   Learning Topic Models - Going beyond SVD [J].
Arora, Sanjeev ;
Ge, Rong ;
Moitra, Ankur .
2012 IEEE 53RD ANNUAL SYMPOSIUM ON FOUNDATIONS OF COMPUTER SCIENCE (FOCS), 2012, :1-10
[5]   Probabilistic Topic Models [J].
Blei, David M. .
COMMUNICATIONS OF THE ACM, 2012, 55 (04) :77-84
[6]   Latent Dirichlet allocation [J].
Blei, DM ;
Ng, AY ;
Jordan, MI .
JOURNAL OF MACHINE LEARNING RESEARCH, 2003, 3 (4-5) :993-1022
[7]  
Blei DM., 2006, P 23 INT C MACHINE L, P113, DOI DOI 10.1145/1143844.1143859
[8]   UTOPIAN: User-Driven Topic Modeling Based on Interactive Nonnegative Matrix Factorization [J].
Choo, Jaegul ;
Lee, Changhyun ;
Reddy, Chandan K. ;
Park, Haesun .
IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, 2013, 19 (12) :1992-2001
[9]  
Feinerer I, 2008, J STAT SOFTW, V25, P1
[10]   The complete mitochondrial genome of Liachirus melanospilos (Pleuronectiformes: Soleidae) [J].
Gong, Li ;
Shi, Wei ;
Wang, Shu-Ying ;
Kong, Xiao-Yu .
MITOCHONDRIAL DNA, 2015, 26 (05) :732-733