Constructing and Visualizing Topic Forests for Text Streams

被引:1
作者
Fushimi, Takayasu [1 ]
Satoh, Tetsuji [2 ]
机构
[1] Tokyo Univ Technol, 1404-1 Katakuramachi, Hachioji, Tokyo 1920982, Japan
[2] Univ Tsukuba, 1-2 Kasuga, Tsukuba, Ibaraki 3058550, Japan
来源
2017 IEEE/WIC/ACM INTERNATIONAL CONFERENCE ON WEB INTELLIGENCE (WI 2017) | 2017年
关键词
Visualization; Text stream; Tree structure;
D O I
10.1145/3106426.3106455
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
A great deal of such texts as news and blog articles, web pages, and scientific literature are posted on the web as time goes by, and are generally called time-series documents or text streams. For each document, some strongly or weakly relevant texts exist. Although such relevance is represented as citations among scientific literatures, trackback among blog articles, hyperlinks among Wikipedia articles or web pages and so on, the relevance among news articles is not always clearly specified. One easy way to build a similarity network is by calculating the similarity among news articles and making links among similar articles; however, adding information about the posted times of articles to a similarity network is difficult. To overcome this problem, we propose a framework that consists of two parts: 1) tree structures called Topic Forests and 2) their visualization. Topic Forests are constructed by semantically and temporally linking cohesive texts while preserving their posted order. We provide effective access for users to text streams by embedding Topic Forests over the polar coordinates with a technique called Polar Coordinate Embedding. From experimental evaluations using the actual text streams of news articles, we confirm that Topic Forests semantically and temporally maintain cohesiveness, and Polar Coordinate Embedding achieves effective accessibility.
引用
收藏
页码:10 / 17
页数:8
相关论文
共 18 条
[1]   Real-Time Visualization of Streaming Text with a Force-Based Dynamic System [J].
Alsakran, Jamal ;
Chen, Yang ;
Luo, Dongning ;
Zhao, Ye ;
Yang, Jing ;
Dou, Wenwen ;
Liu, Shixia .
IEEE COMPUTER GRAPHICS AND APPLICATIONS, 2012, 32 (01) :34-45
[2]  
[Anonymous], 2003, P 20 INT C MACH LEAR
[3]  
[Anonymous], 2005, ADV NEURAL INFORM PR
[4]  
[Anonymous], 1952, Psychometrika
[5]  
Belkin M, 2002, ADV NEUR IN, V14, P585
[6]  
Fushimi T, 2011, LECT NOTES ARTIF INT, V7106, P697, DOI 10.1007/978-3-642-25832-9_71
[7]  
Ishikawa Y., 2007, T SCROLL VISUALIZING, P235
[8]   AN ALGORITHM FOR DRAWING GENERAL UNDIRECTED GRAPHS [J].
KAMADA, T ;
KAWAI, S .
INFORMATION PROCESSING LETTERS, 1989, 31 (01) :7-15
[9]  
Keim D. A., 2010, IEEE T VISUALIZATION, V18, P93
[10]   Authoritative sources in a hyperlinked environment [J].
Kleinberg, JM .
JOURNAL OF THE ACM, 1999, 46 (05) :604-632