Hashtag Sense Induction Based on Co-occurrence Graphs

被引:0
作者
Wang, Mengmeng [1 ]
Iwaihara, Mizuho [1 ]
机构
[1] Waseda Univ, Grad Sch Informat Prod & Syst, Wakamatsu Ku, Kitakyushu, Fukuoka 8080135, Japan
来源
WEB TECHNOLOGIES AND APPLICATIONS (APWEB 2015) | 2015年 / 9313卷
关键词
Twitter; Hashtag; Sense Induction; Co-occurrence Graph; Wikipedia;
D O I
10.1007/978-3-319-25255-1_13
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Twitter hashtags are used to categorize tweets for improving search categorizing topic. But the fact that people can create and use hashtags freely leads to a situation such that one hashtag may have multiple senses. In this paper, we propose a method to induce senses of a hashtag in a particular time frame. Our assumption is that for a sense of a hashtag the context words around it are similar. Then we design a method that uses a co-occurrence graph and community detection algorithm. Both words and hashtags are nodes of the co-occurrence graph, and an edge represents the relation of two nodes co-occurring in the same tweet. A list of words with a high node degree representing a sense is extracted as a community of the graph. We take Wikipedia disambiguation list page as word sense inventory to refine the results by removing non-sense topics.
引用
收藏
页码:154 / 165
页数:12
相关论文
共 6 条
[1]  
[Anonymous], 2011, P TEXTGRAPHS6 GRAPH
[2]   Fast unfolding of communities in large networks [J].
Blondel, Vincent D. ;
Guillaume, Jean-Loup ;
Lambiotte, Renaud ;
Lefebvre, Etienne .
JOURNAL OF STATISTICAL MECHANICS-THEORY AND EXPERIMENT, 2008,
[3]  
Klapafits I.P., 2010, Proceedings of EMNLP 2010, P745
[4]  
Nasiruddin M., 2013, P RECITAL 2013, P192
[5]  
Van De Cruys T., 2010, THESIS
[6]   HyperLex:: lexical cartography for information retrieval [J].
Véronis, J .
COMPUTER SPEECH AND LANGUAGE, 2004, 18 (03) :223-252