Defining Semantic Meta-hashtags for Twitter Classification

被引:0
作者
Costa, Joana [1 ,2 ]
Silva, Catarina [1 ,2 ]
Antunes, Mario [1 ,3 ]
Ribeiro, Bernardete [2 ]
机构
[1] Polytech Inst Leiria, Comp Sci Commun & Res Ctr, Sch Technol & Management, Leiria, Portugal
[2] Univ Coimbra, Ctr Informat & Syst, Dept Informat Engn, P-3000 Coimbra, Portugal
[3] Ctr Res Adv Comp Syst, Coimbra, Portugal
来源
ADAPTIVE AND NATURAL COMPUTING ALGORITHMS, ICANNGA 2013 | 2013年 / 7824卷
关键词
Meta-hashtags; Semantic; Text Classification; Twitter;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Given the wide spread of social networks, research efforts to retrieve information using tagging from social networks communications have increased. In particular, in Twitter social network, hashtags are widely used to define a shared context for events or topics. While this is a common practice often the hashtags freely introduced by the user become easily biased. In this paper, we propose to deal with this bias defining semantic meta-hashtags by clustering similar messages to improve the classification. First, we use the user-defined hashtags as the Twitter message class labels. Then, we apply the meta-hashtag approach to boost the performance of the message classification. The meta-hashtag approach is tested in a Twitter-based dataset constructed by requesting public tweets to the Twitter API. The experimental results yielded by comparing a baseline model based on user-defined hashtags with the clustered meta-hashtag approach show that the overall classification is improved. It is concluded that by incorporating semantics in the meta-hashtag model can have impact in different applications, e.g. recommendation systems, event detection or crowdsourcing.
引用
收藏
页码:226 / 235
页数:10
相关论文
共 28 条
[1]  
Abel F, 2011, LECT NOTES COMPUT SC, V6787, P1, DOI 10.1007/978-3-642-22362-4_1
[2]  
Abel F, 2011, LECT NOTES COMPUT SC, V6644, P375, DOI 10.1007/978-3-642-21064-8_26
[3]  
[Anonymous], Proceedings of the fifth ACMinternational conference on Web search and data mining, DOI [DOI 10.1145/2124295.2124320, 10.1145/2124295.2124320]
[4]  
[Anonymous], 2012, Proceedings of the 21st international conference on World Wide Web, DOI DOI 10.1145/2187836.2187872
[5]   Serglycin-deficient cytotoxic T lymphocytes display defective secretory granule maturation and granzyme B storage [J].
Grujic, M ;
Braga, T ;
Lukinius, A ;
Eloranta, ML ;
Knight, SD ;
Pejler, G ;
Åbrink, M .
JOURNAL OF BIOLOGICAL CHEMISTRY, 2005, 280 (39) :33411-33418
[6]  
Becker H., 2011, ICWSM, P226
[7]  
Chang H.-C., 2010, P 73 ASIS T ANN M NA, V47, P227
[8]  
Costa J., 2011, Proceedings of the 2011 11th International Conference on Intelligent Systems Design and Applications (ISDA), P469, DOI 10.1109/ISDA.2011.6121700
[9]   Why Rumors Spread So Quickly in Social Networks [J].
Doer, Benjamin ;
Fouz, Mahmoud ;
Friedrich, Tobias .
COMMUNICATIONS OF THE ACM, 2012, 55 (06) :70-75
[10]  
Efron M, 2010, SIGIR 2010: PROCEEDINGS OF THE 33RD ANNUAL INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH DEVELOPMENT IN INFORMATION RETRIEVAL, P787