Using Tweets Embeddings For Hashtag Recommendation in Twitter

被引:27
|
作者
Ben-Lhachemi, Nada [1 ]
Nfaoui, El Habib [1 ]
机构
[1] Sidi Mohammed Ben Abdellah Univ, LIIAN Lab, Fes, Morocco
来源
PROCEEDINGS OF THE FIRST INTERNATIONAL CONFERENCE ON INTELLIGENT COMPUTING IN DATA SCIENCES (ICDS2017) | 2018年 / 127卷
关键词
Word Embeddings; DBSCAN; Recommender system; Twitter; Hashtag; Clustering;
D O I
10.1016/j.procs.2018.01.092
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Social microblogging platforms such as Twitter have become hugely popular forms of this latest sort of blogging. Twitter users make and use hashtags in their tweets to categorize them according to topic or theme, likewise to make them ascertainable to other bloggers through search. However, the liberated hashtag creation policy make a wide hardness for bloggers to find appropriates hashtags for their posts. Indeed, the task of recommending hashtags has many benefits to afford; notably it assists users to choose relevant hashtags for their posts in real time, which will save them from a supplementary stress. Actually, the achieve success of several models of neural networks for calculating word embeddings, has driven approaches for generating syntactic and semantic embeddings for long and noisy text, such as paragraphs, sentences and micro-blogs. On the parallel lines, our aim is to develop a hashtag recommender system to assist users to choose relevant hashtags for their posts in real time, based on using semantic embeddings representation of tweets, which we can subsequently use to capture semantic similarity or relatedness between tweets. In the current paper, we introduce an approach to hashtag recommendation in Twitter that is based on the following proceedings: Using a pre-trained word embeddings on a large corpus such as Google News applying one of the famous embeddings methods, Representing a given tweet by a weighted averaging value of its word embeddings, Then combining these features with the DBSCAN (density-based spatial clustering of applications with noise) clustering algorithm, to divide the heterogeneous collection of tweets into clusters that contain syntactically and semantically similar tweets. Afterwards, Recommending the top-K suitable hashtags to the user after computing the similarity between the entered tweet and the centroids of obtained clusters. Our system achieved promising results which demonstrate the effectiveness of our approach. (C) 2018 The Authors. Published by Elsevier B.V.
引用
收藏
页码:7 / 15
页数:9
相关论文
共 50 条
  • [21] Search Result Personalization in Twitter Using Neural Word Embeddings
    Samarawickrama, Sameendra
    Karunasekera, Shanika
    Harwood, Aaron
    Kotagiri, Ramamohanarao
    BIG DATA ANALYTICS AND KNOWLEDGE DISCOVERY, DAWAK 2017, 2017, 10440 : 244 - 258
  • [22] Use of the Hashtag #DataSavesLives on Twitter: Exploratory and Thematic Analysis
    Teodorowski, Piotr
    Rodgers, Sarah E.
    Fleming, Kate
    Frith, Lucy
    JOURNAL OF MEDICAL INTERNET RESEARCH, 2022, 24 (11)
  • [23] Interactive Hashtag Recommendation System
    Lin, Chun-Ting
    Li, Tsai-Yen
    2022 INTERNATIONAL CONFERENCE ON TECHNOLOGIES AND APPLICATIONS OF ARTIFICIAL INTELLIGENCE, TAAI, 2022, : 165 - 170
  • [24] Evolutionary Personalized Hashtag Recommendation
    Yu, Jianjun
    Shen, Yi
    WEB-AGE INFORMATION MANAGEMENT, WAIM 2014, 2014, 8485 : 34 - 37
  • [25] Social media signal detection using tweets volume, hashtag, and sentiment analysis
    Faria Nazir
    Mustansar Ali Ghazanfar
    Muazzam Maqsood
    Farhan Aadil
    Seungmin Rho
    Irfan Mehmood
    Multimedia Tools and Applications, 2019, 78 : 3553 - 3586
  • [26] Using word embeddings in Twitter election classification
    Xiao Yang
    Craig Macdonald
    Iadh Ounis
    Information Retrieval Journal, 2018, 21 : 183 - 207
  • [27] HaRNaT- A dynamic hashtag recommendation system using news
    Gupta, Divya
    Chakraverty, Shampa
    ONLINE SOCIAL NETWORKS AND MEDIA, 2025, 45
  • [28] Social media signal detection using tweets volume, hashtag, and sentiment analysis
    Nazir, Faria
    Ghazanfar, Mustansar Ali
    Maqsood, Muazzam
    Aadil, Farhan
    Rho, Seungmin
    Mehmood, Irfan
    MULTIMEDIA TOOLS AND APPLICATIONS, 2019, 78 (03) : 3553 - 3586
  • [29] Using word embeddings in Twitter election classification
    Yang, Xiao
    Macdonald, Craig
    Ounis, Iadh
    INFORMATION RETRIEVAL JOURNAL, 2018, 21 (2-3): : 183 - 207
  • [30] #secondcivilwarletters from the front: Discursive illusions in a trending Twitter hashtag
    Ross, Andrew S.
    Bhatia, Aditi
    NEW MEDIA & SOCIETY, 2019, 21 (10) : 2222 - 2241