A Data-Driven Approach for Twitter Hashtag Recommendation

Cited by: 26
Authors
Belhadi, Asma [1]
Djenouri, Youcef [2]
Lin, Jerry Chun-Wei [3]
Cano, Alberto [4]
Affiliations
[1] Univ Sci & Technol Houari Boumediene USTHB, Dept Comp Sci, Algiers 16111, Algeria
[2] SINTEF Digital, Dept Math & Cybernet, N-0314 Oslo, Norway
[3] Western Norway Univ Appl Sci, Dept Comp Math & Phys, N-5063 Bergen, Norway
[4] Virginia Commonwealth Univ, Dept Comp Sci, Med Coll Virginia Campus, Richmond, VA 23284 USA
Keywords
High average utility patterns; hashtag recommendation; ontology construction; temporal information; topic model; frequent; patterns; generation
DOI
10.1109/ACCESS.2020.2990799
Chinese Library Classification
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
This paper addresses the hashtag recommendation problem using high average-utility pattern mining. We introduce a novel framework called PM-HRec (Pattern Mining for Hashtag Recommendation), which consists of two main stages. First, offline processing transforms the corpus of tweets into a transactional database, taking into account the temporal information of the tagged tweets (tweets with hashtags), and discovers the temporal top-k high average-utility patterns. The set of irrelevant tagged tweets and the ontology of tagged tweets are also constructed offline. Second, online processing takes the utility patterns, the ontology, and the irrelevant tagged tweets as input and extracts the most relevant hashtags for a given orphan tweet (a tweet without hashtags). Extensive experiments were carried out on large tweet collections. The proposed PM-HRec outperforms existing state-of-the-art hashtag recommendation approaches in terms of both the quality of the recommended hashtags and runtime.
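To make the notion of a top-k high average-utility pattern concrete, the following is a minimal brute-force sketch in Python. It is not the PM-HRec implementation: the toy transactions, the utility values, and the function names `average_utility` and `top_k_patterns` are all assumptions made for illustration, and the temporal information used by the paper is ignored. It relies only on the common definition in which the average utility of an itemset in a transaction is the sum of its item utilities divided by the itemset length, summed over the transactions that contain the whole itemset.

```python
from itertools import combinations

# Hypothetical toy database: each transaction stands for one tagged tweet and
# maps a term or hashtag to an assumed utility value (for example, a weight).
transactions = [
    {"#ai": 4, "#ml": 2, "model": 2},
    {"#ai": 2, "#ml": 6},
    {"#ml": 2, "model": 4},
]

def average_utility(itemset, db):
    """Average utility of an itemset over the transactions that contain it.

    For each transaction holding every item of the itemset, add the sum of
    the item utilities divided by the number of items in the itemset.
    """
    total = 0.0
    for t in db:
        if all(item in t for item in itemset):
            total += sum(t[item] for item in itemset) / len(itemset)
    return total

def top_k_patterns(db, k, max_len=3):
    """Enumerate all itemsets up to max_len and keep the k best by average utility.

    Exhaustive enumeration is exponential in the number of items; it is used
    here only because the toy database is tiny.
    """
    items = sorted({item for t in db for item in t})
    scored = []
    for n in range(1, max_len + 1):
        for itemset in combinations(items, n):
            au = average_utility(itemset, db)
            if au > 0:
                scored.append((itemset, au))
    # Highest average utility first; ties broken by itemset for determinism.
    scored.sort(key=lambda pair: (-pair[1], pair[0]))
    return scored[:k]

if __name__ == "__main__":
    for itemset, au in top_k_patterns(transactions, k=3):
        print(itemset, au)
```

On this toy data the pair `("#ai", "#ml")` scores (4 + 2) / 2 + (2 + 6) / 2 = 7, which ranks it just below the single item `"#ml"` with a score of 10. A practical miner would replace the exhaustive enumeration with pruning based on upper bounds on average utility.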
Pages: 79182-79191
Page count: 10