WASM: A Dataset for Hashtag Recommendation for Arabic Tweets

被引:1
|
作者
Al-Shaibani, Maged S. [1 ]
Luqman, Hamzah [1 ,2 ]
Al-Ghofaily, Abdulaziz S. [1 ]
Al-Najim, Abdullatif A. [1 ]
机构
[1] King Fahd Univ Petr & Minerals, Informat & Comp Sci Dept, Dhahran, Saudi Arabia
[2] SDAIA KFUPM Joint Res Ctr Artificial Intelligence, Dhahran 31261, Saudi Arabia
关键词
Hashtag Recommendation; Hashtag Generation; Tweets Classification; Arabic Tweets; Twitter; Hashtags;
D O I
10.1007/s13369-023-08567-1
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
As one of the largest microblogging websites in the world, Twitter generates a huge amount of information daily. The massive size of the generated data increases the difficulty for humans to follow and receive information relevant to their interests. Therefore, Twitter allows users to annotate and categorize their tweets using appropriate hashtags. However, finding an appropriate hashtag for a tweet is not always straightforward. Furthermore, many users violate the hashtag flow by posting irrelevant content to the hashtag topic. These problems increase the need for a hashtag recommendation and classification system. This topic has received considerable attention from researchers in some languages, such as English and Chinese. However, this problem has not yet been explored for the Arabic language owing to the lack of datasets. In this study, we bridge this gap by proposing WASM, an Arabic Twitter hashtag recommendation dataset consisting of more than 100,000 tweets annotated with 87 hashtags. The proposed dataset is subjected to several rounds of automatic and manual filtrations to ensure that it is suitable for tasks related to tweets and hashtags. Further, we propose three systems for hashtag recommendation and classification. Each of these systems approaches the task differently by considering it as classification, generation, and named entity recognition problems. The results obtained using these systems are promising and can be used to benchmark the WASM dataset. The data and code are available at https://github.com/Hamzah-Luqman/wasm.
引用
收藏
页码:12131 / 12145
页数:15
相关论文
共 50 条
  • [31] Hashtag recommendation for enhancing the popularity of social media posts
    Chakrabarti, Purnadip
    Malvi, Eish
    Bansal, Shubhi
    Kumar, Nagendra
    SOCIAL NETWORK ANALYSIS AND MINING, 2023, 13 (01)
  • [32] Sentiment Enhanced Multi-Modal Hashtag Recommendation for Micro-Videos
    Yang, Chao
    Wang, Xiaochan
    Jiang, Bin
    IEEE ACCESS, 2020, 8 (08): : 78252 - 78264
  • [33] Hashtag our stories: Hashtag recommendation for micro-videos via harnessing multiple modalities
    Cao, Da
    Miao, Lianhai
    Rong, Huigui
    Qin, Zheng
    Nie, Liqiang
    KNOWLEDGE-BASED SYSTEMS, 2020, 203 (203)
  • [34] DemoHash: Hashtag Recommendation based on User Demographic Information
    Jeong, Dahye
    Oh, Soyoung
    Park, Eunil
    EXPERT SYSTEMS WITH APPLICATIONS, 2022, 210
  • [35] EgoTR: Personalized Tweets Recommendation Approach
    Benzarti, Slim
    Faiz, Rim
    INTELLIGENT SYSTEMS IN CYBERNETICS AND AUTOMATION THEORY, VOL 2, 2015, 348 : 227 - 238
  • [36] USER CONDITIONAL HASHTAG RECOMMENDATION FOR MICRO-VIDEOS
    Liu, Shang
    Xie, Jiayi
    Zou, Cong
    Chen, Zhenzhong
    2020 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME), 2020,
  • [37] AMNN: Attention-Based Multimodal Neural Network Model for Hashtag Recommendation
    Yang, Qi
    Wu, Gaosheng
    Li, Yuhua
    Li, Ruixuan
    Gu, Xiwu
    Deng, Huicai
    Wu, Junzhuang
    IEEE TRANSACTIONS ON COMPUTATIONAL SOCIAL SYSTEMS, 2020, 7 (03) : 768 - 779
  • [38] A Data-Driven Approach for Twitter Hashtag Recommendation
    Belhadi, Asma
    Djenouri, Youcef
    Lin, Jerry Chun-Wei
    Cano, Alberto
    IEEE ACCESS, 2020, 8 : 79182 - 79191
  • [39] Hashtag recommendation for enhancing the popularity of social media posts
    Purnadip Chakrabarti
    Eish Malvi
    Shubhi Bansal
    Nagendra Kumar
    Social Network Analysis and Mining, 13
  • [40] Hashtag Recommendation Approach Based on Content and User Characteristics
    Van Cuong Tran
    Hwang, Dosam
    Ngoc Thanh Nguyen
    CYBERNETICS AND SYSTEMS, 2018, 49 (5-6) : 368 - 383