TSim: a system for discovering similar users on Twitter

被引:8
|
作者
AlMahmoud, Hind [1 ]
AlKhalifa, Shurug [1 ]
机构
[1] King Saud Univ, Coll Comp & Informat Sci, Informat Technol Dept, Riyadh, Saudi Arabia
关键词
Twitter; MapReduce; Similarity on social media; Big data;
D O I
10.1186/s40537-018-0147-2
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
This paper presents a framework for discovering similar users on Twitter that can be used in profiling users for social, recruitment and security reasons. The framework contains a novel formula that calculates the similarity between users on Twitter by using seven different signals (features). The signals are followings and followers, mention, retweet, favorite, common hashtag, common interests, and profile similarity. The proposed framework is scalable and can handle big data because it is implemented using the MapReduce paradigm. It is also adjustable since the weight and contribution of each signal in calculating the final similarity score is determined by the user based on their needs. The accuracy of the system was evaluated through human judges and by comparing the system's results against Twitter's Who To Follow service. The results show moderately accurate results.
引用
收藏
页数:20
相关论文
共 50 条
  • [31] Identifying Fake News and Fake Users on Twitter
    Atodiresei, Costel-Sergiu
    Tanaselea, Alexandru
    Iftene, Adrian
    KNOWLEDGE-BASED AND INTELLIGENT INFORMATION & ENGINEERING SYSTEMS (KES-2018), 2018, 126 : 451 - 461
  • [32] Measuring Spatial Influence of Twitter Users by Interactions
    Wei, Hong
    Sankaranarayanan, Jagan
    Samet, Hanan
    PROCEEDINGS OF 1ST ACM SIGSPATIAL WORKSHOP ON ANALYTICS FOR LOCAL EVENTS AND NEWS (LENS 2017), 2015,
  • [33] Location Prediction Using Sentiments of Twitter Users
    Singh, Ritu
    Toshniwal, Durga
    BIG DATA ANALYTICS AND KNOWLEDGE DISCOVERY (DAWAK 2018), 2018, 11031 : 98 - 108
  • [34] Where's @Waldo?: Finding Users on Twitter
    Clarkson, Kyle
    Srivastava, Gautam
    Meawad, Fatma
    Dwivedi, Ashutosh Dhar
    ARTIFICIAL INTELLIGENCE AND SOFT COMPUTING, ICAISC 2019, PT II, 2019, 11509 : 338 - 349
  • [35] Towards the Elimination of the Miscommunication Between Users in Twitter
    Ueda, Tomoaki
    Orihara, Ryohei
    Sei, Yuichi
    Tahara, Yasuyuki
    Ohsuga, Akihiko
    AI 2015: ADVANCES IN ARTIFICIAL INTELLIGENCE, 2015, 9457 : 589 - 595
  • [36] An Analysis of Positivity and Negativity Attributes of Users in Twitter
    Roshanaei, Mahnaz
    Mishra, Shivakant
    2014 PROCEEDINGS OF THE IEEE/ACM INTERNATIONAL CONFERENCE ON ADVANCES IN SOCIAL NETWORKS ANALYSIS AND MINING (ASONAM 2014), 2014, : 365 - 370
  • [37] Treemap Visualization: A Hierarchical Method for Discovering User Profiles on Twitter
    Lopez-Ornelas, Erick
    Abascal-Mena, Rocio
    COMPUTACION Y SISTEMAS, 2022, 26 (01): : 195 - 202
  • [38] Discovering Potential Effects of Dietary Supplements from Twitter Data
    Jiang, Keyuan
    Tang, Yongbing
    Cook, G. Elliott
    Madden, Michael M.
    PROCEEDINGS OF THE 2017 INTERNATIONAL CONFERENCE ON DIGITAL HEALTH (DH'17), 2017, : 119 - 126
  • [39] Aggregation and Summarization of Thematically Similar Twitter Microblog Messages
    Endres, Markus
    Rudenko, Lena
    Groeninger, Dominik
    ADVANCES IN DATABASES AND INFORMATION SYSTEMS, ADBIS 2021, 2021, 12843 : 102 - 117
  • [40] Discovering topics in Twitter about the COVID-19 outbreak in Spain
    Aguero-Torales, Marvin M., I
    Vilares, David
    Lopez-Herrera, Antonio G.
    PROCESAMIENTO DEL LENGUAJE NATURAL, 2021, (66): : 177 - 190