TSim: a system for discovering similar users on Twitter

Cited by: 8
Authors
AlMahmoud, Hind [1]
AlKhalifa, Shurug [1]
Affiliation
[1] King Saud Univ, Coll Comp & Informat Sci, Informat Technol Dept, Riyadh, Saudi Arabia
Keywords
Twitter; MapReduce; Similarity on social media; Big data
DOI
10.1186/s40537-018-0147-2
Chinese Library Classification number
TP301 [Theory, Methods]
Discipline classification code
081202
Abstract
This paper presents a framework for discovering similar users on Twitter that can be used to profile users for social, recruitment, and security purposes. The framework contains a novel formula that calculates the similarity between users on Twitter using seven different signals (features): followings and followers, mentions, retweets, favorites, common hashtags, common interests, and profile similarity. The framework is scalable and can handle big data because it is implemented using the MapReduce paradigm. It is also adjustable, since the weight and contribution of each signal to the final similarity score are set by the user according to their needs. The accuracy of the system was evaluated by human judges and by comparing the system's results against Twitter's Who To Follow service. The evaluation indicates moderate accuracy.
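The abstract does not give the similarity formula itself, so the following is only a minimal sketch of how seven per-signal scores might be combined with user-chosen weights. It assumes a normalized weighted sum and per-signal scores already scaled to [0, 1]; the signal names, the function `combined_similarity`, and the normalization are all hypothetical and are not taken from the paper.

```python
from typing import Dict

# Hypothetical names for the seven signals listed in the abstract.
SIGNALS = (
    "follow",           # followings and followers
    "mention",
    "retweet",
    "favorite",
    "common_hashtag",
    "common_interest",
    "profile",
)


def combined_similarity(scores: Dict[str, float], weights: Dict[str, float]) -> float:
    """Combine per-signal similarity scores into one score.

    Assumption (not from the paper): each score lies in [0, 1] and the
    final score is a weighted sum divided by the total weight, so the
    result also lies in [0, 1]. Missing signals count as 0.
    """
    total_weight = sum(weights.get(name, 0.0) for name in SIGNALS)
    if total_weight <= 0:
        raise ValueError("at least one signal weight must be positive")
    weighted = sum(weights.get(name, 0.0) * scores.get(name, 0.0) for name in SIGNALS)
    return weighted / total_weight


if __name__ == "__main__":
    # A user who only cares about retweets and shared hashtags.
    weights = {"retweet": 2.0, "common_hashtag": 1.0}
    scores = {"retweet": 0.9, "common_hashtag": 0.3, "mention": 0.8}
    print(combined_similarity(scores, weights))
```

Setting a weight to zero drops that signal from the score entirely, which is one simple way to model the adjustability the abstract describes.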
Pages: 20
Related papers
50 in total
  • [21] A Comparative Study of HITS vs PageRank Algorithms for Twitter Users Analysis
    Chien, Ong Kok
    Hoong, Poo Kuan
    Ho, Chiung Ching
    2014 INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE AND TECHNOLOGY (ICCST), 2014,
  • [22] ADVERTISING RECOMMENDATION SYSTEM BASED ON DYNAMIC DATA ANALYSIS ON TURKISH SPEAKING TWITTER USERS
    Sevli, Onur
    Kucuksille, Ecir Ugur
    TEHNICKI VJESNIK-TECHNICAL GAZETTE, 2017, 24 (02) : 571 - 578
  • [23] Twitter features distributions across similar labelers
    AlMansour, Amal Abdullah
    2017 13TH INTERNATIONAL COMPUTER ENGINEERING CONFERENCE (ICENCO), 2017, : 405 - 410
  • [24] New literacies practices of teenage Twitter users
    Gleason, Benjamin
    LEARNING MEDIA AND TECHNOLOGY, 2016, 41 (01) : 31 - 54
  • [25] New method to measure the influence of Twitter users
    Essaidi, Abdessamad
    Zaidouni, Dounia
    Bellafkih, Mostafa
    2020 FOURTH INTERNATIONAL CONFERENCE ON INTELLIGENT COMPUTING IN DATA SCIENCES (ICDS), 2020,
  • [26] An Analytical Model for Identifying Suspected Users on Twitter
    Singh, Monika
    Singh, Amardeep
    Bansal, Divya
    Sofat, Sanjeev
    CYBERNETICS AND SYSTEMS, 2019, 50 (04) : 383 - 404
  • [27] Features combination for gender recognition on Twitter users
    Fernandez, Daniela
    Moctezuma, Daniela
    Siordia, Oscar S.
    2016 IEEE INTERNATIONAL AUTUMN MEETING ON POWER, ELECTRONICS AND COMPUTING (ROPEC), 2016,
  • [28] Confidence Index Analysis of Twitter Users Timeline
    Is, Hafzullah
    Tuncer, Taner
    2018 INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND DATA PROCESSING (IDAP), 2018,
  • [29] Seeing and Believing: Evaluating the Trustworthiness of Twitter Users
    Khan, Tanveer
    Michalas, Antonis
    IEEE ACCESS, 2021, 9 : 110505 - 110516
  • [30] Analysing the connectivity and communication of suicidal users on twitter
    Colombo, Gualtiero B.
    Burnap, Pete
    Hodorog, Andrei
    Scourfield, Jonathan
    COMPUTER COMMUNICATIONS, 2016, 73 : 291 - 300