TSim: a system for discovering similar users on Twitter

被引:8
作者
AlMahmoud, Hind [1 ]
AlKhalifa, Shurug [1 ]
机构
[1] King Saud Univ, Coll Comp & Informat Sci, Informat Technol Dept, Riyadh, Saudi Arabia
关键词
Twitter; MapReduce; Similarity on social media; Big data;
D O I
10.1186/s40537-018-0147-2
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
This paper presents a framework for discovering similar users on Twitter that can be used in profiling users for social, recruitment and security reasons. The framework contains a novel formula that calculates the similarity between users on Twitter by using seven different signals (features). The signals are followings and followers, mention, retweet, favorite, common hashtag, common interests, and profile similarity. The proposed framework is scalable and can handle big data because it is implemented using the MapReduce paradigm. It is also adjustable since the weight and contribution of each signal in calculating the final similarity score is determined by the user based on their needs. The accuracy of the system was evaluated through human judges and by comparing the system's results against Twitter's Who To Follow service. The results show moderately accurate results.
引用
收藏
页数:20
相关论文
共 10 条
[1]  
Dean J, 2004, USENIX ASSOCIATION PROCEEDINGS OF THE SIXTH SYMPOSIUM ON OPERATING SYSTEMS DESIGN AND IMPLEMENTATION (OSDE '04), P137
[2]  
Goel A, 2013, WORKSH MIN LEARN GRA
[3]  
Gupta Pankaj, 2013, P 22 INT C WORLD WID, V13, P505, DOI DOI 10.1145/2488388.2488433
[4]  
Kamath K., 2014, US ENG OPT WORKSH KD
[5]  
Karpman Claire, 2020, PSYCHOL TODAY, P39
[6]   Discovering similar Twitter accounts using semantics [J].
Razis, Gerasimos ;
Anagnostopoulos, Ioannis .
ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2016, 51 :37-49
[7]  
Smith Craig, 2016, DMR
[8]  
Socher R., 2013, EMNLP, P1631, DOI DOI 10.1371/JOURNAL.PONE.0073791
[9]   Mining Interesting Topics in Twitter Communities [J].
Vathi, Eleni ;
Siolas, Georgios ;
Stafylopatis, Andreas .
COMPUTATIONAL COLLECTIVE INTELLIGENCE (ICCCI 2015), PT I, 2015, 9329 :123-132
[10]  
Word Lists by Theme, WORDBANKS ENCHANTEDL