Black marketed collusive users primary dataset from twitter/x online social media

被引:0
|
作者
Sabherwal, Suruchi [1 ]
Saxena, Bhawna [2 ]
Sinha, Adwitiya [3 ]
机构
[1] CMR Inst Technol, Informat Sci & Engn, Bengaluru, Karnataka, India
[2] Jaypee Inst Informat Technol, Comp Sc & Engn & Inf Tech, Noida, India
[3] TERI Sch Adv Studies, Nat & Appl Sci, Delhi, India
关键词
Online Social Network; Black market-driven Collusion; Freemium Services; Collusive Dataset; Machine Learning; Twitter/X;
D O I
10.1007/s13278-024-01373-z
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In the realm of online social media, the proliferation of collusive behavior presents significant challenges for maintaining platform integrity and trust. This study introduces a primary labeled dataset focused on black-marketed collusive users on social media platforms, especially Twitter/X, aiming to classify collusive and genuine social media profiles. Collusive users, often operating in networks to manipulate metrics such as likes, retweets, and followers, were identified through specific patterns of interaction and engagement. Genuine users, on the other hand, were selected based on their organic and non-manipulative activity. The construction of our primary collusion dataset involved a meticulous process of data collection from 4 black marketing sites, followed by extracting features from Twitter/X. This collusive users data was merged with some genuine user data, which were heuristically collected from Twitter/X. Our primary dataset provides a valuable resource for research using machine learning, network science, and social media analysis, enabling the development and testing of algorithms designed to detect colluded users. By facilitating a deeper understanding of collusive dynamics, this work contributes to the broader efforts of safeguarding the authenticity and reliability of social media platforms. This comprehensive dataset will serve as a foundational tool for advancing research in addressing the collusive users Twitter/X social media. For elaborating the possibilities of model building, we have showcased the usage of our dataset with 15 machine learning classifiers, of which the LightGBM model outperformed with an AUC of 0.94. We have also demonstrated model enhancements using hyperparameter optimization with Bayesian Optimizer, Tree-structured Parzen Estimator, and Random Grid Search.
引用
收藏
页数:19
相关论文
共 50 条
  • [31] Online social integration of migrants: Evidence from Twitter
    Kim, Jisu
    Sonne, Soazic Elise Wang
    Garimella, Kiran
    Grow, Andre
    Weber, Ingmar
    Zagheni, Emilio
    MIGRATION STUDIES, 2023, 11 (04) : 544 - 571
  • [32] Latent Attribute Inference of Users in Social Media with Very Small Labeled Dataset
    Xiao, Ding
    Wang, Rui
    Wu, Lingling
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2016, E99D (10): : 2612 - 2618
  • [33] Twitter: Hard to distinguish Social Bots from real Users
    不详
    INFORMATION-WISSENSCHAFT UND PRAXIS, 2019, 70 (04): : 223 - 224
  • [34] Online Communication about Depression and Anxiety among Twitter Users with Schizophrenia: Preliminary Findings to Inform a Digital Phenotype Using Social Media
    Hswen, Yulin
    Naslund, John A.
    Brownstein, John S.
    Hawkins, Jared B.
    PSYCHIATRIC QUARTERLY, 2018, 89 (03) : 569 - 580
  • [35] Online Communication about Depression and Anxiety among Twitter Users with Schizophrenia: Preliminary Findings to Inform a Digital Phenotype Using Social Media
    Yulin Hswen
    John A. Naslund
    John S. Brownstein
    Jared B. Hawkins
    Psychiatric Quarterly, 2018, 89 : 569 - 580
  • [36] C-ANN: a deep leaning model for detecting black-marketed colluders in Twitter social network
    Suruchi Gera
    Adwitiya Sinha
    Neural Computing and Applications, 2022, 34 : 15113 - 15127
  • [37] C-ANN: a deep leaning model for detecting black-marketed colluders in Twitter social network
    Gera, Suruchi
    Sinha, Adwitiya
    NEURAL COMPUTING & APPLICATIONS, 2022, 34 (18): : 15113 - 15127
  • [38] Politicians on Social Media. The online database of members of national parliaments on Twitter
    Haman, Michael
    Skolnik, Milan
    PROFESIONAL DE LA INFORMACION, 2021, 30 (02):
  • [39] Social media games: Insights from Twitter analytics
    Pai R.R.
    Alathur S.
    International Journal of Web Based Communities, 2020, 16 (01) : 34 - 50
  • [40] Twitter (X), Fast Fashion and Backlash: Argumentation and Ethics on Social Media
    Balabanova, Ekaterina
    Palmieri, Rudi
    Liu, Zixiu
    BUSINESS AND PROFESSIONAL COMMUNICATION QUARTERLY, 2023,