Reddit CrosspostNet-Studying Reddit Communities with Large-Scale Crosspost Graph Networks

被引:2
作者
Sawicki, Jan [1 ,2 ]
Ganzha, Maria [1 ]
Paprzycki, Marcin [3 ]
Watanobe, Yutaka [2 ]
机构
[1] Warsaw Univ Technol, Fac Math & Informat Sci, PL-00662 Warsaw, Poland
[2] Univ Aizu, Dept Comp Sci & Engn, Aizu Wakamatsu 9658580, Japan
[3] Polish Acad Sci, Syst Res Inst, PL-01447 Warsaw, Poland
关键词
graph network-based analysis; Reddit; subreddits; online social networks; big data; crossposts;
D O I
10.3390/a16090424
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
As the largest open social medium on the Internet, Reddit is widely studied in the scientific literature. Due to its structured form and division into topical subfora (subreddits), conducted research often concerns connections and interactions between users and/or whole, subreddit-structure-based communities. Overall, the relations between communities are most often studied by applying graph networks, with various creation algorithms. In this work, a novel approach is proposed to build and understand the structure of Reddit. It is based on crossposts-posts that appeared on one subreddit and then were crossposted to another. After capturing one year of crossposts, a directed weighted graph network, using seven million posts from over 10,000 of the most popular subreddits, has been created. Using graph network algorithms, its characteristics are captured and compared to similar studies. We identify the information "sinks" and "sources"-the most active crossposting subreddits. Moreover, we obtained graph network metrics: the degree (modeled with the Power Law), clustering, community detection algorithms, and connected components structure network are compared to previous studies on Reddit network(s), yielding consistent, but also novel results. Finally, the relations between extensively studied subreddits (e.g., r/AITA, r/Parenting, r/politics) and new ones, which were not accounted for in previous research, opening new paths for data-driven studies, are summarized.
引用
收藏
页数:21
相关论文
共 32 条
  • [1] Statistical mechanics of complex networks
    Albert, R
    Barabási, AL
    [J]. REVIEWS OF MODERN PHYSICS, 2002, 74 (01) : 47 - 97
  • [2] Baowaly MK, 2022, COMPUT SIST, V26, P311, DOI [10.13053/cys-26-1-4175, 10.13053/CyS-26-1-4175]
  • [3] Fast unfolding of communities in large networks
    Blondel, Vincent D.
    Guillaume, Jean-Loup
    Lambiotte, Renaud
    Lefebvre, Etienne
    [J]. JOURNAL OF STATISTICAL MECHANICS-THEORY AND EXPERIMENT, 2008,
  • [4] Analysis of Moral Judgment on Reddit
    Botzer, Nicholas
    Gu, Shawn
    Weninger, Tim
    [J]. IEEE TRANSACTIONS ON COMPUTATIONAL SOCIAL SYSTEMS, 2023, 10 (03) : 947 - 957
  • [5] Identifying Social Roles in reddit Using Network Structure
    Buntain, Cody
    Golbeck, Jennifer
    [J]. WWW'14 COMPANION: PROCEEDINGS OF THE 23RD INTERNATIONAL CONFERENCE ON WORLD WIDE WEB, 2014, : 615 - 620
  • [6] The effect of toxicity on COVID-19 news network formation in political subcommunities on Reddit: An affiliation network approach
    Chipidza, Wallace
    [J]. INTERNATIONAL JOURNAL OF INFORMATION MANAGEMENT, 2021, 61
  • [7] Clauset A, 2004, PHYS REV E, V70, DOI 10.1103/PhysRevE.70.066111
  • [8] Datta S., 2019, Proceedings of the international AAAI conference on Web and Social Media, V13, P146
  • [9] Social Norms on Reddit: A Demographic Analysis
    De Candia, Sara
    Morales, Gianmarco De Francisci
    Monti, Corrado
    Bonchi, Francesco
    [J]. PROCEEDINGS OF THE 14TH ACM WEB SCIENCE CONFERENCE, WEBSCI 2022, 2022, : 139 - 147
  • [10] De Choudhury M., 2014, 8 INT AAAI C WEBL SO, V8, P71, DOI [DOI 10.1609/ICWSM.V8I1.14526, 10.1609/icwsm.v8i1.14526]