Maximum entropy networks for large scale social network node analysis

被引:8
作者
De Clerck, Bart [1 ,2 ]
Rocha, Luis E. C. [1 ,3 ]
Van Utterbeeck, Filip [2 ]
机构
[1] Univ Ghent, Dept Econ, Ghent, Belgium
[2] Royal Mil Acad, Dept Math, Brussels, Belgium
[3] Univ Ghent, Dept Phys & Astron, Ghent, Belgium
关键词
Social networks; Maximum entropy networks; Disinformation identification; Network analysis; COMMUNITY STRUCTURE;
D O I
10.1007/s41109-022-00506-7
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Recently proposed computational techniques allow the application of various maximum entropy network models at a larger scale. We focus on disinformation campaigns and apply different maximum entropy network models on the collection of datasets from the Twitter information operations report. For each dataset, we obtain additional Twitter data required to build an interaction network. We consider different interaction networks which we compare to an appropriate null model. The null model is used to identify statistically significant interactions. We validate our method and evaluate to what extent it is suited to identify communities of members of a disinformation campaign in a non-supervised way. We find that this method is suitable for larger social networks and allows to identify statistically significant interactions between users. Extracting the statistically significant interaction leads to the prevalence of users involved in a disinformation campaign being higher. We found that the use of different network models can provide different perceptions of the data and can lead to the identification of different meaningful patterns. We also test the robustness of the methods to illustrate the impact of missing data. Here we observe that sampling the correct data is of great importance to reconstruct an entire disinformation operation.
引用
收藏
页数:22
相关论文
共 74 条
  • [1] On the Bias of Traceroute Sampling: or, Power-Law Degree Distributions in Regular Graphs
    Achlioptas, Dimitris
    Clauset, Aaron
    Kempe, David
    Moore, Cristopher
    [J]. JOURNAL OF THE ACM, 2009, 56 (04)
  • [2] Baltakiene M, 2018, Arxiv, DOI [arXiv:1805.04307, DOI 10.48550/ARXIV.1805.04307]
  • [3] Extracting significant signal of news consumption from social networks: the case of Twitter in Italian political elections
    Becatti, Carolina
    Caldarelli, Guido
    Lambiotte, Renaud
    Saracco, Fabio
    [J]. PALGRAVE COMMUNICATIONS, 2019, 5 (1)
  • [4] CONTROLLING THE FALSE DISCOVERY RATE - A PRACTICAL AND POWERFUL APPROACH TO MULTIPLE TESTING
    BENJAMINI, Y
    HOCHBERG, Y
    [J]. JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 1995, 57 (01) : 289 - 300
  • [5] Bianconi G, 2018, MULTILAYER NETWORKS: STRUCTURE AND FUNCTION, DOI 10.1093/oso/9780198753919.001.0001
  • [6] Bianconi G., 2021, Higher-Order Networks, Elements in the Structure and Dynamics of Complex Networks
  • [7] Statistical mechanics of multiplex networks: Entropy and overlap
    Bianconi, Ginestra
    [J]. PHYSICAL REVIEW E, 2013, 87 (06)
  • [8] Fast unfolding of communities in large networks
    Blondel, Vincent D.
    Guillaume, Jean-Loup
    Lambiotte, Renaud
    Lefebvre, Etienne
    [J]. JOURNAL OF STATISTICAL MECHANICS-THEORY AND EXPERIMENT, 2008,
  • [9] Bradshaw S, 2018, TECHNICAL REPORT
  • [10] Bruno M, 2021, Arxiv, DOI arXiv:2107.14155