Empirical networks are sparse: Enhancing multiedge models with zero-inflation

被引:0
作者
Casiraghi, Giona [1 ]
Andres, Georges [1 ]
机构
[1] Swiss Fed Inst Technol, Chair Syst Design, Weinbergstr 56-58, CH-8092 Zurich, Switzerland
来源
PNAS NEXUS | 2025年 / 4卷 / 01期
基金
瑞士国家科学基金会;
关键词
sparsity; zero-inflation; multiedges; statistical modeling; complex networks; RANDOM GRAPH MODELS; POISSON REGRESSION;
D O I
10.1093/pnasnexus/pgaf001
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Real-world networks are sparse. As we show in this article, even when a large number of interactions is observed, most node pairs remain disconnected. We demonstrate that classical multiedge network models, such as the G(N,p), configuration models, and stochastic block models, fail to accurately capture this phenomenon. To mitigate this issue, zero-inflation must be integrated into these traditional models. Through zero-inflation, we incorporate a mechanism that accounts for the excess number of zeroes (disconnected pairs) observed in empirical data. By performing an analysis on all the datasets from the Sociopatterns repository, we illustrate how zero-inflated models more accurately reflect the sparsity and heavy-tailed edge count distributions observed in empirical data. Our findings underscore that failing to account for these ubiquitous properties in real-world networks inadvertently leads to biased models that do not accurately represent complex systems and their dynamics.
引用
收藏
页数:11
相关论文
共 35 条
  • [1] Intrinsic priors for comparing zero-inflation parameters in Poisson models
    Kim, Kipum
    Jeong, Hyeon Jun
    Kim, Yongdai
    Kim, Seong W.
    HACETTEPE JOURNAL OF MATHEMATICS AND STATISTICS, 2025, 54 (01): : 319 - 335
  • [2] Testing for varying zero-inflation and dispersion in generalized Poisson regression models
    Xie, Feng-Chang
    Lin, Jin-Guan
    Wei, Bo-Cheng
    JOURNAL OF APPLIED STATISTICS, 2010, 37 (09) : 1509 - 1522
  • [3] A Note on Tests for Zero-Inflation in Correlated Count Data
    Xiang, Liming
    Teo, Guo Shou
    COMMUNICATIONS IN STATISTICS-SIMULATION AND COMPUTATION, 2011, 40 (07) : 992 - 1005
  • [4] Score tests for zero-inflation and over-dispersion in generalized linear models
    Deng, DL
    Paul, SR
    STATISTICA SINICA, 2005, 15 (01) : 257 - 276
  • [5] A New Computational Algorithm for Assessing Overdispersion and Zero-Inflation in Machine Learning Count Models with Python']Python
    Favero, Luiz Paulo Lopes
    Duarte, Alexandre
    Santos, Helder Prado
    COMPUTERS, 2024, 13 (04)
  • [6] Addressing overdispersion and zero-inflation for clustered count data via new multilevel heterogenous hurdle models
    Altinisik, Yasin
    JOURNAL OF APPLIED STATISTICS, 2023, 50 (02) : 408 - 433
  • [7] The misuse of the Vuong test for non-nested models to test for zero-inflation
    Wilson, Paul
    ECONOMICS LETTERS, 2015, 127 : 51 - 53
  • [8] A score test for zero-inflation in correlated count data
    Xiang, Liming
    Lee, Andy H.
    Yau, Kelvin K. W.
    McLachlan, Geoffrey J.
    STATISTICS IN MEDICINE, 2006, 25 (10) : 1660 - 1671
  • [9] Sensitivity of score tests for zero-inflation in count data
    Lee, AH
    Xiang, LM
    Fung, WK
    STATISTICS IN MEDICINE, 2004, 23 (17) : 2757 - 2769
  • [10] A score test for zero-inflation in multilevel count data
    Moghimbeigi, Abbas
    Eshraghian, Mohammad Reza
    Mohammad, Kazem
    McArdle, Brian
    COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2009, 53 (04) : 1239 - 1248