Proof of uniform sampling of binary matrices with fixed row sums and column sums for the fast Curveball algorithm

被引:25
作者
Carstens, C. J. [1 ]
机构
[1] RMIT Univ, Sch Math & Geospatial Sci, Melbourne, Vic 3000, Australia
关键词
NETWORKS;
D O I
10.1103/PhysRevE.91.042812
中图分类号
O35 [流体力学]; O53 [等离子体物理学];
学科分类号
070204 ; 080103 ; 080704 ;
摘要
Randomization of binary matrices has become one of the most important quantitative tools in modern computational biology. The equivalent problem of generating random directed networks with fixed degree sequences has also attracted a lot of attention. However, it is very challenging to generate truly unbiased random matrices with fixed row and column sums. Strona et al. [Nat. Commun. 5, 4114 (2014)] introduce the innovative Curveball algorithm and give numerical support for the proposition that it generates truly random matrices. In this paper, we present a rigorous proof of convergence to the uniform distribution. Furthermore, we show the Curveball algorithm must include certain failed trades to ensure uniform sampling.
引用
收藏
页数:7
相关论文
共 17 条
[1]  
[Anonymous], 1996, SANKHYA INDIAN J S A
[2]  
[Anonymous], ARXIVCONDMAT0312028
[3]   Generating uniformly distributed random networks [J].
Artzy-Randrup, Y ;
Stone, L .
PHYSICAL REVIEW E, 2005, 72 (05)
[5]   Comment on "Subgraphs in random networks" [J].
Kino, OD .
PHYSICAL REVIEW E, 2004, 70 (05) :3
[6]   Bias in generation of random graphs [J].
Klein-Hennig, Hendrike ;
Hartmann, Alexander K. .
PHYSICAL REVIEW E, 2012, 85 (02)
[7]   Specificity and stability in topology of protein networks [J].
Maslov, S ;
Sneppen, K .
SCIENCE, 2002, 296 (5569) :910-913
[8]   Randomization of presence-absence matrices: Comments and new algorithms [J].
Miklos, I ;
Podani, J .
ECOLOGY, 2004, 85 (01) :86-92
[9]   Network motifs: Simple building blocks of complex networks [J].
Milo, R ;
Shen-Orr, S ;
Itzkovitz, S ;
Kashtan, N ;
Chklovskii, D ;
Alon, U .
SCIENCE, 2002, 298 (5594) :824-827
[10]  
Mitzenmacher M., 2017, Probability and Computing: Randomization and Probabilistic Techniques in Algorithms and Data Analysis