Weighted Jump in Random Walk graph sampling

被引:0
作者
Qi, Xiao [1 ]
机构
[1] East China Normal Univ, 3663 North Zhongshan Rd, Shanghai 200062, Peoples R China
关键词
Random walk; Graph sampling; Data mining; Social networks; NETWORKS;
D O I
10.1016/j.neucom.2024.127581
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Graph sampling is a challenging problem in network analysis due to the complex structures of networks. Currently, a series of graph sampling algorithms based on random walks achieve good results in graph sampling tasks. However, the existing methods often reduce the conductance of graphs, causing the sampler to stay in the same node for a long time. This results in undersampling. In this paper, we propose a novel Weighted Jump Random Walk (WJRW) algorithm to generate representative samples. We design a parameter in the WJRW algorithm that can adjust the proportions of random walk and random jump in every step. According to the issue of repeated sample nodes in the Generalized Maximum Degree (GMD) method and the problem of large deviations in the Simple Random Walk (SRW), WJRW addresses the weaknesses of the GMD method and enhances the diffusion of a random walker on graphs, leading to a more representative sample. Then, WJRW addresses the issue of large deviations in SRW and enhances the efficiency of the unbiased estimator. By generating smoother stationary distributions. Numerical experiments with extensive real -world networks verify that our method achieves higher accuracy than classical and state-of-the-art methods in estimating distribution estimation.
引用
收藏
页数:10
相关论文
共 42 条
[1]   Search in power-law networks [J].
Adamic, L.A. ;
Lukose, R.M. ;
Puniyani, A.R. ;
Huberman, B.A. .
Physical Review E - Statistical, Nonlinear, and Soft Matter Physics, 2001, 64 (4 II) :461351-461358
[2]   Network Sampling: From Static to Streaming Graphs [J].
Ahmed, Nesreen K. ;
Neville, Jennifer ;
Kompella, Ramana .
ACM TRANSACTIONS ON KNOWLEDGE DISCOVERY FROM DATA, 2014, 8 (02)
[3]  
Arentze Theo, 2000, Citeseer
[4]   Random Sampling from a Search Engine's Index [J].
Bar-Yossef, Ziv ;
Gurevich, Maxim .
JOURNAL OF THE ACM, 2008, 55 (05)
[5]  
Bar-Yossef Ziv., 2000, VLDB '00: Proceedings of the 26th International Conference on Very Large Data Bases, P535
[6]   Self-Interacting Random Walks: Aging, Exploration, and First-Passage Times [J].
Barbier-Chebbah, A. ;
Benichou, O. ;
Voituriez, R. .
PHYSICAL REVIEW X, 2022, 12 (01)
[7]   On the stability of citation networks [J].
Benatti, Alexandre ;
de Arruda, Henrique Ferraz ;
Silva, Filipi Nascimento ;
Comin, Cesar Henrique ;
Costa, Luciano da Fontoura .
PHYSICA A-STATISTICAL MECHANICS AND ITS APPLICATIONS, 2023, 610
[8]   CORRELATED RANDOM-WALKS [J].
BENDER, EA ;
RICHMOND, LB .
ANNALS OF PROBABILITY, 1984, 12 (01) :274-278
[9]  
Bianchini M., 2005, ACM Transactions on Internet Technology, V5, P92, DOI 10.1145/1052934.1052938
[10]   Using biological networks to integrate, visualize and analyze genomics data [J].
Charitou, Theodosia ;
Bryan, Kenneth ;
Lynn, David J. .
GENETICS SELECTION EVOLUTION, 2016, 48