Sequential Random Sampling Revisited: Hidden Shuffle Method

被引：0

作者：

Shekelyan, Michael ^{[1
]}

Cormode, Graham ^{[1
]}

机构：

[1] Univ Warwick, Coventry, W Midlands, England

来源：

24TH INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS (AISTATS) | 2021年 / 130卷

基金：

欧洲研究理事会;

关键词：

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Random sampling (without replacement) is ubiquitously employed to obtain a representative subset of the data. Unlike common methods, sequential methods report samples in ascending order of index without keeping track of previous samples. This enables lightweight iterators that can jump directly from one sampled position to the next. Previously, sequential methods focused on drawing from the distribution of gap sizes, which requires intricate algorithms that are difficult to validate and can be slow in the worst-case. This can be avoided by a new method, the Hidden Shuffle. The name mirrors the fact that although the algorithm does not resemble shuffliing, its correctness can be proven by conceptualising the sampling process as a random shuffle. The Hidden Shuffle algorithm stores just a handful of values, can be implemented in few lines of code, offers strong worst-case guarantees and is shown to be faster than state-of-the-art methods while using comparably few random variates.

引用

页数：9

共 50 条

[31] A list sequential sampling method suitable for real-time sampling
Bondesson, Lennart
Thorburn, Daniel
SCANDINAVIAN JOURNAL OF STATISTICS, 2008, 35 (03) : 466 - 483
[32] Sequential Monte Carlo sampling in hidden Markov models of nonlinear dynamical systems
Zeng, X.
Anitescu, M.
APPLIED MATHEMATICS AND COMPUTATION, 2014, 233 : 507 - 521
[33] A Sequential Importance Sampling Algorithm for Generating Random Graphs with Prescribed Degrees
Blitzstein, Joseph
Diaconis, Persi
INTERNET MATHEMATICS, 2011, 6 (04) : 489 - 522
[34] Optimal Sequential Sampling Policy of Partitioned Random Search and Its Approximation
Z. B. Tang
Journal of Optimization Theory and Applications, 1998, 98 : 431 - 448
[35] Optimal sequential sampling policy of partitioned random search and its approximation
Tang, ZB
JOURNAL OF OPTIMIZATION THEORY AND APPLICATIONS, 1998, 98 (02) : 431 - 448
[36] Shuffle from sequential to parallel in production planning
Department of Computer Science, Hermman Oberth Faculty of Engineering, Lucian Blaga University of Sibiu, 4 Emil Cioran Street, 550025 Sibiu, Romania
WSEAS Trans. Comput. Res., 2008, 1 (1-8):
[37] Search for overall optimization with the method of random sampling
Li, Lin
Yuan, Xucang
Guangxue Jishu/Optical Technique, 1994, (01): : 2 - 5
[38] A CALCULATOR-ASSISTED METHOD OF RANDOM SAMPLING
SCHOEN, DJ
FRUCHTER, D
ECOLOGY, 1983, 64 (01) : 205 - 206
[39] A METHOD FOR RANDOM SELECTION OF ENVIRONMENTAL SAMPLING POINTS
TOOHEY, RE
STEBBINGS, JH
BROWN, W
HEALTH PHYSICS, 1987, 52 : S51 - S51
[40] A simple method for sampling random Clifford operators
Van den Berg, Ewout
2021 IEEE INTERNATIONAL CONFERENCE ON QUANTUM COMPUTING AND ENGINEERING (QCE 2021) / QUANTUM WEEK 2021, 2021, : 54 - 59

← 1 2 3 4 5 →