Are bitvectors optimal?

被引:45
作者
Buhrman, H
Miltersen, PB
Radhakrishnan, J
Venkatesh, S
机构
[1] CWI, NL-1090 GB Amsterdam, Netherlands
[2] Aarhus Univ, BRICS, DK-8000 Aarhus C, Denmark
[3] Aarhus Univ, Dept Comp Sci, DK-8000 Aarhus C, Denmark
[4] Tata Inst Fundamental Res, Sch Technol & Comp Sci, Mumbai 400005, India
[5] Inst Adv Study, Princeton, NJ 08540 USA
关键词
data structures; set membership problem; bit probe model; set systems;
D O I
10.1137/S0097539702405292
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
We study the static membership problem : Given a set S of at most n keys drawn from a universe U of size m, store it so that queries of the form Is u in S?" can be answered by making few accesses to the memory. We study schemes for this problem that use space close to the information theoretic lower bound of ( n log( m n)) bits and yet answer queries by reading a small number of bits of the memory. We show that, for epsilon > 0, there is a scheme that stores O (n/epsilon2 log m) bits and answers membership queries using a randomized algorithm that reads just one bit of memory and errs with probability at most. We consider schemes that make no error for queries in S but are allowed to err with probability at most for queries not in S. We show that there exist such schemes that store O ((n/epsilon)(2) log m) bits and answer queries using just one bitprobe. If multiple probes are allowed, then the number of bits stored can be reduced to O(n(1+delta) log m) for any delta > 0. The schemes mentioned above are based on probabilistic constructions of set systems with small intersections. We show lower bounds that come close to our upper bounds (for a large range of n and epsilon) : Schemes that answer queries with just one bitprobe and error probability epsilon must use Omega(n(2)/epsilon(2) log(n/epsilon) log m) bits of storage; if the error is restricted to queries not in S, then the scheme must use Omega(n(2)/epsilon(2) log(n/epsilon) log m) bits of storage. We also consider deterministic schemes for the static membership problem and show tradeoffs between space and the number of probes.
引用
收藏
页码:1723 / 1744
页数:22
相关论文
共 37 条
  • [1] ALON N, 1992, PROBABILISTIC METHOD
  • [2] ARVIND V, 1997, LECT NOTES COMPUTER, V1346, P235
  • [3] Babai Laszlo, 1991, P 23 ANN ACM S THEOR, P21, DOI [10.1145/103418.103428, DOI 10.1145/103418.103428]
  • [4] Bollobas B, 1985, RANDOM GRAPHS
  • [5] Membership in constant time and almost-minimum space
    Brodnik, A
    Munro, JI
    [J]. SIAM JOURNAL ON COMPUTING, 1999, 28 (05) : 1627 - 1640
  • [6] BRODNIK A, 1994, LECT NOTES COMPUT SC, V855, P72
  • [7] Chaudhuri S., 1996, Proceedings of the Twenty-Eighth Annual ACM Symposium on the Theory of Computing, P30, DOI 10.1145/237814.237824
  • [8] Dyachkov Arkadii Georgievich, 1982, Problemy Peredachi Informatsii, V18, P7
  • [9] COMPLEXITY OF SOME SIMPLE RETRIEVAL PROBLEMS
    ELIAS, P
    FLOWER, RA
    [J]. JOURNAL OF THE ACM, 1975, 22 (03) : 367 - 379
  • [10] FAMILIES OF FINITE SETS IN WHICH NO SET IS COVERED BY THE UNION OF R OTHERS
    ERDOS, P
    FRANKL, P
    FUREDI, Z
    [J]. ISRAEL JOURNAL OF MATHEMATICS, 1985, 51 (1-2) : 79 - 89