Function Interpolation for Learned Index Structures

被引:4
作者
Setiawan, Naufal Fikri [1 ]
Rubinstein, Benjamin I. P. [1 ]
Borovica-Gajic, Renata [1 ]
机构
[1] Univ Melbourne, Sch Comp & Informat Syst, Melbourne, Vic, Australia
来源
DATABASES THEORY AND APPLICATIONS, ADC 2020 | 2020年 / 12008卷
关键词
Indexing; Databases; Function approximation; TREE;
D O I
10.1007/978-3-030-39469-1_6
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Range indexes such as B-trees are widely recognised as effective data structures for enabling fast retrieval of records by the query key. While such classical indexes offer optimal worst-case guarantees, recent research suggests that average-case performance might be improved by alternative machine learning-based models such as deep neural networks. This paper explores an alternative approach by modelling the task as one of function approximation via interpolation between compressed subsets of keys. We explore the Chebyshev and Bernstein polynomial bases, and demonstrate substantial benefits over deep neural networks. In particular, our proposed function interpolation models exhibit memory footprint two orders of magnitude smaller compared to neural network models, and 30-40% accuracy improvement over neural networks trained with the same amount of time, while keeping query time generally on-par with neural network models.
引用
收藏
页码:68 / 80
页数:13
相关论文
共 24 条
  • [1] Aldà F, 2017, AAAI CONF ARTIF INTE, P1705
  • [2] [Anonymous], 2007, CIDR, DOI DOI 10.1002/PER
  • [3] [Anonymous], 1982, Introduction to Approximation Theory
  • [4] [Anonymous], 1989, SIGMOD Rec, DOI http
  • [5] [Anonymous], BIGLEARN NIPS WORKSH
  • [6] Bayer R., 1971, Organization and maintenance of large ordered indices
  • [7] Boyd JP, 2009, COMMUN COMPUT PHYS, V5, P484
  • [8] Brisebarre Nicolas, 2010, P INT S SYMB ALG COM, P147, DOI [10.1145/1837934.1837966, DOI 10.1145/1837934.1837966]
  • [9] FITing-Tree: A Data-aware Index Structure
    Galakatos, Alex
    Markovitch, Michael
    Binnig, Carsten
    Fonseca, Rodrigo
    Kraska, Tim
    [J]. SIGMOD '19: PROCEEDINGS OF THE 2019 INTERNATIONAL CONFERENCE ON MANAGEMENT OF DATA, 2019, : 1189 - 1206
  • [10] Gammerman A., 1998, Uncertainty in Artificial Intelligence. Proceedings of the Fourteenth Conference (1998), P148