Patterns for Indexing Large Datasets

被引:0
作者
Gaur, Garima [1 ]
Kalra, Sumit [1 ]
Bhattacharya, Arnab [1 ]
机构
[1] Indian Inst Technol Kanpur, Dept Comp Sci & Engn, Kanpur, Uttar Pradesh, India
来源
EUROPLOP 2018: PROCEEDINGS OF THE 23RD EUROPEAN CONFERENCE ON PATTERN LANGUAGES OF PROGRAMS | 2018年
关键词
Indexing; Hierarchical structure; High-dimensional datasets;
D O I
10.1145/3282308.3282314
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Searching is one of the fundamental tasks in Computer Science. An intuitive way to search is to do it linearly, that is, start at the beginning of the dataset and continue till the searched-for item is found or nothing is found. However, as the volume of data increases, the response time of linear search is no longer acceptable. Indexes are designed to search through massive datasets quickly. There are a number of different ways of building complex and advanced indexes. Appropriate selection and modification of indexing structures according to dynamic business requirements is crucial for data-intensive applications. In this work, we present a few basic reusable indexing structures. These structures can be used to create advanced and complex indexing structures with lesser effort and time.
引用
收藏
页数:6
相关论文
共 10 条
  • [1] [Anonymous], 2014, FUNDAMENTALS DATABAS
  • [2] Bayer R., 2002, Organization and maintenance of large ordered indexes
  • [3] Berchtold Stefan, P 1998 ACM SIGMOD IN
  • [4] Berchtold Stefan, P 22 INT C VER LARG
  • [5] Golub G.H., 1971, SINGULAR VALUE DECOM, DOI [10.1007/978-3-642-86940-2_10, DOI 10.1007/978-3-642-86940-2_10]
  • [6] GUTTMAN A, P 1984 ACM SIGMOD IN
  • [7] iDistance:: An adaptive B+-tree based indexing method for nearest neighbor search
    Jagadish, HV
    Ooi, BC
    Tan, KL
    Yu, C
    Zhang, R
    [J]. ACM TRANSACTIONS ON DATABASE SYSTEMS, 2005, 30 (02): : 364 - 397
  • [8] Jolliffe I., 2002, PRINCIPAL COMPONENT
  • [9] Ooi Beng Chin, 2000, INDEXING EDGES MDASH
  • [10] Tenenbaum J., 2000, GLOBAL GEOMETRIC FRA