Evaluation of Parallel Indexing Scheme for Big Data

被引:1
作者
Funaki, Kenta [1 ]
Hochin, Teruhisa [1 ]
Nomiya, Hiroki [1 ]
Nakanishi, Hideya [2 ]
机构
[1] Kyoto Inst Technol, Dept Informat Sci, Kyoto, Japan
[2] Natl Inst Fus Sci, Gifu, Japan
来源
3RD INTERNATIONAL CONFERENCE ON APPLIED COMPUTING AND INFORMATION TECHNOLOGY (ACIT 2015) 2ND INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE AND INTELLIGENCE (CSI 2015) | 2015年
关键词
Multi-dimensional index; Parallel processing; Big data; Insertion performance;
D O I
10.1109/ACIT-CSI.2015.37
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper evaluates the parallel indexing scheme proposed for efficient processing of big data. In the proposed scheme, three kinds of computing nodes are introduced. These are reception-nodes, representative-nodes, and normal-nodes. A reception-node receives data for insertion. A representative-node receives queries. Normal-nodes retrieve data from indexes. Three kinds of indexes are also introduced. These are a whole-index, a partial-index, and a reception-index. In a partial-index, data are stored. In a whole-index, partial-indexes are stored as its data. In a reception-index, additional data are stored. The reception index is moved to a normal-node, and becomes a partial-index. The proposed scheme is also a data distribution scheme for shortening the insertion time. It is experimentally clarified that data can be inserted into nodes with little time overhead. It is also clarified that retrieval time decreases according to the number of normal-nodes. It is shown that the overlap distribution scheme is considered to be better than the area expansion and the proximity ones.
引用
收藏
页码:148 / 153
页数:6
相关论文
共 11 条
  • [11] Zhou Y., 2008, INT C EARTH OBS DAT