Efficient retrieval of multidimensional datasets through parallel I/O

Cited by: 4
Authors
Prabhakar, S [1]
Abdel-Ghaffar, K [1]
Agrawal, D [1]
El Abbadi, A [1]
Affiliation
[1] Purdue Univ, W Lafayette, IN 47907 USA
Source
FIFTH INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPUTING, PROCEEDINGS | 1998
Keywords
DOI
10.1109/HIPC.1998.738011
Chinese Library Classification
TP3 [Computing technology, computer technology];
Discipline classification code
0812 ;
Abstract
Many scientific and engineering applications process large multidimensional datasets. An important access pattern for these applications is the retrieval of data corresponding to ranges of values in multiple dimensions. Performance is limited by disks, largely due to high disk latencies. Tiling and distributing the data across multiple disks is an effective technique for improving performance through parallel I/O. The distribution of tiles across the disks is an important factor in achieving gains. Several schemes for declustering multidimensional data to improve the performance of range queries have been proposed in the literature. We extend the class of Cyclic schemes, which were developed earlier for two-dimensional data, to multiple dimensions. We establish important properties of Cyclic schemes, based upon which we reduce the search space for determining good declustering schemes within the class of Cyclic schemes. Through experimental evaluation, we establish that the Cyclic schemes are superior to other declustering schemes, including the state-of-the-art, both in terms of the degree of parallelism and robustness.
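The abstract does not spell out the tile-to-disk mapping, so the following is only an illustrative sketch under stated assumptions. It assumes one common formulation of a cyclic allocation, in which tile (i_1, ..., i_d) is placed on disk (H_1*i_1 + ... + H_d*i_d) mod M. The function names, the skip values `skips` (the H_k), and the disk count `num_disks` (M) are hypothetical and chosen for illustration; they are not taken from the paper. The sketch compares the number of parallel I/O rounds a range query needs (the largest number of its tiles that land on a single disk) against the ideal ceil(tiles / M).

```python
from collections import Counter
from itertools import product
from math import ceil


def cyclic_disk(tile, skips, num_disks):
    """Map a tile coordinate to a disk: (sum of H_k * i_k) mod M.

    Assumed formulation of a cyclic allocation; `skips` holds the H_k.
    """
    return sum(h * i for h, i in zip(skips, tile)) % num_disks


def query_cost(origin, shape, skips, num_disks):
    """Parallel I/O rounds for a range query: the maximum number of
    the query's tiles that fall on any one disk."""
    ranges = [range(o, o + s) for o, s in zip(origin, shape)]
    load = Counter(
        cyclic_disk(tile, skips, num_disks) for tile in product(*ranges)
    )
    return max(load.values())


def optimal_cost(shape, num_disks):
    """Lower bound on rounds: ceil(number of tiles / number of disks)."""
    tiles = 1
    for s in shape:
        tiles *= s
    return ceil(tiles / num_disks)


if __name__ == "__main__":
    # A 2x2 query over 5 disks, with two hypothetical skip choices.
    for skips in [(1, 1), (1, 2)]:
        cost = query_cost((0, 0), (2, 2), skips, 5)
        print(skips, "rounds:", cost, "ideal:", optimal_cost((2, 2), 5))
```

In this small case the skip choice alone decides whether the query meets the lower bound, which is why narrowing the search space over skip values matters when choosing a scheme.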
Pages: 375-382
Number of pages: 8