Multilevel techniques for compression and reduction of scientific datathe univariate case

被引:50
作者
Ainsworth, Mark [1 ,2 ]
Tugluk, Ozan [1 ,3 ]
Whitney, Ben [1 ]
Klasky, Scott [2 ]
机构
[1] Brown Univ, Div Appl Math, 182 George St, Providence, RI 02912 USA
[2] Oak Ridge Natl Lab, Comp Sci & Math Div, Oak Ridge, TN 37831 USA
[3] Univ Turkish Aeronaut Assoc, Dept Astronaut Engn, TR-06790 Ankara, Turkey
关键词
Data compression; Data reduction; Lossy compression; Multilevel compression; Error-controlled compression;
D O I
10.1007/s00791-018-00303-9
中图分类号
O1 [数学];
学科分类号
0701 ; 070101 ;
摘要
We present a multilevel technique for the compression and reduction of univariate data and give an optimal complexity algorithm for its implementation. A hierarchical scheme offers the flexibility to produce multiple levels of partial decompression of the data so that each user can work with a reduced representation that requires minimal storage whilst achieving the required level of tolerance. The algorithm is applied to the case of turbulence modelling in which the datasets are traditionally not only extremely large but inherently non-smooth and, as such, rather resistant to compression. We decompress the data for a range of relative errors, carry out the usual analysis procedures for turbulent data, and compare the results of the analysis on the reduced datasets to the results that would be obtained on the full dataset. The results obtained demonstrate the promise of multilevel compression techniques for the reduction of data arising from large scale simulations of complex phenomena such as turbulence modelling.
引用
收藏
页码:65 / 76
页数:12
相关论文
共 30 条
  • [1] COMPRESSION USING LOSSLESS DECIMATION: ANALYSIS AND APPLICATION
    Ainsworth, Mark
    Klasky, Scott
    Whitney, Ben
    [J]. SIAM JOURNAL ON SCIENTIFIC COMPUTING, 2017, 39 (04) : B732 - B757
  • [2] [Anonymous], 2017, J HOPKINS TURBULENCE
  • [3] Parallel Tensor Compression for Large-Scale Scientific Data
    Austin, Woody
    Ballard, Grey
    Kolda, Tamara G.
    [J]. 2016 IEEE 30TH INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM (IPDPS 2016), 2016, : 912 - 922
  • [4] THE HIERARCHICAL BASIS MULTIGRID METHOD
    BANK, RE
    DUPONT, TF
    YSERENTANT, H
    [J]. NUMERISCHE MATHEMATIK, 1988, 52 (04) : 427 - 458
  • [5] A BASIC NORM EQUIVALENCE FOR THE THEORY OF MULTILEVEL METHODS
    BORNEMANN, F
    YSERENTANT, H
    [J]. NUMERISCHE MATHEMATIK, 1993, 64 (04) : 455 - 476
  • [6] Burtscher M, 2016, SC '16: PROCEEDINGS OF THE INTERNATIONAL CONFERENCE FOR HIGH PERFORMANCE COMPUTING, NETWORKING, STORAGE AND ANALYSIS, P264, DOI 10.1109/SC.2016.22
  • [7] Cover T. M., 2006, WILEY SERIES TELECOM
  • [8] MULTILEVEL PRECONDITIONING
    DAHMEN, W
    KUNOTH, A
    [J]. NUMERISCHE MATHEMATIK, 1992, 63 (03) : 315 - 344
  • [9] THE WAVELET TRANSFORM, TIME-FREQUENCY LOCALIZATION AND SIGNAL ANALYSIS
    DAUBECHIES, I
    [J]. IEEE TRANSACTIONS ON INFORMATION THEORY, 1990, 36 (05) : 961 - 1005
  • [10] Fast Error-bounded Lossy HPC Data Compression with SZ
    Di, Sheng
    Cappello, Franck
    [J]. 2016 IEEE 30TH INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM (IPDPS 2016), 2016, : 730 - 739