Evaluating Lossy Compression on Climate Data

Times cited: 0
Authors
Huebbe, Nathanael [1]
Wegener, Al [2]
Kunkel, Julian Martin [1]
Ling, Yi [2]
Ludwig, Thomas [3]
Affiliations
[1] Univ Hamburg, Hamburg, Germany
[2] Samplify, Santa Clara, CA USA
[3] German Climate Comp Ctr, Hamburg, Germany
Source
SUPERCOMPUTING (ISC 2013) | 2013 / Volume 7905
Keywords
Data Compression; GRIB2; JPEG 2000; APAX
DOI
Not available
Chinese Library Classification number
TP301 [Theory, Methods];
Subject classification code
081202;
Abstract
While the amount of data used by today's high-performance computing (HPC) codes is huge, HPC users have not broadly adopted data compression techniques, apparently out of a fear that compression will either unacceptably degrade data quality or be too slow to be worth the effort. In this paper, we examine the effects of three lossy compression methods (GRIB2 encoding, GRIB2 using JPEG 2000 and LZMA, and the commercial Samplify APAX algorithm) on decompressed data quality, compression ratio, and processing time. A careful evaluation of selected lossy and lossless compression methods is conducted, assessing their influence on data quality, storage requirements, and performance. The differences between input and decoded datasets are described and compared for the GRIB2 and APAX compression methods. Performance is measured using the compressed file sizes and the time spent on compression and decompression. The test data consist of 9 synthetic datasets that expose compression behavior and 123 climate variables output by a climate model. The benefits of lossy compression for HPC systems are described and related to our findings on data quality.
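The three quantities the abstract names (decoded data quality, compression ratio, and processing time) can be measured with a small harness like the sketch below. It is only an illustration: the codec is a stand-in (16-bit linear quantization followed by LZMA from the Python standard library), not GRIB2, JPEG 2000, or APAX, and the function names `quantize` and `evaluate`, the 64-bit input layout, and the choice of error metrics are all assumptions that do not come from the paper.

```python
import lzma
import math
import struct
import time

BITS = 16  # assumed code width; matches the unsigned 16-bit "H" struct format


def quantize(values):
    """Map floats linearly onto integer codes in [0, 2**BITS - 1] (lossy step)."""
    lo, hi = min(values), max(values)
    max_code = 2**BITS - 1
    step = (hi - lo) / max_code or 1.0  # avoid a zero step for constant fields
    codes = [min(max(round((v - lo) / step), 0), max_code) for v in values]
    return codes, lo, step


def evaluate(values):
    """Return ratio, error, and timing figures for the stand-in codec."""
    raw = struct.pack(f"<{len(values)}d", *values)  # assumed 64-bit input

    t0 = time.perf_counter()
    codes, lo, step = quantize(values)
    packed = lzma.compress(struct.pack(f"<{len(codes)}H", *codes))
    compress_s = time.perf_counter() - t0

    t0 = time.perf_counter()
    decoded_codes = struct.unpack(f"<{len(codes)}H", lzma.decompress(packed))
    decoded = [lo + c * step for c in decoded_codes]
    decompress_s = time.perf_counter() - t0

    errors = [abs(a - b) for a, b in zip(values, decoded)]
    return {
        "ratio": len(raw) / len(packed),
        "max_abs_error": max(errors),
        "rmse": math.sqrt(sum(e * e for e in errors) / len(errors)),
        "step": step,
        "compress_s": compress_s,
        "decompress_s": decompress_s,
    }


if __name__ == "__main__":
    # Hypothetical smooth field, loosely resembling a temperature series in kelvin.
    field = [280.0 + 15.0 * math.sin(i / 50.0) for i in range(10000)]
    for name, value in evaluate(field).items():
        print(f"{name}: {value:.6g}")
```

With linear quantization the maximum absolute error is bounded by half the quantization step, which makes the quality side of the trade-off easy to check against the measured ratio and timings.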
Pages: 343-356
Page count: 14