Accelerating the RTTOV-7 IASI and AMSU-A radiative transfer models on graphics processing units: evaluating central processing unit/graphics processing unit-hybrid and pure-graphics processing unit approaches

Cited by: 6
Authors
Mielikainen, Jarno [1 ]
Huang, Bormin [1 ]
Huang, Hung-Lung Allen [1 ]
Saunders, Roger [2 ]
Affiliations
[1] Univ Wisconsin, Space Sci & Engn Ctr, Cooperat Inst Meteorol Satellite Studies, Madison, WI 53706 USA
[2] Met Off, Exeter EX1 3PB, Devon, England
Source
JOURNAL OF APPLIED REMOTE SENSING | 2011 / Vol. 5
Keywords
radiative transfer model; RTTOV; IASI; AMSU-A; GPU; CUDA;
DOI
10.1117/1.3658028
Chinese Library Classification (CLC) number
X [Environmental science, safety science];
Subject classification code
08 ; 0830 ;
Abstract
Radiative Transfer for TOVS (RTTOV), where TOVS is the TIROS Operational Vertical Sounder, is a widely used radiative transfer model (RTM) for calculating radiances for satellite infrared and microwave sensors, including the 8461-channel Infrared Atmospheric Sounding Interferometer (IASI) and the 15-channel Advanced Microwave Sounding Unit-A (AMSU-A). In the era of hyperspectral sounders with thousands of spectral channels, RTM computation becomes increasingly time-consuming. RTM performance in operational numerical weather prediction systems still limits the number of channels used from hyperspectral sounders to only a few hundred. To take full advantage of such high-resolution infrared observations, a computationally efficient radiative transfer model is needed to facilitate satellite data assimilation. In this paper, we develop a parallel implementation of the RTTOV-7 IASI and AMSU-A RTMs that runs the predictor module on CPUs in a pipeline with the transmittance and radiance modules on NVIDIA many-core graphics processing units (GPUs). We show that concurrent execution of the RTTOV-7 IASI RTM on CPU and GPU, combined with asynchronous data transfer from CPU to GPU, allows the GPU-accelerated code running on the 240-core NVIDIA Tesla C1060 to reach speedups of 461x and 1793x for 1- and 4-GPU configurations, respectively. Computing one day's worth of 1,296,000 IASI spectra takes 2.8 days with the CPU code running on a single core of the host 3.0 GHz AMD Phenom II X4 940 CPU. GPU acceleration reduces this running time to 8.75 and 2.25 min on 1- and 4-GPU configurations, respectively. Speedups for the RTTOV AMSU-A RTM were 29x and 75x for 1 and 4 GPUs, respectively. To further boost the speedup of a multispectral RTM, we developed a novel pure-GPU version of the RTTOV AMSU-A RTM in which the predictor module also runs on GPUs, achieving a 96% reduction in host-to-device data transfer. The speedups for the pure-GPU AMSU-A RTM increase significantly to 56x and 125x for 1- and 4-GPU configurations, respectively. © 2011 Society of Photo-Optical Instrumentation Engineers (SPIE).
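The running times quoted in the abstract follow directly from the 2.8-day CPU baseline and the reported speedups. The short Python sketch below reproduces that arithmetic. The function names are hypothetical, and the multi-GPU scaling efficiency (4-GPU speedup divided by four times the 1-GPU speedup) is a derived quantity that the abstract does not itself report, so treat it as an illustrative assumption rather than a result of the paper.

```python
# Figures taken from the abstract.
CPU_DAYS = 2.8                      # single-core CPU time for one day of IASI spectra
SPECTRA_PER_DAY = 1_296_000         # IASI spectra in one day's data
IASI_SPEEDUP = {1: 461, 4: 1793}    # hybrid CPU/GPU speedup by GPU count
AMSU_HYBRID_SPEEDUP = {1: 29, 4: 75}
AMSU_PURE_GPU_SPEEDUP = {1: 56, 4: 125}

MINUTES_PER_DAY = 24 * 60


def gpu_minutes(cpu_days, speedup):
    """Accelerated running time in minutes for a given CPU baseline and speedup."""
    return cpu_days * MINUTES_PER_DAY / speedup


def scaling_efficiency(speedups, n_gpus=4):
    """Derived metric (not in the abstract): n-GPU speedup relative to ideal
    linear scaling of the 1-GPU speedup."""
    return speedups[n_gpus] / (n_gpus * speedups[1])


if __name__ == "__main__":
    for n_gpus, speedup in IASI_SPEEDUP.items():
        print(f"IASI, {n_gpus} GPU(s): {gpu_minutes(CPU_DAYS, speedup):.2f} min")

    # CPU cost per spectrum implied by the baseline.
    cpu_seconds_per_spectrum = CPU_DAYS * MINUTES_PER_DAY * 60 / SPECTRA_PER_DAY
    print(f"CPU time per IASI spectrum: {cpu_seconds_per_spectrum:.3f} s")

    for name, table in [("IASI hybrid", IASI_SPEEDUP),
                        ("AMSU-A hybrid", AMSU_HYBRID_SPEEDUP),
                        ("AMSU-A pure-GPU", AMSU_PURE_GPU_SPEEDUP)]:
        print(f"{name}: 4-GPU scaling efficiency {scaling_efficiency(table):.2f}")
```

Rounded to two decimals, the computed IASI times match the 8.75 and 2.25 min stated in the abstract.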
Pages: 14