Accelerating the RTTOV-7 IASI and AMSU-A radiative transfer models on graphics processing units: evaluating central processing unit/graphics processing unit-hybrid and pure-graphics processing unit approaches

被引:6
|
作者
Mielikainen, Jarno [1 ]
Huang, Bormin [1 ]
Huang, Hung-Lung Allen [1 ]
Saunders, Roger [2 ]
机构
[1] Univ Wisconsin, Space Sci & Engn Ctr, Cooperat Inst Meteorol Satellite Studies, Madison, WI 53706 USA
[2] Met Off, Exeter EX1 3PB, Devon, England
来源
JOURNAL OF APPLIED REMOTE SENSING | 2011年 / 5卷
关键词
radiative transfer model; RTTOV; IASI; AMSU-A; GPU; CUDA;
D O I
10.1117/1.3658028
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
The radiative transfer for television operational vertical sounder (RTTOV) is a widely-used radiative transfer model (RTM) for calculation of radiances for satellite infrared and microwave sensors, including the 8461-channel infrared atmospheric sounding interferometer (IASI) and the 15-band Advanced Microwave Sounding Unit-A (AMSU-A). In the era of hyperspectral sounders with thousands of spectral channels, the computation of the RTM becomes more time-consuming. The RTM performance in operational numerical weather prediction systems still limits the number of used channels in hyperspectral sounders to only a few hundred. To take full advantage of such high-resolution infrared observations, a computationally efficient radiative transfer model is needed to facilitate satellite data assimilation. In this paper, we develop the parallel implementation of the RTTOV-7 IASI and AMSU-A RTMs to run the predictor module on CPUs in pipeline with the transmittance and radiance modules on NVIDIA many-core graphics processing units (GPUs). We show that concurrent execution of RTTOV-7 IASI RTM on CPU and GPU, in addition to asynchronous data transfer from CPU to GPU, allows the GPU accelerated code running on the 240-core NVIDIA Tesla C1060 to reach a speedup of 461x and 1793x for 1- and 4-GPU configurations, respectively. To compute one day's amount of 1,296,000 IASI spectra, the CPU code running on the host AMD Phenom II X4 940 CPU core with 3.0 GHz will take 2.8 days. Thus, GPU acceleration reduced running time to 8.75 and 2.25 min on 1- and 4-GPU configurations, respectively. Speedup for the RTTOV AMSU-A RTM varied from 29x to 75x for 1 and 4 GPUs, respectively. To further boost the speedup of a multispectral RTM, we developed a novel pure-GPU version of the RTTOV AMSU-A RTM where the predictor module also runs on GPUs to achieve a 96% reduction in the host-to-device data transfer. The speedups for the pure-GPU AMSU-A RTM are significantly increased to 56x and 125x for 1- and 4-GPU configurations, respectively. C (C) 2011 Society of Photo-Optical Instrumentation Engineers (SPIE).
引用
收藏
页数:14
相关论文
共 50 条
  • [21] Accelerating robust 3D pose estimation utilizing a graphics processing unit
    Gerlach, Adam R.
    Walker, Bruce K.
    INTELLIGENT ROBOTS AND COMPUTER VISION XXVIII: ALGORITHMS AND TECHNIQUES, 2011, 7878
  • [22] Graphics processing unit implementation of lattice Boltzmann models for flowing soft systems
    Bernaschi, Massimo
    Rossi, Ludovico
    Benzi, Roberto
    Sbragaglia, Mauro
    Succi, Sauro
    PHYSICAL REVIEW E, 2009, 80 (06):
  • [23] Autism Disease Detection Using Transfer Learning Techniques: Performance Comparison Between Central Processing Unit vs Graphics Processing Unit Functions for Neural Networks
    Akter, Mst Shapna
    Shahriar, Hossain
    Cuzzocrea, Alfredo
    2023 IEEE 47TH ANNUAL COMPUTERS, SOFTWARE, AND APPLICATIONS CONFERENCE, COMPSAC, 2023, : 1084 - 1092
  • [24] Accelerating the formant synthesis of haegeum sounds using a general-purpose graphics processing unit
    Kang, Myeongsu
    Islam, Shohidul
    Islam, Rashedul
    Kim, Jong-Myon
    MULTIMEDIA TOOLS AND APPLICATIONS, 2016, 75 (23) : 15445 - 15459
  • [25] Accelerating 2d fault diagnosis of an induction motor using a graphics processing unit
    School of Electrical Engineering, University of Ulsan, Korea, Republic of
    不详
    Int. J. Multimedia Ubiquitous Eng., 1 (341-352):
  • [26] Accelerating IP routing algorithm using graphics processing unit for high speed multimedia communication
    Uddin, Jia
    Jeong, In-Kyu
    Kang, Myeongsu
    Kim, Cheol-Hong
    Kim, Jong-Myon
    MULTIMEDIA TOOLS AND APPLICATIONS, 2016, 75 (23) : 15365 - 15379
  • [27] Accelerating IP routing algorithm using graphics processing unit for high speed multimedia communication
    Jia Uddin
    In-Kyu Jeong
    Myeongsu Kang
    Cheol-Hong Kim
    Jong-Myon Kim
    Multimedia Tools and Applications, 2016, 75 : 15365 - 15379
  • [28] Accelerating the formant synthesis of haegeum sounds using a general-purpose graphics processing unit
    Myeongsu Kang
    Shohidul Islam
    Rashedul Islam
    Jong-Myon Kim
    Multimedia Tools and Applications, 2016, 75 : 15445 - 15459
  • [29] Accelerating Conjugate Heat Transfer Simulations in Squared Heated Cavities through Graphics Processing Unit (GPU) Computing
    Reis, Cesar Augusto Borges da Silva
    Botezelli, Daniel
    de Azevedo, Arthur Mendonca
    Magalhaes, Elisan dos Santos
    da Silveira Neto, Aristeu
    COMPUTATION, 2024, 12 (05)
  • [30] Parallel and distributed computing models on a graphics processing unit to accelerate simulation of membrane systems
    Maroosi, Ali
    Muniyandi, Ravie Chandren
    Sundararajan, Elankovan
    Zin, Abdullah Mohd
    SIMULATION MODELLING PRACTICE AND THEORY, 2014, 47 : 60 - 78