Development of a GPU-based high-performance radiative transfer model for the Infrared Atmospheric Sounding Interferometer (IASI)

被引:48
|
作者
Huang, Bormin [1 ]
Mielikainen, Jarno [2 ]
Oh, Hyunjong [1 ]
Huang, Hung-Lung Allen [1 ]
机构
[1] Univ Wisconsin, Ctr Space Sci & Engn, Madison, WI 53706 USA
[2] Univ Eastern Finland, Dept Comp Sci, Kuopio, Finland
基金
芬兰科学院;
关键词
GPU; CUDA; Radiative transfer model; Infrared Atmospheric Sounding; Interferometer (IASI); ACCURATE TRANSMITTANCE MODEL; COMPUTATIONALLY FAST; ABSORBING GAS; PARALLEL; GRAPHICS;
D O I
10.1016/j.jcp.2010.09.011
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Satellite-observed radiance is a nonlinear functional of surface properties and atmospheric temperature and absorbing gas profiles as described by the radiative transfer equation (RTE). In the era of hyperspectral sounders with thousands of high-resolution channels, the computation of the radiative transfer model becomes more time-consuming. The radiative transfer model performance in operational numerical weather prediction systems still limits the number of channels we can use in hyperspectral sounders to only a few hundreds. To take the full advantage of such high-resolution infrared observations, a computationally efficient radiative transfer model is needed to facilitate satellite data assimilation. In recent years the programmable commodity graphics processing unit (GPU) has evolved into a highly parallel, multi-threaded, many-core processor with tremendous computational speed and very high memory bandwidth. The radiative transfer model is very suitable for the GPU implementation to take advantage of the hardware's efficiency and parallelism where radiances of many channels can be calculated in parallel in GPUs. In this paper, we develop a GPU-based high-performance radiative transfer model for the Infrared Atmospheric Sounding Interferometer (IASI) launched in 2006 onboard the first European meteorological polar-orbiting satellites, METOP-A. Each IASI spectrum has 8461 spectral channels. The IASI radiative transfer model consists of three modules. The first module for computing the regression predictors takes less than 0.004% of GPU time, while the second module for transmittance computation and the third module for radiance computation take approximately 92.5% and 7.5%, respectively. Our GPU-based IASI radiative transfer model is developed to run on a low-cost personal supercomputer with four GPUs with total 960 compute cores, delivering near 4 TFlops theoretical peak performance. By massively parallelizing the second and third modules, we reached 364x speedup for 1 GPU and 1455x speedup for all 4 GPUs, both with respect to the original GPU-based single-threaded Fortran code with the -O-2 compiling optimization. The significant 1455x speedup using a computer with four GPUs means that the proposed GPU-based high-performance forward model is able to compute one day's amount of 1,296,000 IASI spectra within nearly 10 min, whereas the original single GPU-based version will impractically take more than 10 days. This model runs over 80% of the theoretical memory bandwidth with asynchronous data transfer. A novel GPU-GPU pipeline implementation of the IASI radiative transfer model is proposed. The GPU-based high-performance IASI radiative transfer model is suitable for the assimilation of the IASI radiance observations into the operational numerical weather forecast model. (C) 2010 Elsevier Inc. All rights reserved.
引用
收藏
页码:2207 / 2221
页数:15
相关论文
共 8 条
  • [1] GPU-Accelerated Multi-Profile Radiative Transfer Model for the Infrared Atmospheric Sounding Interferometer
    Mielikainen, Jarno
    Huang, Bormin
    Huang, Hung-Lung Allen
    IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2011, 4 (03) : 691 - 700
  • [2] GFlow: Towards GPU-based High-Performance Table Matching in OpenFlow Switches
    Qiu, Kun
    Chen, Zhe
    Chen, Yang
    Zhao, Jin
    Wang, Xin
    2015 INTERNATIONAL CONFERENCE ON INFORMATION NETWORKING (ICOIN), 2015, : 283 - 288
  • [3] GPU-based high-performance computing for integrated surface-sub-surface flow modeling
    Le, Phong V. V.
    Kumar, Praveen
    Valocchi, Albert J.
    Dang, Hoang-Vu
    ENVIRONMENTAL MODELLING & SOFTWARE, 2015, 73 : 1 - 13
  • [4] Task offloading using GPU-based particle swarm optimization for high-performance vehicular edge computing
    Alqarni, Mohamed A.
    Mousa, Mohamed H.
    Hussein, Mohamed K.
    JOURNAL OF KING SAUD UNIVERSITY-COMPUTER AND INFORMATION SCIENCES, 2022, 34 (10) : 10356 - 10364
  • [5] Mille Cheval: a GPU-based in-memory high-performance computing framework for accelerated processing of big-data streams
    Vivek Kumar
    Dilip Kumar Sharma
    Vinay Kumar Mishra
    The Journal of Supercomputing, 2021, 77 : 6936 - 6960
  • [6] Mille Cheval: a GPU-based in-memory high-performance computing framework for accelerated processing of big-data streams
    Kumar, Vivek
    Sharma, Dilip Kumar
    Mishra, Vinay Kumar
    JOURNAL OF SUPERCOMPUTING, 2021, 77 (07): : 6936 - 6960
  • [7] G-SM3: High-Performance Implementation of GPU-based SM3 Hash Function
    Dong, Jiankuo
    Lu, Sheng
    Zhang, Pinchang
    Zheng, Fangyu
    Xiao, Fu
    2022 IEEE 28TH INTERNATIONAL CONFERENCE ON PARALLEL AND DISTRIBUTED SYSTEMS, ICPADS, 2022, : 201 - 208
  • [8] GPU-parallel performance of the community radiative transfer model (CRTM) with the optical depth in absorber space (ODAS)-based transmittance algorithm
    Mielikainen, Jarno
    Huang, Bormin
    Huang, Hung-Lung Allen
    Lee, Tsengdar
    HIGH-PERFORMANCE COMPUTING IN REMOTE SENSING II, 2012, 8539