Using Multiple GPUs to Accelerate MTF Compensation and Georectification of High-Resolution Optical Satellite Images

被引:3
作者
Wang, Mi [1 ,2 ]
Fang, Liuyang [1 ,3 ]
Li, Deren [1 ,2 ]
Pan, Jun [1 ,2 ]
机构
[1] Wuhan Univ, State Key Lab Informat Engn Surveying Mapping & R, Wuhan 430079, Peoples R China
[2] Wuhan Univ, Collaborat Innovat Ctr Geospatial Technol, Wuhan 430079, Peoples R China
[3] Broadvis Engn Consultants, Natl Engn Lab Surface Transportat Weather Impacts, Kunming 650011, Peoples R China
基金
中国国家自然科学基金; 欧洲研究理事会; 国家高技术研究发展计划(863计划);
关键词
Basic implementation; cooperative processing (CP); correctness; georectification (GR); modulation transfer function compensation (MTFC); multiple graphic processing units (multi-GPUs); optimization independent processing (IP); SEGMENTATION;
D O I
10.1109/JSTARS.2015.2477460
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
The rapid growth in the volume of data collected by modern high-resolution optical satellites puts pressure on near real-time processing. In this paper, we present our recent work on the acceleration of modulation transfer function compensation (MTFC) and georectification (GR), two of the most time-consuming optical satellite image processing algorithms, using multiple graphic processing units (multi-GPUs). A tailored strip consisting of 10 ZY-3 nadir images and covering most of the disaster area caused by Typhoon Fitow is used for the experiment (ZY-3 is the first high-accuracy civilian stereo-mapping optical satellite of China). Rapid profiling of the algorithms reveals that compensation and rectification take virtually over 99.50% of the total run times of MTFC and GR. To shorten the time, we port these two operations to a multi-GPU system that consists of an Intel Core i7 CPU and three Fermi-architecture NVIDIA GTX 580 GPUs. First, kernel arrangement and initial settings are determined in the early stage for basic single-GPU implementation. Second, three optimization measures, i.e., maximizing memory throughput, optimizing flow control instructions, and overlapping data transfer and kernel execution, are taken to further improve performance. The experiments achieved significant speedup ratios of 102.9 and 184.2 for MTFC and GR, respectively. Next, two multi-GPU strategies, i.e., cooperative processing (CP) and independent processing (IP), are proposed. The experimental results show that IP is the best option if the number of images to be processed is a multiple of the number of GPUs; otherwise, CP is the best choice. In addition, both the Intel Core i7 and the NVIDIA GTX 580 fully support the IEEE 754-2008 floating-point precision standard; hence, correctness of our GPU implementation can be fully guaranteed.
引用
收藏
页码:4952 / 4972
页数:21
相关论文
共 41 条
[1]  
[Anonymous], 2012, NVIDIA CUDA C programming guide
[2]  
[Anonymous], 2012, CUDA CUFFT US GUID V
[3]  
[Anonymous], 2012, NVIDIAS WHIT PAP NEX
[4]  
[Anonymous], 2012, CUDA C BEST PRACT GU
[5]  
Binstock A., 2010, THREADING MODELS HIG
[6]  
Cai X. M., 2007, THESIS NANJING U SCI
[7]   Remote Sensing Processing: From Multicore to GPU [J].
Christophe, Emmanuel ;
Michel, Julien ;
Inglada, Jordi .
IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2011, 4 (03) :643-652
[8]  
Corbley K. P., 2009, SOLID PLAN CONTINUIT
[9]  
Dai C., 2011, P INT S IM DAT FUS, P1, DOI [10.1109/isidf.2011.6024247, DOI 10.1109/ISIDF.2011.6024247]
[10]  
Fang L. Y., 2011, INT S IM DAT FUS TEN