A parallel PDE-based numerical algorithm for computing the Optical Flow in hybrid systems

被引:8
作者
Cuomo, Salvatore [1 ]
De Michele, Pasquale [1 ]
Galletti, Ardelio [2 ]
Marcellino, Livia [2 ]
机构
[1] Univ Naples Federico II, Dept Math & Applicat, Str Vicinale Cupa Cintia 21, I-80126 Naples, Italy
[2] Univ Naples Parthenope, Dept Sci & Technol, Ctr Direz Isola C3, I-80143 Naples, Italy
关键词
Optical Flow; Partial differential equations; High performance computing; Graphic processor units; Hybrid architectures;
D O I
10.1016/j.jocs.2017.03.011
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
In this paper, we propose a fine-to-coarse parallelization strategy in order to exploit, in a case study, a parallel hybrid architecture. We consider the Optical Flow numerical problem, modelled by partial differential equations, and implement a parallel multilevel software. Our hybrid software solution is a smart combination between codes on Graphic Processor Units (GPUs) and standard scientific parallel computing libraries on a cluster. Numerical experiments, on real satellite image sequences coming from a large dataset in a big data scenario, together with application profiling, highlight good results in terms of performance for the proposed approach. (C) 2017 Elsevier B.V. All rights reserved.
引用
收藏
页码:228 / 236
页数:9
相关论文
共 36 条
[1]  
Agullo E., 2014, TECH REP
[2]  
ANDERSON E., 1999, LAPACK USERSGUIDE, V3rd
[3]  
[Anonymous], 2014, APPL MATH SCI, DOI DOI 10.12988/AMS.2014.49686
[4]  
Balay S, 1997, MODERN SOFTWARE TOOLS FOR SCIENTIFIC COMPUTING, P163
[5]   Exploiting product forms solution techniques in multiformalism modeling [J].
Barbierato, Enrico ;
Dei Rossi, Gian-Luca ;
Gribaudo, Marco ;
Iacono, Mauro ;
Marin, Andrea .
ELECTRONIC NOTES IN THEORETICAL COMPUTER SCIENCE, 2013, 296 :61-77
[6]  
Barron J. L., 1992, Proceedings. 1992 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (Cat. No.92CH3168-2), P236, DOI 10.1109/CVPR.1992.223269
[7]  
Bell N., 2012, CUSP: Generic parallel algorithms for sparse matrix and graph computations
[8]  
Bell N., 2015, THRUST PARALLEL TEMP
[9]  
Blackford L., 1997, ScaLAPACK Users Guide
[10]   A comparison of eleven static heuristics for mapping a class of independent tasks onto heterogeneous distributed computing systems [J].
Braun, TD ;
Siegel, HJ ;
Beck, N ;
Bölöni, LL ;
Maheswaran, M ;
Reuther, AI ;
Robertson, JP ;
Theys, MD ;
Yao, B ;
Hensgen, D ;
Freund, RF .
JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 2001, 61 (06) :810-837