Scheduling in Heterogeneous Computing Environments for Proximity Queries

被引:5
作者
Kim, Duksu [1 ]
Lee, Jinkyu [2 ]
Lee, Junghwan [1 ]
Shin, Insik [1 ]
Kim, John [1 ]
Yoon, Sung-Eui [1 ]
机构
[1] Korea Adv Inst Sci & Technol, Dept Comp Sci, Taejon 305701, Chungcheongnam, South Korea
[2] Univ Michigan, Dept Elect Engn & Comp Sci, Ann Arbor, MI 48109 USA
关键词
Heterogeneous system; proximity query; scheduling; collision detection; ray tracing; motion planning; GPU; APPROXIMATION; ALGORITHMS; STANDARD;
D O I
10.1109/TVCG.2013.71
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
We present a novel, linear programming (LP)-based scheduling algorithm that exploits heterogeneous multicore architectures such as CPUs and GPUs to accelerate a wide variety of proximity queries. To represent complicated performance relationships between heterogeneous architectures and different computations of proximity queries, we propose a simple, yet accurate model that measures the expected running time of these computations. Based on this model, we formulate an optimization problem that minimizes the largest time spent on computing resources, and propose a novel, iterative LP-based scheduling algorithm. Since our method is general, we are able to apply our method into various proximity queries used in five different applications that have different characteristics. Our method achieves an order of magnitude performance improvement by using four different GPUs and two hexa-core CPUs over using a hexa-core CPU only. Unlike prior scheduling methods, our method continually improves the performance, as we add more computing resources. Also, our method achieves much higher performance improvement compared with prior methods as heterogeneity of computing resources is increased. Moreover, for one of tested applications, our method achieves even higher performance than a prior parallel method optimized manually for the application. We also show that our method provides results that are close (e.g., 75 percent) to the performance provided by a conservative upper bound of the ideal throughput. These results demonstrate the efficiency and robustness of our algorithm that have not been achieved by prior methods. In addition, we integrate one of our contributions with a work stealing method. Our version of the work stealing method achieves 18 percent performance improvement on average over the original work stealing method. This result shows wide applicability of our approach.
引用
收藏
页码:1513 / 1525
页数:13
相关论文
共 39 条
[1]   Linear Programming-Based Affinity Scheduling of Independent Tasks on Heterogeneous Computing Systems [J].
Al-Azzoni, Issam ;
Down, Douglas G. .
IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2008, 19 (12) :1671-1682
[2]  
[Anonymous], 2010, P IPDPS
[3]  
[Anonymous], 2007, P 18 EUROGRAPHICS C
[4]  
[Anonymous], 2006, Planning algorithms
[5]  
[Anonymous], 1998, COMBINATORIAL OPTIMI
[6]  
[Anonymous], 2012, Scheduling
[7]  
[Anonymous], 2013, OMPSS PROGRAMMING MO
[8]   StarPU: a unified platform for task scheduling on heterogeneous multicore architectures [J].
Augonnet, Cedric ;
Thibault, Samuel ;
Namyst, Raymond ;
Wacrenier, Pierre-Andre .
CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2011, 23 (02) :187-198
[9]   Scheduling multithreaded computations by work stealing [J].
Blumofe, RD ;
Leiserson, CE .
JOURNAL OF THE ACM, 1999, 46 (05) :720-748
[10]  
BLUMOFE RD, 1995, SIGPLAN NOTICES, V30, P207