Paraprox: Pattern-Based Approximation for Data Parallel Applications

被引:110
作者
Samadi, Mehrzad [1 ]
Jamshidi, Davoud Anoushe [1 ]
Lee, Janghaeng [1 ]
Mahlke, Scott [1 ]
机构
[1] Univ Michigan, Adv Comp Architecture Lab, Ann Arbor, MI 48109 USA
基金
美国国家科学基金会;
关键词
Approximation; Accuracy-aware computing; Data parallel; GPU; Design; Performance;
D O I
10.1145/2541940.2541948
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Approximate computing is an approach where reduced accuracy of results is traded off for increased speed, throughput, or both. Loss of accuracy is not permissible in all computing domains, but there are a growing number of dataintensive domains where the output of programs need not be perfectly correct to provide useful results or even noticeable differences to the end user. These soft domains include multimedia processing, machine learning, and data mining/analysis. An important challenge with approximate computing is transparency to insulate both software and hardware developers from the time, cost, and difficulty of using approximation. This paper proposes a software-only system, Paraprox, for realizing transparent approximation of dataparallel programs that operates on commodity hardware systems. Paraprox starts with a data-parallel kernel implemented using OpenCL or CUDA and creates a parameterized approximate kernel that is tuned at runtime to maximize performance subject to a target output quality (TOQ) that is supplied by the user. Approximate kernels are created by recognizing common computation idioms found in data-parallel programs (e.g., Map, Scatter/Gather, Reduction, Scan, Stencil, and Partition) and substituting approximate implementations in their place. Across a set of 13 soft data-parallel applications with at most 10% quality degradation, Paraprox yields an average performance gain of 2.7x on a NVIDIA GTX 560 GPU and 2.5x on an Intel Core i7 quad-core processor compared to accurate execution on each platform.
引用
收藏
页码:35 / 50
页数:16
相关论文
共 36 条
[31]  
Sartori J, 2012, INT CONFER PARA, P427
[32]   A statistical evaluation of recent full reference image quality assessment algorithms [J].
Sheikh, Hamid Rahim ;
Sabir, Muhammad Farooq ;
Bovik, Alan Conrad .
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2006, 15 (11) :3440-3451
[33]  
Temam O, 2012, CONF PROC INT SYMP C, P356, DOI 10.1109/ISCA.2012.6237031
[34]  
Venkataramani Swagath, 2013, 2013 46th Annual IEEE/ACM International Symposium on Microarchitecture (MICRO). Proceedings, P1, DOI 10.1145/2540708.2540710
[35]  
Wong Henry, 2010, 2010 IEEE International Symposium on Performance Analysis of Systems & Software (ISPASS 2010), P235, DOI 10.1109/ISPASS.2010.5452013
[36]  
Xiao-Long Wu, 2010, Proceedings of the 2010 IEEE 10th International Conference on Computer and Information Technology (CIT 2010), P1175, DOI 10.1109/CIT.2010.213