Understanding the Design Trade-offs among Current Multicore Systems for Numerical Computations
Cited by: 0
Authors:
Kang, Seunghwa [1]
Bader, David A. [1]
Vuduc, Richard [1]
Affiliations:
[1] Georgia Inst Technol, Atlanta, GA 30332 USA
Source:
2009 IEEE INTERNATIONAL SYMPOSIUM ON PARALLEL & DISTRIBUTED PROCESSING, VOLS 1-5, 2009
Keywords:
PERFORMANCE;
GRAPHICS;
DOI:
Not available
Chinese Library Classification number:
TP301 [Theory, Methods];
Discipline classification code:
081202;
Abstract:
In this paper, we empirically evaluate fundamental design trade-offs among the most recent multicore processors and accelerator technologies. Our primary aim is to aid application designers in better mapping their software to the most suitable architecture, with an additional goal of influencing future computing system design. We specifically examine five architectures, based on: the Intel quad-core Harpertown processor, the AMD quad-core Barcelona processor, the Sony-Toshiba-IBM Cell Broadband Engine processors (both the first-generation chip and the second-generation PowerXCell 8i), and the NVIDIA Tesla C1060 GPU. We illustrate the software implementation process on each platform for a set of widely used kernels from computational statistics that are simple to reason about; measure and analyze the performance of each implementation; and discuss the impact of different architectural design choices on each implementation.