Dynamically Adjusting Core Frequencies to Accelerate Time Warp Simulations in Many-Core Processors

被引：6

作者：

Kunz, Georg ^{[1
]}

Schemmel, Daniel ^{[2
]}

Gross, James ^{[2
]}

Wehrle, Klaus ^{[1
]}

机构：

[1] Rhein Westfal TH Aachen, Aachen, Germany

[2] Rhein Westfal TH Aachen, Mobile Network Performance Grp, Aachen, Germany

来源：

2012 ACM/IEEE/SCS 26TH WORKSHOP ON PRINCIPLES OF ADVANCED AND DISTRIBUTED SIMULATION (PADS) | 2012年

关键词：

parallel simulation; time warp synchronization; many-core processors; run time tuning;

D O I：

10.1109/PADS.2012.15

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

Developing complex technical systems requires a systematic exploration of the given design space in order to identify optimal system configurations. However, studying the effects and interactions of even a small number of system parameters often requires an extensive number of simulation runs. This in turn results in excessive runtime demands which severely hamper thorough design space explorations. In this paper, we present a parallel discrete event simulation scheme that enables cost-and time-efficient execution of large scale parameter studies on GPUs. In order to efficiently accommodate the stream-processing paradigm of GPUs, our parallelization scheme exploits two orthogonal levels of parallelism: External parallelism among the inherently independent simulations of a parameter study and internal parallelism among independent events within each individual simulation of a parameter study. Specifically, we design an event aggregation strategy based on external parallelism that generates workloads suitable for GPUs. In addition, we define a pipelined event execution mechanism based on internal parallelism to hide the transfer latencies between host-and GPU-memory. We analyze the performance characteristics of our parallelization scheme by means of a prototype implementation and show a 25-fold performance improvement over purely CPU-based execution.

引用

页码：23 / 32

页数：10

共 26 条

[1]

[Anonymous], 2006, P 2006 WORKSH NS 3 P, DOI DOI 10.1145/1190455.1190468

[2]

[Anonymous], NVIDIAS NEXT GEN CUD

[3]

Bauer D. W., 2008, P 40 WINT SIM C

[4]

Chatterjee D., 2009, P 46 ACM IEEE DES AU

[5]

FUJIMOTO RM, 1990, COMMUNICATIONS ACM, V33

[6]

FUJIMOTO RM, 1999, P 13 WORKSH PAR DIST

[7]

Han S., 2010, P ACM SIGCOMM C

[8] Cloning: A novel method for interactive parallel simulation [J].

Hybinette, M ;

Fujimoto, R .

PROCEEDINGS OF THE 1997 WINTER SIMULATION CONFERENCE, 1997, :444-451

[9]

Hybinette M., 2001, ACM Transactions on Modeling and Computer Simulation, V11, P378, DOI DOI 10.1145/508366.508370

[10]

Jain R., 1991, ART COMPUTER SYSTEMS, V182

← 1 2 3 →