Memory Considerations for Low Energy Ray Tracing

被引:18
作者
Kopta, D. [1 ]
Shkurko, K. [1 ]
Spjut, J. [1 ,2 ,3 ]
Brunvand, E. [1 ]
Davis, A. [1 ]
机构
[1] Univ Utah, Sch Comp, Salt Lake City, UT 84112 USA
[2] NVIDIA, Santa Clara, CA USA
[3] Harvey Mudd Coll, Dept Engn, Claremont, CA 91711 USA
基金
美国国家科学基金会;
关键词
architecture for accelerated graphics computing; hardware; graphics hardware; ray casting; tracing hardware;
D O I
10.1111/cgf.12458
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
We propose two hardware mechanisms to decrease energy consumption on massively parallel graphics processors for ray tracing. First, we use a streaming data model and configure part of the L2 cache into a ray stream memory to enable efficient data processing through ray reordering. This increases L1 hit rates and reduces off-chip memory energy substantially through better management of off-chip memory access patterns. To evaluate this model, we augment our architectural simulator with a detailed memory system simulation that includes accurate control, timing and power models for memory controllers and off-chip dynamic random-access memory . These details change the results significantly over previous simulations that used a simpler model of off-chip memory, indicating that this type of memory system simulation is important for realistic simulations that involve external memory. Secondly, we employ reconfigurable special-purpose pipelines that are constructed dynamically under program control. These pipelines use shared execution units that can be configured to support the common compute kernels that are the foundation of the ray tracing algorithm. This reduces the overhead incurred by on-chip memory and register accesses. These two synergistic features yield a ray tracing architecture that reduces energy by optimizing both on-chip and off-chip memory activity when compared to a more traditional approach.
引用
收藏
页码:47 / 59
页数:13
相关论文
共 65 条
  • [51] Shevtsov Maxim, 2007, P GRAPHICON 2007 MOS
  • [52] Silicon Arts Coproration, 2013, RAYCORE SER 1000, V1000
  • [53] Silpa B. V. N., 2010, Proceedings of the 2010 International Symposium on Electronic System Design (ISED 2010), DOI 10.1109/ISED.2010.61
  • [54] Smits B., 1998, Journal of Graphics Tools, V3, P1, DOI 10.1080/10867651.1998.10487488
  • [55] TRaX: A Multicore Hardware Architecture for Real-Time Ray Tracing
    Spjut, Josef
    Kensler, Andrew
    Kopta, Daniel
    Brunvand, Erik
    [J]. IEEE TRANSACTIONS ON COMPUTER-AIDED DESIGN OF INTEGRATED CIRCUITS AND SYSTEMS, 2009, 28 (12) : 1802 - 1815
  • [56] Spjut Joseph, 2012, P 3 WORKSH SOCS HET
  • [57] Steinhurst J., 2005, Proceedings of Graphics Interface, P97
  • [58] Tsakok J.A., 2009, Proc. High Performance Graphics, P151
  • [59] Wald I, 2001, COMPUT GRAPH FORUM, V20, pC153, DOI 10.1111/1467-8659.00508
  • [60] Wald I., 2014, ACM SIGGRAP IN PRESS