Stackless KD-Tree traversal for high performance GPU ray tracing

被引:115
作者
Popov, Stefan [1 ]
Guenther, Johannes
Seidel, Hans-Peter
Slusallek, Philipp
机构
[1] Univ Saarland, D-6600 Saarbrucken, Germany
[2] MPI Informat, Saarbrucken, Germany
关键词
D O I
10.1111/j.1467-8659.2007.01064.x
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Significant advances have been achieved for realtime ray tracing recently, but realtime performance for complex scenes still requires large computational resources not yet available from the CPUs in standard PCs. Incidentally, most of these PCs also contain modern GPUs that do offer much larger raw compute power However limitations in the programming and memory model have so far kept the performance of GPU ray tracers well below that of their CPU counterparts. In this paper we present a novel packet ray traversal implementation that completely eliminates the need for maintaining a stack during kd-tree traversal and that reduces the number of traversal steps per ray. While CPUs benefit moderately from the stackless approach, it improves GPU performance significantly. We achieve a peak performance of over 16 million rays per second,for reasonably complex scenes, including complex shading and secondary rays. Several examples show that with this new technique GPUs can actually outperform equivalent CPU based ray tracers.
引用
收藏
页码:415 / 424
页数:10
相关论文
共 34 条
[11]  
Friedrich H., 2006, P SANDB S 2006 ACM S, V1, P41
[12]  
Havran V., 1997, Journal of Graphics Tools, V2, P15, DOI 10.1080/10867651.1997.10487481
[13]  
Horn DR, 2007, I3D 2007: ACM SIGGRAPH SYMPOSIUM ON INTERACTIVE 3D GRAPHICS AND GAMES, PROCEEDINGS, P167
[14]  
HOUSTON M, SUPERCOMPUTING 2006
[15]  
KAPLAN MR, 1985, COMPUTER GRAPHICS, P149
[16]  
KARLSSON F, 2004, THESIS CHALMERS U TE
[17]  
MacDonald J. D., 1989, Proceedings. Graphics Interface'89, P152
[18]  
*NVIDIA, CUDA HOM
[19]  
Pabst HF, 2006, RT 06: IEEE SYMPOSIUM ON INTERACTIVE RAY TRACING 2006, PROCEEDINGS, P151
[20]  
Purcell T. J., 2004, THESIS STANFORD U