Creating and preserving locality of Java']Java applications at allocation and garbage collection times

被引:23
作者
Shuf, Y
Gupta, M
Franke, H
Appel, A
Singh, JP
机构
[1] IBM Corp, Thomas J Watson Res Ctr, Yorktown Hts, NY 10598 USA
[2] Princeton Univ, Dept Comp Sci, Princeton, NJ 08544 USA
关键词
languages; algorithms; performance; experimentation; !text type='Java']Java[!/text; JVM; locality; memory management; run-time systems; memory allocation; garbage collection; object co-allocation; object placement; heap traversal; locality based graph traversal; prolific types;
D O I
10.1145/583854.582422
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
The growing gap between processor and memory speeds is motivating the need for optimization strategies that improve data locality. A major challenge is to devise techniques suitable for pointer-intensive applications. This paper presents two techniques aimed at improving the memory behavior of pointer-intensive applications with dynamic memory allocation, such as those written in Java. First, we present an allocation time object placement technique based on the recently introduced notion of prolific (frequently instantiated) types. We attempt to co-locate, at allocation time, objects of prolific types that are connected via object references. Then, we present a novel locality based graph traversal technique. The benefits of this technique, when applied to garbage collection (GC), are twofold: (i) it improves the performance of GC due to better locality during a heap traversal and (ii) it restructures surviving objects in a way that enhances locality. On multiprocessors, this technique can further reduce overhead due to synchronization and false sharing. The experimental results, on a well-known suite of Java benchmarks (SPECjvm98 [26], SPECjbb2000 [27], and jOlden [4]), from an implementation of these techniques in the Jikes RVM [1], are very encouraging. The object co-allocation technique improves application performance by up to 21% (10% on average) in the Jikes RVM configured with a non-copying mark-and-sweep collector. The locality-based traversal technique reduces GC times by up to 20% (10% on average) and improves the performance of applications by up to 14% (6% on average) in the Jikes RVM configured with a copying semi-space collector. Both techniques combined can improve application performance by up to 22% (10% on average) in the Jikes RVM configured with a non-copying mark-and-sweep collector.
引用
收藏
页码:13 / 25
页数:13
相关论文
共 31 条