Understanding and Improving Computational Science Storage Access through Continuous Characterization

Cited by: 143
Authors
Carns, Philip [1]
Harms, Kevin [2]
Allcock, William [2]
Bacon, Charles [2]
Lang, Samuel [1]
Latham, Robert [1]
Ross, Robert [1]
Affiliations
[1] Argonne Natl Lab, Div Math & Comp Sci, Argonne, IL 60439 USA
[2] Argonne Natl Lab, Argonne Leadership Comp Facil, Argonne, IL 60439 USA
Keywords
Measurement; Performance; I/O characterization; parallel file systems
DOI
10.1145/2027066.2027068
Chinese Library Classification number
TP3 [Computing technology, computer technology]
Discipline classification code
0812
Abstract
Computational science applications are driving a demand for increasingly powerful storage systems. While many techniques are available for capturing the I/O behavior of individual application trial runs and specific components of the storage system, continuous characterization of a production system remains a daunting challenge for systems with hundreds of thousands of compute cores and multiple petabytes of storage. As a result, these storage systems are often designed without a clear understanding of the diverse computational science workloads they will support. In this study, we outline a methodology for scalable, continuous, systemwide I/O characterization that combines storage device instrumentation, static file system analysis, and a new mechanism for capturing detailed application-level behavior. This methodology allows us to identify both system-wide trends and application-specific I/O strategies. We demonstrate the effectiveness of our methodology by performing a multilevel, two-month study of Intrepid, a 557-teraflop IBM Blue Gene/P system. During that time, we captured application-level I/O characterizations from 6,481 unique jobs spanning 38 science and engineering projects. We used the results of our study to tune example applications, highlight trends that impact the design of future storage systems, and identify opportunities for improvement in I/O characterization methodology.
Pages: 26