Revisiting I/O Behavior in Large-Scale Storage Systems: The Expected and the Unexpected

Cited by: 38
Authors
Patel, Tirthak [1]
Byna, Surendra [2]
Lockwood, Glenn K. [2]
Tiwari, Devesh [1]
Affiliations
[1] Northeastern Univ, Boston, MA 02115 USA
[2] Lawrence Berkeley Natl Lab, Berkeley, CA USA
Source
PROCEEDINGS OF SC19: THE INTERNATIONAL CONFERENCE FOR HIGH PERFORMANCE COMPUTING, NETWORKING, STORAGE AND ANALYSIS | 2019
Funding
National Science Foundation (USA);
DOI
10.1145/3295500.3356183
Chinese Library Classification number
TP301 [Theory, Methods];
Discipline classification code
081202;
Abstract
Large-scale applications typically spend a large fraction of their execution time performing I/O to a parallel storage system. However, with rapid progress in the compute and storage system stacks of large-scale systems, it is critical to investigate and update our understanding of the I/O behavior of large-scale applications. Toward that end, in this work, we monitor, collect, and analyze a year's worth of storage system data from a large-scale production parallel storage system. We perform temporal, spatial, and correlative analyses of the system and uncover surprising patterns that defy existing assumptions and have important implications for future systems.
Pages: 13
Related papers
59 in total
[41]   Best Practices and Lessons Learned from Deploying and Operating Large-Scale Data-Centric Parallel File Systems [J].
Oral, Sarp ;
Simmons, James ;
Hill, Jason ;
Leverman, Dustin ;
Wang, Feiyi ;
Ezell, Matt ;
Miller, Ross ;
Fuller, Douglas ;
Gunasekaran, Raghul ;
Kim, Youngjae ;
Gupta, Saurabh ;
Tiwari, Devesh ;
Vazhkudai, Sudharshan S. ;
Rogers, James H. ;
Dillow, David ;
Shipman, Galen M. ;
Bland, Arthur S. .
SC14: INTERNATIONAL CONFERENCE FOR HIGH PERFORMANCE COMPUTING, NETWORKING, STORAGE AND ANALYSIS, 2014, :217-228
[42]   Big Data Meets HPC Log Analytics: Scalable Approach to Understanding Systems at Extreme Scale [J].
Park, Byung H. ;
Hukerikar, Saurabh ;
Adamson, Ryan ;
Engelmann, Christian .
2017 IEEE INTERNATIONAL CONFERENCE ON CLUSTER COMPUTING (CLUSTER), 2017, :758-765
[43]   Park IY, 2017, 2017 IEEE INTERNATIONAL SYMPOSIUM ON ELECTROMAGNETIC COMPATIBILITY & SIGNAL/POWER INTEGRITY (EMCSI), P181, DOI 10.1109/ISEMC.2017.8077863
[44]   Ross R., 2019, TECHNICAL REPORT
[45]   Sigelman Benjamin H, 2010, Tech. Rep.
[46]   Sim Hyogi, 2015, High Performance Computing, Networking, Storage and Analysis, 2015 SC-International Conference for, P1
[47]   Snyder S, 2016, PROCEEDINGS OF ESPT 2016: 5TH WORKSHOP ON EXTREME-SCALE PROGRAMMING TOOLS, P9, DOI 10.1109/ESPT.2016.006
[48]   Toward Managing HPC Burst Buffers Effectively: Draining Strategy to Regulate Bursty I/O Behavior [J].
Tang, Kun ;
Huang, Ping ;
He, Xubin ;
Lu, Tao ;
Vazhkudai, Sudharshan S. ;
Tiwari, Devesh .
2017 IEEE 25TH INTERNATIONAL SYMPOSIUM ON MODELING, ANALYSIS, AND SIMULATION OF COMPUTER AND TELECOMMUNICATION SYSTEMS (MASCOTS), 2017, :87-98
[49]   Tiwari Devesh, 2013, FAST, P119
[50]   Vazhkudai S. S., 2017, P INT C HIGH PERF CO, P45