Improving Storage Systems Using Machine Learning

Cited by: 4
Authors
Akgun, Ibrahim Umit [1]
Aydin, Ali Selman [1]
Burford, Andrew [1]
McNeill, Michael [1]
Arkhangelskiy, Michael [1]
Zadok, Erez [1]
Affiliations
[1] SUNY Stony Brook, Dept Comp Sci, New Comp Sci Bldg, Stony Brook, NY 11794 USA
Keywords
Operating systems; storage systems; machine learning; storage performance optimization; file; framework
DOI
10.1145/3568429
Chinese Library Classification (CLC)
TP3 [Computing technology, computer technology]
Discipline classification code
0812
Abstract
Operating systems include many heuristic algorithms designed to improve overall storage performance and throughput. Because such heuristics cannot work well for all conditions and workloads, system designers resorted to exposing numerous tunable parameters to users, thus burdening users with continually optimizing their own storage systems and applications. Storage systems are usually responsible for most latency in I/O-heavy applications, so even a small latency improvement can be significant. Machine learning (ML) techniques promise to learn patterns, generalize from them, and enable optimal solutions that adapt to changing workloads. We propose that ML solutions become a first-class component in OSs and replace manual heuristics to optimize storage systems dynamically. In this article, we describe our proposed ML architecture, called KML. We developed a prototype KML architecture and applied it to two case studies: optimizing readahead and NFS read-size values. Our experiments show that KML consumes less than 4 KB of dynamic kernel memory, has a CPU overhead smaller than 0.2%, and yet can learn patterns and improve I/O throughput by as much as 2.3x and 15x for the two case studies, respectively, even for complex, never-seen-before, concurrently running mixed workloads on different storage devices.
Pages: 30