Investigating Machine Learning Algorithms for Modeling SSD I/O Performance for Container-Based Virtualization

被引:16
作者
Dartois, Jean-Emile [1 ,2 ]
Boukhobza, Jalil [1 ,3 ]
Knefati, Anas [1 ]
Barais, Olivier [1 ,2 ]
机构
[1] IRT B Com, F-35510 Cesson Sevigne, France
[2] Univ Rennes, IRISA, INRIA, CNRS, F-35000 Rennes, France
[3] Univ Bretagne Occidentale, F-29238 Brest, France
关键词
Cloud computing; performance and QoS; I/O interference; solid state drives; flash memory; container; machine learning; LOGISTIC-REGRESSION;
D O I
10.1109/TCC.2019.2898192
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
One of the cornerstones of the cloud provider business is to reduce hardware resources cost by maximizing their utilization. This is done through smartly sharing processor, memory, network and storage, while fully satisfying SLOs negotiated with customers. For the storage part, while SSDs are increasingly deployed in data centers mainly for their performance and energy efficiency, their internal mechanisms may cause a dramatic SLO violation. In effect, we measured that I/O interference may induce a 10x performance drop. We are building a framework based on autonomic computing which aims to achieve intelligent container placement on storage systems by preventing bad I/O interference scenarios. One prerequisite to such a framework is to design SSD performance models that take into account interactions between running processes/containers, the operating system and the SSD. These interactions are complex. In this paper, we investigate the use of machine learning for building such models in a container based Cloud environment. We have investigated five popular machine learning algorithms along with six different I/O intensive applications and benchmarks. We analyzed the prediction accuracy, the learning curve, the feature importance and the training time of the tested algorithms on four different SSD models. Beyond describing modeling component of our framework, this paper aims to provide insights for cloud providers to implement SLO compliant container placement algorithms on SSDs. Our machine learning-based framework succeeded in modeling I/O interference with a median Normalized Root-Mean-Square Error (NRMSE) of 2.5 percent.
引用
收藏
页码:1103 / 1116
页数:14
相关论文
共 75 条
[1]  
Agrawal Nitin, 2008, Performance Evaluation Review, V36, P52, DOI 10.1145/1453175.1453184
[2]  
Ahn Sungyong, 2016, P 8 USENIX WORKSH HO
[3]  
ALLEN B., 2018, SMARTMONTOOLS PROJEC
[4]  
Alpaydin E, 2014, ADAPT COMPUT MACH LE, P1
[5]  
[Anonymous], 2016, 8 SURPRISING FACTS R
[6]  
Axboe J., 2014, Fio-Flexible Io Tester
[7]  
Bellard F., 2012, FFMPEG
[8]   The PARSEC Benchmark Suite: Characterization and Architectural Implications [J].
Bienia, Christian ;
Kumar, Sanjeev ;
Singh, Jaswinder Pal ;
Li, Kai .
PACT'08: PROCEEDINGS OF THE SEVENTEENTH INTERNATIONAL CONFERENCE ON PARALLEL ARCHITECTURES AND COMPILATION TECHNIQUES, 2008, :72-81
[9]  
Binbin Zhang, 2010, Proceedings of the Fifth ChinaGrid Annual Conference (ChinaGrid 2010), P117, DOI 10.1109/ChinaGrid.2010.54
[10]   SmcHD1, containing a structural-maintenance-of-chromosomes hinge domain, has a critical role in X inactivation [J].
Blewitt, Marnie E. ;
Gendrel, Anne-Valerie ;
Pang, Zhenyi ;
Sparrow, Duncan B. ;
Whitelaw, Nadia ;
Craig, Jeffrey M. ;
Apedaile, Anwyn ;
Hilton, Douglas J. ;
Dunwoodie, Sally L. ;
Brockdorff, Neil ;
Kay, Graham F. ;
Whitelaw, Emma .
NATURE GENETICS, 2008, 40 (05) :663-669