On the Performance of the Spotify Backend

被引:0
作者
Rerngvit Yanggratoke
Gunnar Kreitz
Mikael Goldmann
Rolf Stadler
Viktoria Fodor
机构
[1] KTH Royal Institute of Technology,ACCESS Linnaeus Center
[2] Spotify AB,undefined
来源
Journal of Network and Systems Management | 2015年 / 23卷
关键词
Key-value store; Distributed object store; Object allocation policy; Performance modeling; Performance measurements; Response times;
D O I
暂无
中图分类号
学科分类号
摘要
We model and evaluate the performance of a distributed key-value storage system that is part of the Spotify backend. Spotify is an on-demand music streaming service, offering low-latency access to a library of over 20 million tracks and serving over 20 million users currently. We first present a simplified model of the Spotify storage architecture, in order to make its analysis feasible. We then introduce an analytical model for the distribution of the response time, a key metric in the Spotify service. We parameterize and validate the model using measurements from two different testbed configurations and from the operational Spotify infrastructure. We find that the model is accurate—measurements are within 11 % of predictions—within the range of normal load patterns. In addition, we model the capacity of the Spotify storage system under different object allocation policies and find that measurements on our testbed are within 9 % of the model predictions. The model helps us justify the object allocation policy adopted for Spotify storage system.
引用
收藏
页码:210 / 237
页数:27
相关论文
共 38 条
[1]  
Mosberger D.(1998)httperf—a tool for measuring web server performance SIGMETRICS Perform. Eval. Rev. 26 31-37
[2]  
Jin T.(1951)The Kolmogorov–Smirnov test for goodness of fit J. Am. Stat. Assoc. 46 68-78
[3]  
Massey F.J.(2004)Problems with fitting to the power-law distribution Eur. Phys. J. B 41 255-258
[4]  
Goldstein M.(2005)Approximating the distribution of pareto sums Pure Appl. Geophys. 162 1187-1228
[5]  
Morris S.(2011)Io performance prediction in consolidated virtualized environments SIGSOFT Softw. Eng. Notes 36 295-306
[6]  
Yen G.(2012)Performance analysis of cloud computing centers using m/g/m/m+r queuing systems IEEE Trans. Parallel Distrib. Syst. 23 936-943
[7]  
Zaliapin I.V.(2007)Dynamo: amazon’s highly available key-value store SIGOPS Oper. Syst. Rev. 41 205-220
[8]  
Kagan Y.Y.(2010)Cassandra: a decentralized structured storage system SIGOPS Oper. Syst. Rev. 44 35-40
[9]  
Schoenberg F.P.(2002)Hierarchical web caching systems: modeling, design and experimental results IEEE J. Sel. Areas Commun. 20 1305-1314
[10]  
Kraft S.(2004)On expiration-based hierarchical caching systems IEEE J. Sel. Areas Commun. 22 134-150