Solving Large-Scale Granular Resource Allocation Problems Efficiently with POP

被引:25
|
作者
Narayanan, Deepak [1 ]
Kazhamiaka, Fiodar [1 ]
Abuzaid, Firas [1 ]
Kraft, Peter [1 ]
Agrawal, Akshay [1 ]
Kandula, Srikanth [2 ]
Boyd, Stephen [1 ]
Zaharia, Matei [1 ]
机构
[1] Stanford Univ, Stanford, CA 94305 USA
[2] Microsoft Res, Redmond, WA USA
关键词
Resource scheduling; optimization problems in computer systems; cluster scheduling; traffic engineering; load balancing;
D O I
10.1145/3477132.3483588
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Resource allocation problems in many computer systems can be formulated as mathematical optimization problems. However, finding exact solutions to these problems using off-the-shelf solvers is often intractable for large problem sizes with tight SLAs, leading system designers to rely on cheap, heuristic algorithms. We observe, however, that many allocation problems are granular: they consist of a large number of clients and resources, each client requests a small fraction of the total number of resources, and clients can interchangeably use different resources. For these problems, we propose an alternative approach that reuses the original optimization problem formulation and leads to better allocations than domain-specific heuristics. Our technique, Partitioned Optimization Problems (POP), randomly splits the problem into smaller problems (with a subset of the clients and resources in the system) and coalesces the resulting sub-allocations into a global allocation for all clients. We provide theoretical and empirical evidence as to why random partitioning works well. In our experiments, POP achieves allocations within 1.5% of the optimal with orders-of-magnitude improvements in runtime compared to existing systems for cluster scheduling, traffic engineering, and load balancing.
引用
收藏
页码:521 / 537
页数:17
相关论文
共 50 条
  • [31] Solving very large-scale structural optimization problems
    Huettner, F.
    Grosspietsch, M.
    AIAA JOURNAL, 2007, 45 (11) : 2729 - 2736
  • [32] Solving large-scale multicriteria problems by the decomposition method
    Ya. I. Rabinovich
    Computational Mathematics and Mathematical Physics, 2012, 52 : 60 - 74
  • [33] Efficiently solving large-scale electrostatic lightning problems by integrating characteristic basis functions into boundary element method
    Huang, Ruoyu
    Lai, Junan
    Yin, Wenjing
    Deng, Hong
    Ma, Xingyu
    Luo, Zhiyao
    He, Kun
    Chen, Weijiang
    Dong, Tianyu
    ELECTRIC POWER SYSTEMS RESEARCH, 2024, 235
  • [34] HRAS: Hybrid Resource Allocation System for Large-Scale Disasters
    Tsai, Rong-Guei
    Tsai, Pei-Hsuan
    PROCEEDINGS OF THE 2017 IEEE INTERNATIONAL CONFERENCE ON INFORMATION, COMMUNICATION AND ENGINEERING (IEEE-ICICE 2017), 2017, : 235 - 237
  • [35] Optimized resource allocation approach for large-scale public emergency
    Shenzhen Graduate School, Harbin Institute of Technology, Shenzhen 518055, China
    不详
    Liaoning Gongcheng Jishu Daxue Xuebao (Ziran Kexue Ban), 2007, SUPPL. 1 (256-258):
  • [36] An Approach to Robust Resource Allocation in Large-Scale Systems of Systems
    Kosak, Oliver
    Anders, Gerrit
    Siefert, Florian
    Reif, Wolfgang
    2015 IEEE NINTH INTERNATIONAL CONFERENCE ON SELF-ADAPTIVE AND SELF-ORGANIZING SYSTEMS - SASO 2015, 2015, : 1 - 10
  • [37] Resource Co-Allocation for Large-Scale Distributed Environments
    Castillo, Claris
    Rouskas, George N.
    Harfoush, Khaled
    HPDC'09: 18TH ACM INTERNATIONAL SYMPOSIUM ON HIGH PERFORMANCE DISTRIBUTED COMPUTING, 2009, : 131 - 140
  • [38] Video Management and Resource Allocation for a Large-Scale VoD Cloud
    Chang, Zhangyu
    Chan, S. -H. Gary
    ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2016, 12 (05)
  • [39] Resource allocation for energy efficient large-scale distributed systems
    Lee Y.C.
    Zomaya A.Y.
    Communications in Computer and Information Science, 2010, 54 : 16 - 19
  • [40] Resource Allocation for Energy Efficient Large-Scale Distributed Systems
    Lee, Young Choon
    Zomaya, Albert Y.
    INFORMATION SYSTEMS, TECHNOLOGY AND MANAGEMENT, PROCEEDINGS, 2010, 54 : 16 - 19