Resource Bundles: Using Aggregation for Statistical Large-Scale Resource Discovery and Management

被引:8
作者
Cardosa, Michael [1 ]
Chandra, Abhishek [1 ]
机构
[1] Univ Minnesota, Dept Comp Sci & Engn, Minneapolis, MN 55455 USA
基金
美国国家科学基金会;
关键词
Resource discovery; aggregation; resource management; machine learning;
D O I
10.1109/TPDS.2009.143
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Resource discovery is an important process for finding suitable nodes that satisfy application requirements in large loosely coupled distributed systems. Besides internode heterogeneity, many of these systems also show a high degree of intranode dynamism, so that selecting nodes based only on their recently observed resource capacities can lead to poor deployment decisions resulting in application failures or migration overheads. However, most existing resource discovery mechanisms rely mainly on recent observations to achieve scalability in large systems. In this paper, we propose the notion of a resource bundle-a representative resource usage distribution for a group of nodes with similar resource usage patterns-that employs two complementary techniques to overcome the limitations of existing techniques: resource usage histograms to provide statistical guarantees for resource capacities and clustering-based resource aggregation to achieve scalability. Using trace-driven simulations and data analysis of a month-long PlanetLab trace, we show that resource bundles are able to provide high accuracy for statistical resource discovery, while achieving high scalability. We also show that resource bundles are ideally suited for identifying group-level characteristics (e. g., hot spots, total group capacity). To automatically parameterize the bundling algorithm, we present an adaptive algorithm that can detect online fluctuations in resource heterogeneity.
引用
收藏
页码:1089 / 1102
页数:14
相关论文
共 50 条
[41]   Design of a resource advertisement and discovery protocol for large and dense MANETs [J].
Wang, Shun-Te ;
Wu, Jean-Lien C. ;
Hsu, Chun-Yen .
JOURNAL OF THE CHINESE INSTITUTE OF ENGINEERS, 2006, 29 (07) :1161-1171
[42]   THE CRITICAL CHALLENGE OF USING LARGE-SCALE DIGITAL EXPERIMENT PLATFORMS FOR SCIENTIFIC DISCOVERY [J].
Abbasi, Ahmed ;
Somanchi, Sriram ;
Kelley, Ken .
MIS Quarterly: Management Information Systems, 2025, 49 (01) :1-28
[43]   A simple resource advertisement and discovery protocol for large and dense MANETs [J].
Wu, JLC ;
Wang, ST ;
Hsu, CY .
ITRE 2005: 3RD INTERNATIONAL CONFERENCE ON INFORMATION TECHNOLOGY: RESEARCH AND EDUCATION, PROCEEDINGS, 2005, :18-22
[44]   AI-Driven Resource Management for Energy-Efficient Aerial Computing in Large-Scale Healthcare SDN-IoT Systems [J].
Lv, Jianhui ;
Babbar, Himanshi ;
Rani, Shalli .
IEEE INTERNET OF THINGS JOURNAL, 2025, 12 (13) :23536-23549
[45]   Middleware-level collaborative resource discovery for large clusters [J].
Al-Jaroodi, J ;
Mohamed, N .
2005 INTERNATIONAL SYMPOSIUM ON COLLABORATIVE TECHNOLOGIES AND SYSTEMS, PROCEEDINGS, 2005, :187-195
[46]   Large-Scale Nodes Classification With Deep Aggregation Network [J].
Li, Jiangtao ;
Wu, Jianshe ;
He, Weiquan ;
Zhou, Peng .
IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2021, 33 (06) :2560-2572
[47]   A Decentralized Approach for Resource Discovery using Metadata Replication in Edge Networks [J].
Murturi, Ilir ;
Dustdar, Schahram .
IEEE TRANSACTIONS ON SERVICES COMPUTING, 2022, 15 (05) :2526-2537
[48]   Resource Central: Understanding and Predicting Workloads for Improved Resource Management in Large Cloud Platforms [J].
Cortez, Eli ;
Bonde, Anand ;
Muzio, Alexandre ;
Russinovich, Mark ;
Fontoura, Marcus ;
Bianchini, Ricardo .
PROCEEDINGS OF THE TWENTY-SIXTH ACM SYMPOSIUM ON OPERATING SYSTEMS PRINCIPLES (SOSP '17), 2017, :153-167
[49]   Large-scale analysis of neuroimaging data on commercial clouds with content-aware resource allocation strategies [J].
Minervini, Massimo ;
Rusu, Cristian ;
Damiano, Mario ;
Tucci, Valter ;
Bifone, Angelo ;
Gozzi, Alessandro ;
Tsaftaris, Sotirios A. .
INTERNATIONAL JOURNAL OF HIGH PERFORMANCE COMPUTING APPLICATIONS, 2015, 29 (04) :473-488
[50]   SORMSYS: Towards a Resource Management Platform for Self-Organizing Large Scale Distributed Systems [J].
Pop, Florin .
12TH INTERNATIONAL SYMPOSIUM ON SYMBOLIC AND NUMERIC ALGORITHMS FOR SCIENTIFIC COMPUTING (SYNASC 2010), 2011, :534-541