Cluman: Advanced cluster management for the large-scale infrastructures

被引:0
|
作者
Babik, Marian [1 ]
Fedorko, Ivan [1 ]
Rodrigues, David [1 ]
机构
[1] CERN, European Org Nucl Res, CH-1211 Geneva 23, Switzerland
关键词
D O I
10.1088/1742-6596/331/5/052002
中图分类号
O57 [原子核物理学、高能物理学];
学科分类号
070202 ;
摘要
The recent uptake of multi-core computing has produced a rapid growth of virtualisation and cloud computing services. With the increased use of the many-core processors this trend will likely accelerate and computing centres will be faced with the management of the tens of thousands of the virtual machines. Furthermore, these machines will likely be geographically distributed and need to be allocated on demand. In order to cope with such complexity we have designed and developed an advanced cluster management system that can execute administrative tasks targeting thousands of machines as well as provide an interactive high-density visualisation of the fabrics. The job management subsystem can perform complex tasks while following their progress and output and report aggregated information back to the system administrators. The visualisation subsystem can display tree maps of the infrastructure elements with data and monitoring information, thus providing a very detailed overview of the large clusters at a glance. The initial experience with development and testing of the system will be presented as well as an evaluation of its performance.
引用
收藏
页数:6
相关论文
共 50 条
  • [21] Large-scale cluster quantum microcombs
    Ze Wang
    Kangkang Li
    Yue Wang
    Xin Zhou
    Yinke Cheng
    Boxuan Jing
    Fengxiao Sun
    Jincheng Li
    Zhilin Li
    Bingyan Wu
    Qihuang Gong
    Qiongyi He
    Bei-Bei Li
    Qi-Fan Yang
    Light: Science & Applications, 14 (1)
  • [22] Large-Scale Spherical Silicon Solar Cell for Advanced Light Management
    El-Atab, Nazek
    Qaiser, Nadeem
    Bahabry, Rabab
    Hussain, Muhammad Mustafa
    2020 47TH IEEE PHOTOVOLTAIC SPECIALISTS CONFERENCE (PVSC), 2020, : 170 - 172
  • [23] Reflections on Collaborative Archaeology and Large-Scale Online Research Infrastructures
    Wright, Holly
    Richards, Julian D.
    JOURNAL OF FIELD ARCHAEOLOGY, 2018, 43 : S60 - S67
  • [24] Managing RFID events in large-scale distributed RFID infrastructures
    Dutta, Kaushik
    VanderMeer, Debra
    Ramamritham, Krithi
    INFORMATION TECHNOLOGY & MANAGEMENT, 2011, 12 (03): : 253 - 272
  • [25] Managing RFID events in large-scale distributed RFID infrastructures
    Kaushik Dutta
    Debra VanderMeer
    Krithi Ramamritham
    Information Technology and Management, 2011, 12 : 253 - 272
  • [26] A heuristic approach for the allocation of resources in large-scale computing infrastructures
    Lee, Kevin
    Buss, Georg
    Veit, Daniel
    CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2016, 28 (05): : 1527 - 1547
  • [27] Interactive visualization in large-scale, distributed computing infrastructures with GVK
    Heinzlreiter, P
    Kranzlmüller, D
    Kurka, G
    Volkert, J
    6TH WORLD MULTICONFERENCE ON SYSTEMICS, CYBERNETICS AND INFORMATICS, VOL IV, PROCEEDINGS: MOBILE/WIRELESS COMPUTING AND COMMUNICATION SYSTEMS I, 2002, : 348 - 352
  • [28] Organizational innovation of integrated design of infrastructures in large-scale park
    Wu, Dingyuan
    Gang, Feng
    Computer Modelling and New Technologies, 2014, 18 (12): : 983 - 988
  • [29] Motorway Preservation challenges for the heritage of Italian large-scale infrastructures
    Peron, Verdiana
    IN SITU-REVUE DE PATRIMOINES, 2023, (49):
  • [30] Adaptive System Anomaly Prediction for Large-Scale Hosting Infrastructures
    Tan, Yongmin
    Gu, Xiaohui
    Wang, Haixun
    PODC 2010: PROCEEDINGS OF THE 2010 ACM SYMPOSIUM ON PRINCIPLES OF DISTRIBUTED COMPUTING, 2010, : 173 - 182