HARDLESS: A Generalized Serverless Compute Architecture for Hardware Processing Accelerators

被引:8
作者
Werner, Sebastian [1 ]
Schirmer, Trever [2 ,3 ]
机构
[1] TU Berlin, ISE, Berlin, Germany
[2] TU Berlin, MCC, Berlin, Germany
[3] TU Berlin, ECDF, Berlin, Germany
来源
2022 IEEE INTERNATIONAL CONFERENCE ON CLOUD ENGINEERING (IC2E 2022) | 2022年
关键词
serverless engineering; accelerated computing;
D O I
10.1109/IC2E55432.2022.00016
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The increasing use of hardware processing accelerators tailored for specific applications, such as the Vision Processing Unit (VPU) for image recognition, further increases developers' configuration, development, and management overhead. Developers have successfully used fully automated elastic cloud services such as serverless computing to counter these additional efforts and shorten development cycles for applications running on CPUs. Unfortunately, current cloud solutions do not yet provide these simplifications for applications that require hardware acceleration. However, as the development of specialized hardware acceleration continues to provide performance and cost improvements, it will become increasingly important to enable ease of use in the cloud. In this paper, we present an initial design and implementation of HARDLESS, an extensible and generalized serverless computing architecture that can support workloads for arbitrary hardware accelerators. We show how HARDLESS can scale across different commodity hardware accelerators and support a variety of workloads using the same execution and programming model common in serverless computing today.
引用
收藏
页码:79 / 84
页数:6
相关论文
共 18 条
[1]   BATCH: Machine Learning Inference Serving on Serverless Platforms with Adaptive Batching [J].
Ali, Ahsan ;
Pinciroli, Riccardo ;
Yan, Feng ;
Smirni, Evgenia .
PROCEEDINGS OF SC20: THE INTERNATIONAL CONFERENCE FOR HIGH PERFORMANCE COMPUTING, NETWORKING, STORAGE AND ANALYSIS (SC20), 2020,
[2]  
Bohr M., 2007, IEEE Solid-State Circuits Society Newsletter, V12, P11, DOI DOI 10.1109/N-SSC.2007.4785534
[3]  
Carreira J., 2018, WORKSHOP SYSTEMS ML
[4]   CIRRUS: a Serverless Framework for End-to-end ML Workflows [J].
Carreira, Joao ;
Fonseca, Pedro ;
Tumanov, Alexey ;
Zhang, Andrew ;
Katz, Randy .
PROCEEDINGS OF THE 2019 TENTH ACM SYMPOSIUM ON CLOUD COMPUTING (SOCC '19), 2019, :13-24
[5]   The Rise of Serverless Computing [J].
Castro, Paul ;
Ishakian, Vatche ;
Muthusamy, Vinod ;
Slominski, Aleksander .
COMMUNICATIONS OF THE ACM, 2019, 62 (12) :44-54
[6]  
Horovitz Shay, 2019, Economics of Grids, Clouds, Systems, and Services. 15th International Conference, GECON 2018. Proceedings: Lecture Notes in Computer Science (LNCS 11113), P171, DOI 10.1007/978-3-030-13342-9_15
[7]  
Joyner S., 2020, RIPPLE PRACTICAL DEC
[8]  
Kuhlenkamp J., 2019, PROC 34 ACMSIGAPP S, V19, P284
[9]  
Kuhlenkamp J, 2019, INT CONF UTIL CLOUD, P1, DOI 10.1145/3344341.3368796
[10]   The Ifs and Buts of Less is More: A Serverless Computing Reality Check [J].
Kuhlenkamp, Jorn ;
Werner, Sebastian ;
Tai, Stefan .
2020 IEEE INTERNATIONAL CONFERENCE ON CLOUD ENGINEERING (IC2E 2020), 2020, :154-161