DLFaaS:Serverless Platform for Data-Intensive Tasks Based on Interval Access Patterns

被引:0
作者
Cao, Yang [1 ]
Song, Wenbin [1 ]
Wu, Hanqian [2 ,3 ]
Yuan, Shengchao [1 ]
机构
[1] Southeast Univ, Sch Comp Sci & Engn, Nanjing, Peoples R China
[2] Southeast Univ, Sch Cyber Sci & Engn, Nanjing, Peoples R China
[3] Southeast Univ, Key Lab Comp Network & Informat Integrat, Minist Educ, Nanjing, Peoples R China
来源
2023 IEEE INTERNATIONAL CONFERENCES ON INTERNET OF THINGS, ITHINGS IEEE GREEN COMPUTING AND COMMUNICATIONS, GREENCOM IEEE CYBER, PHYSICAL AND SOCIAL COMPUTING, CPSCOM IEEE SMART DATA, SMARTDATA AND IEEE CONGRESS ON CYBERMATICS,CYBERMATICS | 2024年
关键词
cloud computing; serverless; data-intensive; cache;
D O I
10.1109/iThings-GreenCom-CPSCom-SmartData-Cybermatics60724.2023.00121
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Serverless architecture is a novel paradigm in cloud computing.The increasing trend in applications is shifting towards serverless platforms to achieve on-demand payment and elastic scalability. However, for data-intensive functions, using serverless architectures can result in performance losses due to frequent data transfers, incurring significant time overhead for data movement. Current serverless systems lack optimization for data migration times in tasks involving data-intensive functions. We conducted an analysis of serverless architecture, function invocation patterns, and latency for data-intensive functions. We found that the lack of awareness of access patterns for these functions leads to inefficient cache eviction, resulting in low cache hit rates. Additionally, bursty calls result in repeated remote storage accesses, contributing to time overhead in data-intensive tasks. Therefore, we proposed a serverless platform based on function invocation patterns, optimized for data-intensive tasks. Specifically, we designed a two-tier caching queue, where data objects accessed multiple times for read and write operations were cached based on historical access patterns. These objects were placed in different cache lists according to various time intervals. We employed an eviction strategy based on function access intervals to enhance overall hit rates. Compared to popular existing solutions, we reduced time latency by 32.1% and improved cache hit rates by 12%.
引用
收藏
页码:675 / 680
页数:6
相关论文
共 27 条
  • [21] Improving Performance on Data-Intensive Applications Using a Load Balancing Methodology Based on Divisible Load Theory
    Rosas, Claudia
    Sikora, Anna
    Jorba, Josep
    Moreno, Andreu
    Cesar, Eduardo
    INTERNATIONAL JOURNAL OF PARALLEL PROGRAMMING, 2014, 42 (01) : 94 - 118
  • [22] Improving Performance on Data-Intensive Applications Using a Load Balancing Methodology Based on Divisible Load Theory
    Claudia Rosas
    Anna Sikora
    Josep Jorba
    Andreu Moreno
    Eduardo César
    International Journal of Parallel Programming, 2014, 42 : 94 - 118
  • [23] A Sensor Data Processing and Access Platform based on Hadoop for Smart Environments
    Lin, Chi-Yi
    Li, Chia-Chen
    Huang, Wei-Hsun
    Liao, Wei-Che
    Chen, Wei-Ming
    2014 17TH INTERNATIONAL CONFERENCE ON NETWORK-BASED INFORMATION SYSTEMS (NBIS 2014), 2014, : 453 - 458
  • [24] Cloud-based Data-intensive Framework towards Fault Diagnosis in Large-scale Petrochemical Plants
    Huo, Zhiqiang
    Mukherjee, Mithun
    Shu, Lei
    Chen, Yuanfang
    Zhou, Zhangbing
    2016 INTERNATIONAL WIRELESS COMMUNICATIONS AND MOBILE COMPUTING CONFERENCE (IWCMC), 2016, : 1080 - 1085
  • [25] CAPER 3.0: A Scalable Cloud-Based System for Data-Intensive Analysis of Chromosome-Centric Human Proteome Project Data Sets
    Yang, Shuai
    Zhang, Xinlei
    Diao, Lihong
    Guo, Feifei
    Wang, Dan
    Liu, Zhongyang
    Li, Honglei
    Zheng, Junjie
    Pan, Jingshan
    Nice, Edouard C.
    Li, Dong
    He, Fuchu
    JOURNAL OF PROTEOME RESEARCH, 2015, 14 (09) : 3720 - 3728
  • [26] Collaborative, Distributed, Scalable and Low-Cost Platform Based on Microservices, Containers, Mobile Devices and Cloud Services to Solve Compute-Intensive Tasks
    Petrocelli, David
    De Giusti, Armando
    Naiouf, Marcelo
    EURO-PAR 2021: PARALLEL PROCESSING WORKSHOPS, 2022, 13098 : 545 - 548
  • [27] A replication strategy for a distributed high-speed caching system based on spatiotemporal access patterns of geospatial data
    Li, Rui
    Feng, Wei
    Wu, Huayi
    Huang, Qunying
    COMPUTERS ENVIRONMENT AND URBAN SYSTEMS, 2017, 61 : 163 - 171