DLFaaS: Serverless Platform for Data-Intensive Tasks Based on Interval Access Patterns

Citations: 0
Authors
Cao, Yang [1]
Song, Wenbin [1]
Wu, Hanqian [2, 3]
Yuan, Shengchao [1]
Affiliations
[1] Southeast Univ, Sch Comp Sci & Engn, Nanjing, Peoples R China
[2] Southeast Univ, Sch Cyber Sci & Engn, Nanjing, Peoples R China
[3] Southeast Univ, Key Lab Comp Network & Informat Integrat, Minist Educ, Nanjing, Peoples R China
Source
2023 IEEE INTERNATIONAL CONFERENCES ON INTERNET OF THINGS, ITHINGS IEEE GREEN COMPUTING AND COMMUNICATIONS, GREENCOM IEEE CYBER, PHYSICAL AND SOCIAL COMPUTING, CPSCOM IEEE SMART DATA, SMARTDATA AND IEEE CONGRESS ON CYBERMATICS, CYBERMATICS | 2024
Keywords
cloud computing; serverless; data-intensive; cache;
DOI
10.1109/iThings-GreenCom-CPSCom-SmartData-Cybermatics60724.2023.00121
Chinese Library Classification
TP18 [Artificial intelligence theory];
Discipline classification codes
081104; 0812; 0835; 1405;
Abstract
Serverless architecture is a novel paradigm in cloud computing. Applications are increasingly moving to serverless platforms to obtain on-demand billing and elastic scalability. For data-intensive functions, however, serverless architectures can lose performance because frequent data transfers add significant time overhead for data movement. Current serverless systems do not optimize data migration time for tasks built from data-intensive functions. We analyzed serverless architecture, function invocation patterns, and latency for data-intensive functions. We found that, because the cache is unaware of these functions' access patterns, cache eviction is inefficient and hit rates are low. In addition, bursty calls trigger repeated accesses to remote storage, adding time overhead to data-intensive tasks. We therefore propose a serverless platform, based on function invocation patterns, that is optimized for data-intensive tasks. Specifically, we design a two-tier caching queue in which data objects that are read or written multiple times are cached according to their historical access patterns and placed in different cache lists according to their access time intervals. An eviction strategy based on function access intervals improves the overall hit rate. Compared with popular existing solutions, the platform reduces latency by 32.1% and improves the cache hit rate by 12%.
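The abstract gives no implementation details, so the following Python is only a minimal sketch of how a two-tier, interval-aware cache of this general kind might look. Every name in it (`IntervalCache`, `Entry`, `short_threshold`) and every policy choice (a probation tier for objects seen once, short- and long-interval lists for re-accessed objects, evicting the object whose predicted next access is farthest away) is an assumption made for illustration, not the DLFaaS design.

```python
from dataclasses import dataclass
from typing import Dict, Optional


@dataclass
class Entry:
    value: bytes
    last_access: float
    interval: Optional[float] = None  # gap between the two most recent accesses


class IntervalCache:
    """Hypothetical two-tier cache: objects seen once sit in a probation tier;
    re-accessed objects move to a short- or long-interval list."""

    def __init__(self, capacity: int, short_threshold: float = 10.0):
        if capacity < 1:
            raise ValueError("capacity must be at least 1")
        self.capacity = capacity
        self.short_threshold = short_threshold  # assumed cut-off, in seconds
        self.probation: Dict[str, Entry] = {}
        self.short: Dict[str, Entry] = {}
        self.long: Dict[str, Entry] = {}

    def __len__(self) -> int:
        return len(self.probation) + len(self.short) + len(self.long)

    def _find(self, key: str) -> Optional[Dict[str, Entry]]:
        for tier in (self.probation, self.short, self.long):
            if key in tier:
                return tier
        return None

    def get(self, key: str, now: float) -> Optional[bytes]:
        tier = self._find(key)
        if tier is None:
            return None  # miss: the caller would fetch from remote storage
        entry = tier.pop(key)
        entry.interval = now - entry.last_access
        entry.last_access = now
        # Re-file the object by its most recently observed access interval.
        target = self.short if entry.interval <= self.short_threshold else self.long
        target[key] = entry
        return entry.value

    def put(self, key: str, value: bytes, now: float) -> None:
        tier = self._find(key)
        if tier is not None:
            tier[key].value = value  # overwrite in place; interval is unchanged
            return
        if len(self) >= self.capacity:
            self._evict()
        self.probation[key] = Entry(value, now)

    def _evict(self) -> None:
        # Objects seen only once go first, oldest first.
        if self.probation:
            victim = min(self.probation, key=lambda k: self.probation[k].last_access)
            del self.probation[victim]
            return
        # Otherwise drop the object whose predicted next access
        # (last_access + interval) lies farthest in the future.
        candidates = [
            (e.last_access + e.interval, k, tier)
            for tier in (self.short, self.long)
            for k, e in tier.items()
        ]
        _, key, tier = max(candidates, key=lambda c: c[0])
        del tier[key]
```

In this sketch the caller passes timestamps explicitly (`now`), which keeps the behaviour deterministic; a real platform would more likely read a clock and learn intervals per function rather than per object.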
Pages: 675-680
Page count: 6