DLFaaS: Serverless Platform for Data-Intensive Tasks Based on Interval Access Patterns

Cited by: 0
Authors
Cao, Yang [1]
Song, Wenbin [1]
Wu, Hanqian [2, 3]
Yuan, Shengchao [1]
Affiliations
[1] Southeast Univ, Sch Comp Sci & Engn, Nanjing, Peoples R China
[2] Southeast Univ, Sch Cyber Sci & Engn, Nanjing, Peoples R China
[3] Southeast Univ, Key Lab Comp Network & Informat Integrat, Minist Educ, Nanjing, Peoples R China
Source
2023 IEEE INTERNATIONAL CONFERENCES ON INTERNET OF THINGS, ITHINGS IEEE GREEN COMPUTING AND COMMUNICATIONS, GREENCOM IEEE CYBER, PHYSICAL AND SOCIAL COMPUTING, CPSCOM IEEE SMART DATA, SMARTDATA AND IEEE CONGRESS ON CYBERMATICS, CYBERMATICS | 2024
Keywords
cloud computing; serverless; data-intensive; cache
DOI
10.1109/iThings-GreenCom-CPSCom-SmartData-Cybermatics60724.2023.00121
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Serverless architecture is a novel paradigm in cloud computing. Applications are increasingly moving to serverless platforms to obtain pay-as-you-go billing and elastic scalability. However, data-intensive functions can lose performance on serverless architectures, because frequent data transfers incur significant data-movement overhead. Current serverless systems do not optimize data migration time for tasks built from data-intensive functions. We analyzed serverless architecture, function invocation patterns, and latency for data-intensive functions. We found that, because the platform is unaware of these functions' access patterns, cache eviction is inefficient and cache hit rates are low. In addition, bursty calls cause repeated remote storage accesses, which adds time overhead to data-intensive tasks. We therefore propose a serverless platform that is based on function invocation patterns and optimized for data-intensive tasks. Specifically, we design a two-tier caching queue in which data objects that are read or written multiple times are cached according to their historical access patterns and placed in different cache lists according to their access time intervals. An eviction strategy based on function access intervals improves the overall hit rate. Compared with popular existing solutions, our approach reduces latency by 32.1% and improves the cache hit rate by 12%.
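The abstract does not give implementation details, so the following is only a minimal sketch of what an interval-aware two-tier cache could look like, not the authors' DLFaaS design. The class name `IntervalCache`, the interval thresholds in `bucket_bounds`, and the eviction order (single-access objects first, then the longest-interval list) are all assumptions made for illustration.

```python
from collections import OrderedDict


class IntervalCache:
    """Toy two-tier cache (hypothetical, not the DLFaaS implementation).

    Tier 1 ("probation") holds objects accessed once.
    Tier 2 holds re-accessed objects, split into lists by the time
    interval observed between their last two accesses.
    """

    def __init__(self, capacity, bucket_bounds=(10.0, 60.0)):
        self.capacity = capacity
        # Assumed interval thresholds in seconds; the paper's values are unknown.
        self.bucket_bounds = bucket_bounds
        self.probation = OrderedDict()  # key -> (value, last_access_time)
        # One list per interval class, plus one for anything longer.
        self.buckets = [OrderedDict() for _ in range(len(bucket_bounds) + 1)]

    def __len__(self):
        return len(self.probation) + sum(len(b) for b in self.buckets)

    def _bucket_for(self, interval):
        # Index of the first interval class whose bound covers `interval`.
        for i, bound in enumerate(self.bucket_bounds):
            if interval <= bound:
                return i
        return len(self.bucket_bounds)

    def _find(self, key):
        # Return the tier currently holding `key`, or None on a miss.
        for tier in (self.probation, *self.buckets):
            if key in tier:
                return tier
        return None

    def _promote(self, tier, key, value, now):
        # Re-file the object under the interval since its previous access.
        _, last = tier.pop(key)
        self.buckets[self._bucket_for(now - last)][key] = (value, now)

    def _evict(self):
        # Assumed policy: drop single-access objects first, then the least
        # recently used object from the longest-interval list.
        if self.probation:
            self.probation.popitem(last=False)
            return
        for bucket in reversed(self.buckets):
            if bucket:
                bucket.popitem(last=False)
                return

    def get(self, key, now):
        tier = self._find(key)
        if tier is None:
            return None  # miss: caller would fetch from remote storage
        value = tier[key][0]
        self._promote(tier, key, value, now)
        return value

    def put(self, key, value, now):
        tier = self._find(key)
        if tier is not None:
            self._promote(tier, key, value, now)
            return
        while len(self) >= self.capacity:
            self._evict()
        self.probation[key] = (value, now)


if __name__ == "__main__":
    cache = IntervalCache(capacity=2)
    cache.put("a", 1, now=0.0)
    cache.put("b", 2, now=1.0)
    cache.get("a", now=5.0)      # "a" is re-accessed after 5 s
    cache.put("c", 3, now=6.0)   # cache is full, so "b" is evicted
    print(cache.get("b", now=7.0), cache.get("a", now=8.0))
```

In this sketch, objects that are re-accessed at short intervals survive longest, which is one plausible reading of "an eviction strategy based on function access intervals"; a real system would also need to handle object sizes and concurrent access.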
Pages: 675-680
Page count: 6