Graph-Based Data Deduplication in Mobile Edge Computing Environment

被引:7
作者
Luo, Ruikun [1 ,2 ]
Jin, Hai [1 ,2 ]
He, Qiang [3 ]
Wu, Song [1 ,2 ]
Zeng, Zilai [1 ,2 ]
Xia, Xiaoyu [4 ]
机构
[1] Huazhong Univ Sci & Technol, Natl Engn Res Ctr Big Data Technol, Wuhan, Peoples R China
[2] Huazhong Univ Sci & Technol, Syst Serv Comp Technol & Syst Lab, Cluster & Grid Comp Lab, Wuhan, Peoples R China
[3] Swinburne Univ Technol, Hawthorn, Vic, Australia
[4] Deakin Univ, Burwood, Australia
来源
SERVICE-ORIENTED COMPUTING (ICSOC 2021) | 2021年 / 13121卷
基金
美国国家科学基金会;
关键词
Mobile edge computing; Edge data storage; Data deduplication; Integer programming; Approximation algorithm;
D O I
10.1007/978-3-030-91431-8_31
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Mobile edge computing (MEC) extends cloud computing by deploying edge servers with computing and storage resources at base stations within users' geographic proximity. The networked edge servers in an area constitute an edge storage system (ESS), where edge servers cooperate to provide services for the users in the area. However, the potential of ESSs is challenged by edge servers' constrained storage resources due to their limited physical sizes. A straightforward method to tackle this challenge is to reduce data redundancy in the ESS. The unique characteristics and constraints in the MEC environment, e.g., edge servers' geographic coverage and distribution, render conventional data deduplication techniques designed for cloud storage systems obsolete. In this paper, we make the first attempt to study this novel Edge Data Deduplication (EDDE) problem. First, we model it as a constrained optimization problem with the aim to maximize data deduplication ratio under latency constraint by taking advantage of the collaboration between edge servers. Then, we prove that the EDDE problem is NP-hard and propose an approach named EDDE-O for solving the EDDE problem optimally based on integer programming. To accommodate large-scale EDDE scenarios, we propose a ln alpha+ 1-approximation algorithm, namely EDDE-A, to find sub-optimal EDDE solutions efficiently. The results of extensive experiments conducted on a widely-used dataset demonstrate that EDDE-O and EDDE-A can solve the EDDE problem effectively and efficiently, outperforming four representative approaches significantly.
引用
收藏
页码:499 / 515
页数:17
相关论文
共 20 条
[1]  
AWS, About us
[2]   Improved approximation algorithms for the uncapacitated facility location problem [J].
Chudak, FA ;
Shmoys, DB .
SIAM JOURNAL ON COMPUTING, 2003, 33 (01) :1-25
[3]  
Dubnicki Cezary., 2009, FAST 09 P 7 C FILE S, P197
[4]   A Game-Theoretical Approach for Mitigating Edge DDoS Attack [J].
He, Qiang ;
Wang, Cheng ;
Cui, Guangming ;
Li, Bo ;
Zhou, Rui ;
Zhou, Qingguo ;
Xiang, Yang ;
Jin, Hai ;
Yang, Yun .
IEEE TRANSACTIONS ON DEPENDABLE AND SECURE COMPUTING, 2022, 19 (04) :2333-2348
[5]   A Game-Theoretical Approach for User Allocation in Edge Computing Environment [J].
He, Qiang ;
Cui, Guangming ;
Zhang, Xuyun ;
Chen, Feifei ;
Deng, Shuiguang ;
Jin, Hai ;
Li, Yanhui ;
Yang, Yun .
IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2020, 31 (03) :515-529
[6]   Optimal Edge User Allocation in Edge Computing with Variable Sized Vector Bin Packing [J].
Lai, Phu ;
He, Qiang ;
Abdelrazek, Mohamed ;
Chen, Feifei ;
Hosking, John ;
Grundy, John ;
Yang, Yun .
SERVICE-ORIENTED COMPUTING (ICSOC 2018), 2018, 11236 :230-245
[7]  
Li SJ, 2020, IEEE INFOCOM SER, P247, DOI [10.1109/INFOCOM41043.2020.9155233, 10.1109/infocom41043.2020.9155233]
[8]   EF-dedup: Enabling Collaborative Data Deduplication at the Network Edge [J].
Li, Shijing ;
Lan, Tian ;
Balasubramanian, Bharath ;
Ra, Moo-Ryong ;
Lee, Hee Won ;
Panta, Rajesh .
2019 39TH IEEE INTERNATIONAL CONFERENCE ON DISTRIBUTED COMPUTING SYSTEMS (ICDCS 2019), 2019, :986-996
[9]   Lifecycle-Aware Online Video Caching [J].
Li, Tong ;
Braud, Tristan ;
Li, Yong ;
Hui, Pan .
IEEE TRANSACTIONS ON MOBILE COMPUTING, 2021, 20 (08) :2624-2636
[10]  
Meister Dirk, 2012, IEEE P INT C HIGH PE, P1, DOI 10.1109/SC.2012.14