A multi-feature-based intelligent redundancy elimination scheme for cloud-assisted health systems

Cited by: 0
Authors
Xiao, Ling [1, 2]
Zou, Beiji [1, 2]
Kui, Xiaoyan [1, 2]
Zhu, Chengzhang [1, 2, 3]
Zhang, Wensheng [4]
Yang, Xuebing [4]
Zhang, Bob [5]
Affiliations
[1] Cent South Univ, Sch Comp Sci & Engn, Changsha, Peoples R China
[2] Cent South Univ, Hunan Engn Res Ctr Machine Vis & Intelligent Med, Changsha, Peoples R China
[3] Cent South Univ, Coll Literature & Journalism, Changsha, Peoples R China
[4] Chinese Acad Sci, Inst Automat, Beijing, Peoples R China
[5] Univ Macau, Dept Comp & Informat Sci, Macau, Peoples R China
Funding
National Natural Science Foundation of China; National Key Research and Development Program of China;
Keywords
big data; cloud computing; compression; data compression; medical applications; performance evaluation; ONLINE LEARNING CONTROL; REINFORCEMENT;
DOI
10.1049/cit2.12211
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification code
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Redundancy elimination techniques are extensively investigated to reduce storage overheads in cloud-assisted health systems. Deduplication eliminates duplicate blocks by storing one physical instance that all duplicates reference. Delta compression is usually regarded as a complement to deduplication that further removes redundancy among similar blocks, but our observations indicate that this does not hold when data contain few duplicate blocks. In addition, the resemblance detection process of post-deduplication delta compression produces many overlapped deltas, which hinders the efficiency of delta compression, and the index phase of resemblance detection queries many non-similar blocks, which lowers system throughput. Therefore, a multi-feature-based redundancy elimination scheme, called MFRE, is proposed to solve these problems. The similarity feature and the temporal locality feature are extracted to assist redundancy elimination, with the similarity feature expressing the duplicate attribute well. Then, similarity-based dynamic post-deduplication delta compression and temporal locality-based dynamic delta compression discover more similar base blocks to minimise overlapped deltas and improve compression ratios. Moreover, a clustering method based on block relationships and a feature index strategy based on Bloom filters reduce I/O overheads and improve system throughput. Experiments demonstrate that, compared with the state-of-the-art method, the proposed method improves the compression ratio and system throughput by 9.68% and 50%, respectively.
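The deduplication step described in the abstract (one physical instance referenced by multiple duplicates) can be illustrated with a minimal sketch. This is not the MFRE scheme itself: the fixed-size chunking, the SHA-256 fingerprint, and the names `deduplicate` and `restore` are assumptions chosen for illustration, since the abstract does not specify them.

```python
import hashlib


def deduplicate(data: bytes, block_size: int = 4096):
    """Split data into fixed-size blocks and keep one copy of each unique block.

    Fixed-size chunking and SHA-256 fingerprints are illustrative assumptions,
    not details taken from the paper.
    """
    store = {}   # fingerprint -> the single physical copy of that block
    recipe = []  # ordered fingerprints needed to rebuild the original data
    for offset in range(0, len(data), block_size):
        block = data[offset:offset + block_size]
        fingerprint = hashlib.sha256(block).hexdigest()
        store.setdefault(fingerprint, block)  # store only the first occurrence
        recipe.append(fingerprint)            # every occurrence is a reference
    return store, recipe


def restore(store, recipe) -> bytes:
    """Rebuild the original data by following the references in order."""
    return b"".join(store[fp] for fp in recipe)


# Three logical blocks, two of which are identical.
data = b"A" * 4096 + b"B" * 4096 + b"A" * 4096
store, recipe = deduplicate(data)
```

Here `store` holds two physical blocks while `recipe` holds three references, and `restore(store, recipe)` returns the original bytes. Blocks that are similar but not identical gain nothing from this step, which is the gap delta compression is meant to fill.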
Pages: 491-510
Number of pages: 20