Real-time progressive compression method of massive data based on improved clustering algorithm

Cited by: 0
Authors
Yang, Hengxiang [1]
Li, Lumin [1]
Li, Kai [1]
Affiliations
[1] State Grid Xinjiang Informat & Telecommun Co, Urumqi 830000, Xinjiang, Peoples R China
Source
CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS | 2023 / Vol. 26 / Issue 06
Keywords
Improved clustering algorithm; Massive data; Real-time progressive; Data compression; Clustering characteristics; Huffman coding;
DOI
10.1007/s10586-022-03780-3
Chinese Library Classification number
TP [Automation technology, computer technology];
Subject classification code
0812;
Abstract
This paper proposes a real-time progressive compression method for massive data based on an improved clustering algorithm, with the aim of compressing massive data in real time while preserving the quality of the compressed data. In the micro-clustering stage of a BIRCH method based on K-Medoids clustering, a Clustering Feature Tree hierarchy is constructed and numerical clustering features are extracted. These features serve as the input to the macro-clustering stage, in which the leaf nodes of the Clustering Feature Tree are clustered with the improved K-Medoids method and the resulting set of data clusters is output. This set is the input to real-time progressive compression: the data are denoised and compressed with a lifting-scheme wavelet transform, and Huffman coding is then applied to compress the result losslessly. Test results show that, with the optimal number of cluster centers, the method clusters well, completes real-time progressive compression of large volumes of data, and yields compressed data with an availability above 92%.
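The compression stage named in the abstract (a lifting wavelet step followed by Huffman coding) can be illustrated with a minimal sketch. The abstract does not state which wavelet, threshold rule, or code construction the authors use, so everything below is an assumption made for illustration: a one-level integer Haar lifting step, a simple hard threshold on the detail coefficients for denoising, and a textbook Huffman code. All function names are hypothetical and are not taken from the paper.

```python
import heapq
from collections import Counter


def haar_lifting_forward(x):
    """One level of an integer Haar lifting step (assumed wavelet).
    Expects an even-length list of ints; returns (approx, detail)."""
    even, odd = x[0::2], x[1::2]
    detail = [o - e for e, o in zip(even, odd)]            # predict step
    approx = [e + (d >> 1) for e, d in zip(even, detail)]  # update step
    return approx, detail


def haar_lifting_inverse(approx, detail):
    """Undo haar_lifting_forward exactly (integer arithmetic is lossless)."""
    even = [a - (d >> 1) for a, d in zip(approx, detail)]
    odd = [d + e for e, d in zip(even, detail)]
    out = []
    for e, o in zip(even, odd):
        out.extend([e, o])
    return out


def hard_threshold(detail, t):
    """Zero small detail coefficients (assumed denoising rule, lossy)."""
    return [0 if abs(d) <= t else d for d in detail]


def huffman_code(symbols):
    """Build a prefix code {symbol: bitstring} from symbol frequencies."""
    freq = Counter(symbols)
    if len(freq) == 1:  # a single distinct symbol still needs one bit
        return {next(iter(freq)): "0"}
    # Heap items: (frequency, unique tie-breaker, partial code table).
    heap = [(f, i, {s: ""}) for i, (s, f) in enumerate(freq.items())]
    heapq.heapify(heap)
    tie = len(heap)
    while len(heap) > 1:
        f1, _, c1 = heapq.heappop(heap)
        f2, _, c2 = heapq.heappop(heap)
        merged = {s: "0" + c for s, c in c1.items()}
        merged.update({s: "1" + c for s, c in c2.items()})
        heapq.heappush(heap, (f1 + f2, tie, merged))
        tie += 1
    return heap[0][2]


def huffman_encode(symbols, code):
    return "".join(code[s] for s in symbols)


def huffman_decode(bits, code):
    inverse = {v: k for k, v in code.items()}
    out, cur = [], ""
    for b in bits:
        cur += b
        if cur in inverse:  # prefix-free, so the first match is the symbol
            out.append(inverse[cur])
            cur = ""
    return out


# Usage: transform, denoise, then entropy-code one cluster's samples.
samples = [10, 12, 11, 11, 50, 52, 9, 9]
approx, detail = haar_lifting_forward(samples)
coeffs = approx + hard_threshold(detail, 1)
code = huffman_code(coeffs)
bits = huffman_encode(coeffs, code)
```

In this sketch only the thresholding is lossy; the lifting step and the Huffman stage are both exactly invertible, which mirrors the abstract's split between wavelet-based denoising and lossless coding.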
Pages: 3781-3791
Number of pages: 11