K-Means VQ algorithm using a low-cost parallel cluster computing

被引:0
作者
de Souza, PSL [1 ]
Britto, AD [1 ]
Sabourin, R [1 ]
de Souza, SRS [1 ]
Borges, DL [1 ]
机构
[1] Univ Estadual Ponta Grossa, BR-84100000 Ponta Grossa, Brazil
来源
Proceedings of the IASTED International Conference on Parallel and Distributed Computing and Networks | 2004年
关键词
vector quantization; K-means algorithm; parallel computing;
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
It is well known that the time and memory necessary to create a codebook from large training databases have hindered the vector quantization based systems for real applications. To overcome this problem, we present a parallel approach for the K-means Vector Quantization (VQ) algorithm based on master/slave paradigm and low-cost parallel cluster computing. Distributing the training samples over the slaves' local disks reduces the overhead associated with the communication process. In addition, models predicting computation and communication time have been developed. These models are useful to predict the optimal number of slaves taking into account the number of training samples and codebook size. The experiments have shown the efficiency of the proposed models and also a linear speed up of the vector quantization process used in a two-stage Hidden Markov Model (HMM)-based system for recognizing handwritten numeral strings.
引用
收藏
页码:392 / 397
页数:6
相关论文
共 9 条
[1]   Parallel codebook design for vector quantization on a message passing MIMD architecture [J].
Abbas, HM ;
Bayoumi, MM .
PARALLEL COMPUTING, 2002, 28 (7-8) :1079-1093
[2]   On distributing the clustering process [J].
Boutsinas, B ;
Gnardellis, T .
PATTERN RECOGNITION LETTERS, 2002, 23 (08) :999-1008
[3]  
Britto A. Jr., 2001, Proceedings of Sixth International Conference on Document Analysis and Recognition, P396, DOI 10.1109/ICDAR.2001.953820
[4]  
CANTUPAZ E, 1997, 97004 ILLIGAL
[5]  
DHILLON IS, 1999, IEEE T KNOWL DAT ENG
[6]  
HWANG K, 1997, SCALABLE PARALLEL CO
[7]  
Kobayashi K, 1997, IEICE T ELECTRON, VE80C, P970
[8]  
MacQueen J., 1967, P 5 BERKELEY S MATH, V1, P281
[9]  
Manohar M., 1992, DCC '92. Data Compression Conference (Cat. No.92TH0436-6), P181, DOI 10.1109/DCC.1992.227463