A K-Means-Based Multi-Prototype High-Speed Learning System with FPGA-Implemented Coprocessor for 1-NN Searching

被引:16
作者
An, Fengwei [1 ]
Koide, Tetsushi [1 ]
Mattausch, Hans Juergen [1 ]
机构
[1] Hiroshima Univ, Res Inst Nanodevice & Bio Syst, Higashihiroshima 7398527, Japan
关键词
multi-prototype learning system; K-means clustering; software-hardware cooperation; one nearest neighbor (1-NN); FPGA-implemented coprocessor; nearest euclidean distance searching; ARTIFICIAL NEURAL-NETWORKS; SUPPORT VECTOR MACHINE; ASSOCIATIVE-MEMORY; HARDWARE; ARCHITECTURE; CLASSIFIER; ALGORITHMS; MOSFETS; DESIGN;
D O I
10.1587/transinf.E95.D.2327
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In this paper. we propose a hardware solution for overcoming the problem of high computational demands in a nearest neighbor (NN) based multi-prototype learning system. The multiple prototypes are obtained by a high-speed K-means clustering algorithm utilizing a concept of software-hardware cooperation that takes advantage of the flexibility of the software and the efficiency of the hardware. The one nearest neighbor (1-NN) classifier is used to recognize an object by searching for the nearest Euclidean distance among the prototypes. The major deficiency in conventional implementations for both K-means and 1-NN is the high computational demand of the nearest neighbor searching. This deficiency is resolved by an FPGA-implemented coprocessor that is a VLSI circuit for searching the nearest Euclidean distance. The coprocessor requires 12.9% logic elements and 58% block memory bits of an Altera Stratix III E110 FPGA device. The hardware communicates with the software by a PCI Express (x4) local-bus-compatible interface. We benchmark our learning system against the popular case of handwritten digit recognition in which abundant previous works for comparison are available. In the case of the MNIST database, we could attain the most efficient accuracy rate of 97.91% with 930 prototypes, the learning speed of 1.3 x 10(-4) s/sample and the classification speed of 3.94 x 10(-8) s/character.
引用
收藏
页码:2327 / 2338
页数:12
相关论文
共 63 条
[1]   Mixed digital-analog associative memory enabling fully-parallel nearest Euclidean distance search [J].
Abedin, Md. Anwarul ;
Tanaka, Yuki ;
Ahmadi, Ali ;
Koide, Tetsushi ;
Mattausch, Hans Juergen .
JAPANESE JOURNAL OF APPLIED PHYSICS PART 1-REGULAR PAPERS BRIEF COMMUNICATIONS & REVIEW PAPERS, 2007, 46 (4B) :2231-2237
[2]   A digital architecture for support vector machines: Theory, algorithm, and FPGA implementation [J].
Anguita, D ;
Boni, A ;
Ridella, S .
IEEE TRANSACTIONS ON NEURAL NETWORKS, 2003, 14 (05) :993-1009
[3]  
[Anonymous], 2004, Proceedings of the 2004 ACM/IEEE conference on Supercomputing, page, DOI DOI 10.1109/SC.2004.26
[4]  
[Anonymous], NEURAL COMPUTING SUR
[5]  
[Anonymous], 2008, P 31 ANN INT ACM SIG, DOI 10.1145/1390334.1390356
[6]  
[Anonymous], 1973, Pattern Classification and Scene Analysis
[7]  
Bajramovic F, 2006, LECT NOTES COMPUT SC, V4179, P1186
[8]  
Bezdek J. C., 1981, Pattern recognition with fuzzy objective function algorithms
[9]   Multiple-prototype classifier design [J].
Bezdek, JC ;
Reichherzer, TR ;
Lim, GS ;
Attikiouzel, Y .
IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART C-APPLICATIONS AND REVIEWS, 1998, 28 (01) :67-79
[10]  
Biasi I, 2005, IEEE IJCNN, P2867