VLSI realization of learning vector quantization with hardware/software co-design for different applications

被引:14
作者
An, Fengwei [1 ]
Akazawa, Toshinobu [2 ]
Yamasaki, Shogo [2 ]
Chen, Lei [3 ]
Mattausch, Hans Juergen [2 ,3 ]
机构
[1] Hiroshima Univ, Grad Sch Engn, Hiroshima 7398527, Japan
[2] Hiroshima Univ, Res Inst Nanodevice & Bio Syst, Hiroshima 7398530, Japan
[3] Hiroshima Univ, HiSIM Res Ctr, Hiroshima 7398530, Japan
关键词
NEURAL-NETWORK; IMPLEMENTATION; ALGORITHM; HARDWARE; CHIP; MAP; ARCHITECTURE; SYSTEM;
D O I
10.7567/JJAP.54.04DE05
中图分类号
O59 [应用物理学];
学科分类号
摘要
This paper reports a VLSI realization of learning vector quantization (LVQ) with high flexibility for different applications. It is based on a hardware/software (HW/SW) co-design concept for on-chip learning and recognition and designed as a SoC in 180nm CMOS. The time consuming nearest Euclidean distance search in the LVQ algorithm's competition layer is efficiently implemented as a pipeline with parallel p-word input. Since neuron number in the competition layer, weight values, input and output number are scalable, the requirements of many different applications can be satisfied without hardware changes. Classification of a d-dimensional input vector is completed in n x [d/p] + R clock cycles, where R is the pipeline depth, and n is the number of reference feature vectors (FVs). Adjustment of stored reference FVs during learning is done by the embedded 32-bit RISC CPU, because this operation is not time critical. The high flexibility is verified by the application of human detection with different numbers for the dimensionality of the FVs. (C) 2015 The Japan Society of Applied Physics
引用
收藏
页数:5
相关论文
共 30 条
[1]   Partial least squares regression and projection on latent structure regression (PLS Regression) [J].
Abdi, Herve .
WILEY INTERDISCIPLINARY REVIEWS-COMPUTATIONAL STATISTICS, 2010, 2 (01) :97-106
[2]  
An F., 2014, EXT ABSTR SOLID STAT, P104
[3]   K-means clustering algorithm for multimedia applications with flexible HW/SW co-design [J].
An, Fengwei ;
Mattausch, Hans Juergen .
JOURNAL OF SYSTEMS ARCHITECTURE, 2013, 59 (03) :155-164
[4]   A K-Means-Based Multi-Prototype High-Speed Learning System with FPGA-Implemented Coprocessor for 1-NN Searching [J].
An, Fengwei ;
Koide, Tetsushi ;
Mattausch, Hans Juergen .
IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2012, E95D (09) :2327-2338
[5]   A digital architecture for support vector machines: Theory, algorithm, and FPGA implementation [J].
Anguita, D ;
Boni, A ;
Ridella, S .
IEEE TRANSACTIONS ON NEURAL NETWORKS, 2003, 14 (05) :993-1009
[6]   Improved neural network for SVM learning [J].
Anguita, D ;
Boni, A .
IEEE TRANSACTIONS ON NEURAL NETWORKS, 2002, 13 (05) :1243-1244
[7]   A CNN-based chip for robot locomotion control [J].
Arena, P ;
Fortuna, L ;
Frasca, M ;
Patané, L .
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS I-REGULAR PAPERS, 2005, 52 (09) :1862-1871
[8]  
Benenson R, 2012, PROC CVPR IEEE, P2903, DOI 10.1109/CVPR.2012.6248017
[9]   Histograms of oriented gradients for human detection [J].
Dalal, N ;
Triggs, B .
2005 IEEE COMPUTER SOCIETY CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, VOL 1, PROCEEDINGS, 2005, :886-893
[10]  
Dalal N., 2005, INRIA HUMAN DATASE