Moving Learning Machine towards Fast Real-Time Applications: A High-Speed FPGA-Based Implementation of the OS-ELM Training Algorithm

Cited by: 14
Authors
Frances-Villora, Jose V. [1]
Rosado-Munoz, Alfredo [1]
Bataller-Mompean, Manuel [1]
Barrios-Aviles, Juan [1]
Guerrero-Martinez, Juan F. [1]
Affiliations
[1] Univ Valencia, Dept Elect Engn, Proc & Digital Design Grp, Burjassot 46100, Spain
Keywords
online sequential ELM; OS-ELM; FPGA; on-chip training; on-line learning; real-time learning; hardware implementation; extreme learning machine; NEURAL-NETWORKS; EOS-ELM; EXTREME; IMBALANCE;
DOI
10.3390/electronics7110308
Chinese Library Classification number
TP [Automation Technology, Computer Technology];
Discipline classification code
0812;
Abstract
Some emerging online learning applications handle data streams in real time. The On-line Sequential Extreme Learning Machine (OS-ELM) has been successfully used in real-time condition prediction applications because of its good generalization performance at an extreme learning speed, but the number of trainings per second (training frequency) achieved in these continuous learning applications must be further improved. This paper proposes a performance-optimized implementation of the OS-ELM training algorithm for real-time applications. In this case, the natural way of feeding the training of the neural network is one-by-one, i.e., training the neural network on each new incoming training input vector. Under this restriction, the computational needs are drastically reduced. An FPGA-based implementation of the tailored OS-ELM algorithm is used to analyze, in a parameterized way, the level of optimization achieved. We observed that the tailored algorithm drastically reduces the number of clock cycles consumed by the training execution, down to approximately 1%. This performance enables high sequential training rates, such as a sequential training frequency of 14 kHz for an SLFN with 40 hidden neurons, or 180 Hz for an SLFN with 500 hidden neurons. In practice, the proposed implementation computes the training almost 100 times faster, or more, than other applications in the literature. Moreover, the number of clock cycles follows a quadratic complexity in the number of hidden neurons and is only weakly influenced by the number of input neurons. However, the implementation shows a pronounced sensitivity to data type precision even for small-size problems, which forces the use of double-precision floating-point data types to avoid finite-precision arithmetic effects.
In addition, it has been found that distributed memory is the limiting resource; thus, current FPGA devices can support OS-ELM-based on-chip learning of up to 500 hidden neurons. In conclusion, the proposed hardware implementation of the OS-ELM offers great possibilities for on-chip learning in portable systems and real-time applications where frequent and fast training is required.
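
To illustrate why one-by-one training is cheaper, the following is a minimal software sketch of a single-sample OS-ELM update. It is not the authors' FPGA design. It assumes the standard recursive least-squares form of OS-ELM with a single output neuron, a sigmoid hidden layer, and a regularized start P = I/lam in place of an initial training batch; the names `hidden_row`, `init_state`, `os_elm_step`, and `lam` are hypothetical. With one sample per update, the term that would be a matrix inverse in the chunked algorithm becomes a scalar division, and each update costs on the order of L^2 operations for L hidden neurons.

```python
import math


def hidden_row(x, W, b):
    """Sigmoid hidden-layer output (length L) for one input vector x.

    W is a list of L weight vectors and b a list of L biases; in ELM these
    are drawn at random once and never trained.
    """
    return [
        1.0 / (1.0 + math.exp(-(sum(wi * xi for wi, xi in zip(w, x)) + bj)))
        for w, bj in zip(W, b)
    ]


def init_state(L, lam=1e-3):
    """Assumed regularized start: P = I / lam, beta = 0 (no initial batch)."""
    P = [[(1.0 / lam if i == j else 0.0) for j in range(L)] for i in range(L)]
    beta = [0.0] * L
    return P, beta


def os_elm_step(P, beta, h, t):
    """Update P (L x L) and beta (length L) in place with one sample (h, t)."""
    L = len(h)
    # Ph = P h^T, an L-vector; this and the rank-1 update below are O(L^2).
    Ph = [sum(P[i][j] * h[j] for j in range(L)) for i in range(L)]
    # For a single sample, (1 + h P h^T) is a scalar: no matrix inversion.
    denom = 1.0 + sum(h[i] * Ph[i] for i in range(L))
    # P <- P - (P h^T)(h P) / denom; P is symmetric, so h P = (P h^T)^T.
    for i in range(L):
        for j in range(L):
            P[i][j] -= Ph[i] * Ph[j] / denom
    # Prediction error of the current weights on the new sample.
    err = t - sum(h[i] * beta[i] for i in range(L))
    # beta <- beta + P_new h^T err, where P_new h^T simplifies to Ph / denom.
    for i in range(L):
        beta[i] += Ph[i] / denom * err
```

In use, each incoming input vector `x` would be mapped through `hidden_row` and passed with its target to `os_elm_step`. After any number of steps, `beta` equals the ridge-regression solution over all samples seen so far, which is what makes the sequential form equivalent to retraining from scratch under these assumptions.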
Pages: 23