Optimizing echo state networks for continuous gesture recognition in mobile devices: A comparative study

被引:3
作者
Yadav, Alok [1 ]
Pasupa, Kitsuchart [1 ]
Loo, Chu Kiong [2 ]
Liu, Xiaofeng [3 ]
机构
[1] King Mongkuts Inst Technol Ladkrabang, Sch Informat Technol, Bangkok 10520, Thailand
[2] Univ Malaya, Fac Comp Sci & Informat Technol, Kuala Lumpur 50603, Malaysia
[3] Hohai Univ, Coll IoT Engn, Changzhou 213022, Peoples R China
关键词
Echo state networks; Continuous gesture recognition; Behaviour space analysis; ALGORITHMS;
D O I
10.1016/j.heliyon.2024.e27108
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Continuous gesture recognition can be used to enhance human -computer interaction. This can be accomplished by capturing human movement with the use of the Inertial Measurement Units in smartphones and using machine learning algorithms to predict the intended gestures. Echo State Networks (ESNs) consist of a fixed internal reservoir that is able to generate rich and diverse nonlinear dynamics in response to input signals that capture temporal dependencies within the signal. This makes ESNs well -suited for time series prediction tasks, such as continuous gesture recognition. However, their application has not been rigorously explored, with regard to gesture recognition. In this study, we sought to enhance the efficacy of ESN models in continuous gesture recognition by exploring diverse model structures, fine-tuning hyperparameters, and experimenting with various training approaches. We used three different training schemes that used the Leave -one -out Cross -validation (LOOCV) protocol to investigate the performance in realworld scenarios with different levels of data availability: Leaving out data from one user to use for testing (F1 -score: 0.89), leaving out a fraction of data from all users to use in testing (F1 -score: 0.96), and training and testing using LOOCV on a single user (F1 -score: 0.99). The obtained results outperformed the Long Short -Term Memory (LSTM) performance from past research (F1 -score: 0.87) while maintaining a low training time of approximately 13 seconds compared to 63 seconds for the LSTM model. Additionally, we further explored the performance of the ESN models through behaviour space analysis using memory capacity, Kernel Rank, and Generalization Rank. Our results demonstrate that ESNs can be optimized to achieve high performance on gesture recognition in mobile devices on multiple levels of data availability. These findings highlight the practical ability of ESNs to enhance human -computer interaction.
引用
收藏
页数:15
相关论文
共 27 条
[1]   New results on recurrent network training: Unifying the algorithms and accelerating convergence [J].
Atiya, AF ;
Parlos, AG .
IEEE TRANSACTIONS ON NEURAL NETWORKS, 2000, 11 (03) :697-709
[3]   Connectivity, Dynamics, and Memory in Reservoir Computing with Binary and Analog Neurons [J].
Buesing, Lars ;
Schrauwen, Benjamin ;
Legenstein, Robert .
NEURAL COMPUTATION, 2010, 22 (05) :1272-1311
[4]  
Dale M, 2019, Arxiv, DOI arXiv:1810.07135
[5]   Next generation reservoir computing [J].
Gauthier, Daniel J. ;
Bollt, Erik ;
Griffith, Aaron ;
Barbosa, Wendson A. S. .
NATURE COMMUNICATIONS, 2021, 12 (01)
[6]  
Graves A, 2012, STUD COMPUT INTELL, V385, P1, DOI [10.1162/neco.1997.9.1.1, 10.1007/978-3-642-24797-2]
[7]   Support vector machines [J].
Hearst, MA .
IEEE INTELLIGENT SYSTEMS & THEIR APPLICATIONS, 1998, 13 (04) :18-21
[8]   RIDGE REGRESSION - BIASED ESTIMATION FOR NONORTHOGONAL PROBLEMS [J].
HOERL, AE ;
KENNARD, RW .
TECHNOMETRICS, 1970, 12 (01) :55-&
[9]  
Jaeger H. J. B, 2001, GMD Report
[10]   Optimization and applications of echo state networks with leaky-integrator neurons [J].
Jaegera, Herbert ;
Lukosevicius, Mantas ;
Popovici, Dan ;
Siewert, Udo .
NEURAL NETWORKS, 2007, 20 (03) :335-352