Optimizing echo state networks for continuous gesture recognition in mobile devices: A comparative study

被引:3
作者
Yadav, Alok [1 ]
Pasupa, Kitsuchart [1 ]
Loo, Chu Kiong [2 ]
Liu, Xiaofeng [3 ]
机构
[1] King Mongkuts Inst Technol Ladkrabang, Sch Informat Technol, Bangkok 10520, Thailand
[2] Univ Malaya, Fac Comp Sci & Informat Technol, Kuala Lumpur 50603, Malaysia
[3] Hohai Univ, Coll IoT Engn, Changzhou 213022, Peoples R China
关键词
Echo state networks; Continuous gesture recognition; Behaviour space analysis; ALGORITHMS;
D O I
10.1016/j.heliyon.2024.e27108
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Continuous gesture recognition can be used to enhance human -computer interaction. This can be accomplished by capturing human movement with the use of the Inertial Measurement Units in smartphones and using machine learning algorithms to predict the intended gestures. Echo State Networks (ESNs) consist of a fixed internal reservoir that is able to generate rich and diverse nonlinear dynamics in response to input signals that capture temporal dependencies within the signal. This makes ESNs well -suited for time series prediction tasks, such as continuous gesture recognition. However, their application has not been rigorously explored, with regard to gesture recognition. In this study, we sought to enhance the efficacy of ESN models in continuous gesture recognition by exploring diverse model structures, fine-tuning hyperparameters, and experimenting with various training approaches. We used three different training schemes that used the Leave -one -out Cross -validation (LOOCV) protocol to investigate the performance in realworld scenarios with different levels of data availability: Leaving out data from one user to use for testing (F1 -score: 0.89), leaving out a fraction of data from all users to use in testing (F1 -score: 0.96), and training and testing using LOOCV on a single user (F1 -score: 0.99). The obtained results outperformed the Long Short -Term Memory (LSTM) performance from past research (F1 -score: 0.87) while maintaining a low training time of approximately 13 seconds compared to 63 seconds for the LSTM model. Additionally, we further explored the performance of the ESN models through behaviour space analysis using memory capacity, Kernel Rank, and Generalization Rank. Our results demonstrate that ESNs can be optimized to achieve high performance on gesture recognition in mobile devices on multiple levels of data availability. These findings highlight the practical ability of ESNs to enhance human -computer interaction.
引用
收藏
页数:15
相关论文
共 27 条
[21]   ReservoirPy: An Efficient and User-Friendly Library to Design Echo State Networks [J].
Trouvain, Nathan ;
Pedrelli, Luca ;
Dinh, Thanh Trung ;
Hinaut, Xavier .
ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING, ICANN 2020, PT II, 2020, 12397 :494-505
[22]   An experimental unification of reservoir computing methods [J].
Verstraeten, D. ;
Schrauwen, B. ;
D'Haene, M. ;
Stroobandt, D. .
NEURAL NETWORKS, 2007, 20 (03) :391-403
[23]   Echo state networks regulated by local intrinsic plasticity rules for regression [J].
Wang, Xinjie ;
Jin, Yaochu ;
Hao, Kuangrong .
NEUROCOMPUTING, 2019, 351 :111-122
[24]  
Wang YT, 2018, 2018 IEEE 20TH INTERNATIONAL CONFERENCE ON E-HEALTH NETWORKING, APPLICATIONS AND SERVICES (HEALTHCOM)
[25]  
Wold S., 1987, International Journal of Livestock Research, DOI [10.1016/0169-7439(87)80084-9., DOI 10.5455/IJLR.20170415115235]
[26]   Revisiting Echo State Networks for Continuous Gesture Recognition [J].
Yadav, Alok ;
Pasupa, Kitsuchart ;
Loo, Chu Kiong .
2022 IEEE SYMPOSIUM SERIES ON COMPUTATIONAL INTELLIGENCE (SSCI), 2022, :978-983
[27]   Optimal echo state network parameters based on behavioural spaces [J].
Zhang, ZhaoZhao ;
Zhu, YingQin ;
Wang, XiaoHui ;
Yu, Wen .
NEUROCOMPUTING, 2022, 503 :299-313