Hand-Gesture Recognition Based on EMG and Event-Based Camera Sensor Fusion: A Benchmark in Neuromorphic Computing

被引:118
作者
Ceolini, Enea [1 ]
Frenkel, Charlotte [1 ,2 ]
Shrestha, Sumit Bam [3 ]
Taverni, Gemma [1 ]
Khacef, Lyes [4 ]
Payvand, Melika [1 ]
Donati, Elisa [1 ]
机构
[1] Univ Zurich, Inst Neuroinformat, ETH Zurich, Zurich, Switzerland
[2] Catholic Univ Louvain, ICTEAM Inst, Louvain La Neuve, Belgium
[3] Natl Univ Singapore, Temasek Labs, Singapore, Singapore
[4] Univ Cote dAzur, CNRS, LEAT, Nice, France
基金
欧盟地平线“2020”;
关键词
hand-gesture classification; spiking neural networks (SNNs); electromyography (EMG) signal processing; event-based camera; sensor fusion; neuromorphic engineering; NEURAL-NETWORK; ARCHITECTURE; SYSTEM;
D O I
10.3389/fnins.2020.00637
中图分类号
Q189 [神经科学];
学科分类号
071006 ;
摘要
Hand gestures are a form of non-verbal communication used by individuals in conjunction with speech to communicate. Nowadays, with the increasing use of technology, hand-gesture recognition is considered to be an important aspect of Human-Machine Interaction (HMI), allowing the machine to capture and interpret the user's intent and to respond accordingly. The ability to discriminate between human gestures can help in several applications, such as assisted living, healthcare, neuro-rehabilitation, and sports. Recently, multi-sensor data fusion mechanisms have been investigated to improve discrimination accuracy. In this paper, we present a sensor fusion framework that integrates complementary systems: the electromyography (EMG) signal from muscles and visual information. This multi-sensor approach, while improving accuracy and robustness, introduces the disadvantage of high computational cost, which grows exponentially with the number of sensors and the number of measurements. Furthermore, this huge amount of data to process can affect the classification latency which can be crucial in real-case scenarios, such as prosthetic control. Neuromorphic technologies can be deployed to overcome these limitations since they allow real-time processing in parallel at low power consumption. In this paper, we present a fully neuromorphic sensor fusion approach for hand-gesture recognition comprised of an event-based vision sensor and three different neuromorphic processors. In particular, we used the event-based camera, called DVS, and two neuromorphic platforms, Loihi and ODIN + MorphIC. The EMG signals were recorded using traditional electrodes and then converted into spikes to be fed into the chips. We collected a dataset of five gestures from sign language where visual and electromyography signals are synchronized. We compared a fully neuromorphic approach to a baseline implemented using traditional machine learning approaches on a portable GPU system. According to the chip's constraints, we designed specific spiking neural networks (SNNs) for sensor fusion that showed classification accuracy comparable to the software baseline. These neuromorphic alternatives have increased inference time, between 20 and 40%, with respect to the GPU system but have a significantly smaller energy-delay product (EDP) which makes them between 30x and 600x more efficient. The proposed work represents a new benchmark that moves neuromorphic computing toward a real-world scenario.
引用
收藏
页数:15
相关论文
共 77 条
[11]   Neurogrid: A Mixed-Analog-Digital Multichip System for Large-Scale Neural Simulations [J].
Benjamin, Ben Varkey ;
Gao, Peiran ;
McQuinn, Emmett ;
Choudhary, Swadesh ;
Chandrasekaran, Anand R. ;
Bussat, Jean-Marie ;
Alvarez-Icaza, Rodrigo ;
Arthur, John V. ;
Merolla, Paul A. ;
Boahen, Kwabena .
PROCEEDINGS OF THE IEEE, 2014, 102 (05) :699-716
[12]   Classifier Level Fusion of Accelerometer and sEMG Signals for Automatic Fitness Activity Diarization [J].
Biagetti, Giorgio ;
Crippa, Paolo ;
Falaschetti, Laura ;
Turchetti, Claudio .
SENSORS, 2018, 18 (09)
[13]  
Boski M, 2017, 2017 10TH INTERNATIONAL WORKSHOP ON MULTIDIMENSIONAL (ND) SYSTEMS (NDS)
[14]   Learning real-world stimuli in a neural network with spike-driven synaptic dynamics [J].
Brader, Joseph M. ;
Senn, Walter ;
Fusi, Stefano .
NEURAL COMPUTATION, 2007, 19 (11) :2881-2912
[15]  
Braun S., 2019, WORKSHOP APPL SIGNAL, P1, DOI DOI 10.1109/IJCNN.2019.8852396
[16]   A Review of Data Fusion Techniques [J].
Castanedo, Federico .
SCIENTIFIC WORLD JOURNAL, 2013,
[17]   AER EAR: A matched silicon cochlea pair with address event representation interface [J].
Chan, Vincent ;
Liu, Shih-Chii ;
van Schaik, Andre .
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS I-REGULAR PAPERS, 2007, 54 (01) :48-59
[18]   Hand gesture recognition based on motor unit spike trains decoded from high-density electromyography [J].
Chen, Chen ;
Yu, Yang ;
Ma, Shihan ;
Sheng, Xinjun ;
Lin, Chuang ;
Farina, Dario ;
Zhu, Xiangyang .
BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2020, 55
[19]   A review of hand gesture and sign language recognition techniques [J].
Cheok, Ming Jin ;
Omar, Zaid ;
Jaward, Mohamed Hisham .
INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2019, 10 (01) :131-153
[20]   A Kinect-based Gesture Recognition Approach for a Natural Human Robot Interface [J].
Cicirelli, Grazia ;
Attolico, Carmela ;
Guaragnella, Cataldo ;
D'Orazio, Tiziana .
INTERNATIONAL JOURNAL OF ADVANCED ROBOTIC SYSTEMS, 2015, 12