American Sign Language Translation Using Wearable Inertial and Electromyography Sensors for Tracking Hand Movements and Facial Expressions

Cited by: 7
Authors
Gu, Yutong [1]
Zheng, Chao [2]
Todoh, Masahiro [3]
Zha, Fusheng [4]
Affiliations
[1] Hokkaido Univ, Grad Sch Engn, Sapporo, Japan
[2] China State Shipbuilding Corp Ltd, Wuhan Ship Design & Res Inst 2, Wuhan, Peoples R China
[3] Hokkaido Univ, Fac Engn, Sapporo, Japan
[4] Harbin Inst Technol, State Key Lab Robot & Syst, Harbin, Peoples R China
Keywords
American sign language; inertial measurement units; electromyography; long short-term memory; transformer
DOI
10.3389/fnins.2022.962141
Chinese Library Classification (CLC) number
Q189 [Neuroscience]
Subject classification code
071006
Abstract
A sign language translation system can break the communication barrier between hearing-impaired people and others. This paper proposes a novel American Sign Language (ASL) translation method based on wearable sensors. Inertial sensors capture the signs, and surface electromyography (EMG) sensors detect facial expressions. A convolutional neural network (CNN) extracts features from the input signals; long short-term memory (LSTM) and transformer models then perform end-to-end translation from input signals to text sentences. Both models were evaluated on 40 ASL sentences that strictly follow the rules of grammar, with word error rate (WER) and sentence error rate (SER) as the evaluation metrics. On the testing dataset, the LSTM model achieved a 7.74% WER and a 9.17% SER, while the transformer model performed considerably better, with a 4.22% WER and a 4.72% SER. These results indicate that both models can translate sign language with high accuracy. With complete motion capture sensors and facial expression recognition methods, the sign language translation system has the potential to recognize more sentences.
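The abstract reports WER and SER but does not spell out how they are computed. The sketch below uses the conventional definitions as an assumption: WER is the total word-level edit distance divided by the total number of reference words, and SER is the fraction of sentences that are not reproduced exactly. The function names and the two example sentences are hypothetical and are not taken from the paper's dataset.

```python
from typing import List, Sequence


def edit_distance(ref: Sequence[str], hyp: Sequence[str]) -> int:
    """Word-level Levenshtein distance between a reference and a hypothesis."""
    prev = list(range(len(hyp) + 1))  # distances from an empty reference
    for i, ref_word in enumerate(ref, start=1):
        curr = [i]  # distance from ref[:i] to an empty hypothesis
        for j, hyp_word in enumerate(hyp, start=1):
            cost = 0 if ref_word == hyp_word else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # substitution or match
        prev = curr
    return prev[-1]


def wer(refs: List[str], hyps: List[str]) -> float:
    """Assumed definition: total edit errors / total reference words."""
    errors = sum(edit_distance(r.split(), h.split()) for r, h in zip(refs, hyps))
    words = sum(len(r.split()) for r in refs)
    return errors / words


def ser(refs: List[str], hyps: List[str]) -> float:
    """Assumed definition: fraction of sentences with at least one error."""
    wrong = sum(r.split() != h.split() for r, h in zip(refs, hyps))
    return wrong / len(refs)


# Hypothetical example: one sentence is correct, one drops a word.
refs = ["i want water", "you like coffee"]
hyps = ["i want water", "you coffee"]
print(f"WER = {wer(refs, hyps):.4f}, SER = {ser(refs, hyps):.4f}")
```

Under these definitions a single missing word counts once toward WER but marks the whole sentence as wrong for SER, which is why SER is usually at least as large as WER on short sentences, as in the figures reported above.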
Pages: 12