CONTINUOUS SIGN LANGUAGE RECOGNITION VIA REINFORCEMENT LEARNING

被引:0
作者
Zhang, Zhihao [1 ]
Pu, Junfu [1 ]
Zhuang, Liansheng [1 ]
Zhou, Wengang [1 ]
Li, Houqiang [1 ]
机构
[1] Univ Sci & Technol China, EEIS Dept, CAS Key Lab Technol Geospatial Informat Proc & Ap, Hefei, Anhui, Peoples R China
来源
2019 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP) | 2019年
关键词
sign language recognition; reinforcement learning; self-critic;
D O I
10.1109/icip.2019.8802972
中图分类号
TB8 [摄影技术];
学科分类号
0804 ;
摘要
In this paper, we propose an approach to apply the Transformer with reinforcement learning (RL) for continuous sign language recognition (CSLR) task. The Transformer has an encoder-decoder structure, where the encoder network encodes the sign video into the context vector representation, while the decoder network generates the target sentence word by word based on the context vector. To avoid the intrinsic defects of supervised learning (SL) in our task, e.g., the exposure bias and non-differentiable task metrics issues, we propose to train the Transformer directly on non-differentiable metrics, i.e., word error rate (WER), through RL. Moreover, a policy gradient algorithm with baseline, which we call Self-critic REINFORCE, is employed to reduce variance while training. Experimental results on RWTH-PHOENIX-Weather benchmark verify the effectiveness of our method and demonstrate that our method achieves the comparable performance.
引用
收藏
页码:285 / 289
页数:5
相关论文
共 50 条
[11]   On the role of multimodal learning in the recognition of sign language [J].
Pedro M. Ferreira ;
Jaime S. Cardoso ;
Ana Rebelo .
Multimedia Tools and Applications, 2019, 78 :10035-10056
[12]   On the role of multimodal learning in the recognition of sign language [J].
Ferreira, Pedro M. ;
Cardoso, Jaime S. ;
Rebelo, Ana .
MULTIMEDIA TOOLS AND APPLICATIONS, 2019, 78 (08) :10035-10056
[13]   British Sign Language Recognition via Late Fusion of Computer Vision and Leap Motion with Transfer Learning to American Sign Language [J].
Bird, Jordan J. ;
Ekart, Aniko ;
Faria, Diego R. .
SENSORS, 2020, 20 (18) :1-19
[14]   Continuous word level sign language recognition using an expert system based on machine learning [J].
Sreemathy R. ;
Turuk M.P. ;
Chaudhary S. ;
Lavate K. ;
Ushire A. ;
Khurana S. .
International Journal of Cognitive Computing in Engineering, 2023, 4 :170-178
[15]   Continuous Chinese Sign Language Recognition with CNN-LSTM [J].
Yang, Su ;
Zhu, Qing .
NINTH INTERNATIONAL CONFERENCE ON DIGITAL IMAGE PROCESSING (ICDIP 2017), 2017, 10420
[16]   Recent Advances on Deep Learning for Sign Language Recognition [J].
Zhang, Yanqiong ;
Jiang, Xianwei .
CMES-COMPUTER MODELING IN ENGINEERING & SCIENCES, 2024, 139 (03) :2399-2450
[17]   Chinese Sign Language Recognition with Sequence to Sequence Learning [J].
Mao, Chensi ;
Huang, Shiliang ;
Li, Xiaoxu ;
Ye, Zhongfu .
COMPUTER VISION, PT I, 2017, 771 :180-191
[18]   Self-directed-Learning for Sign Language Recognition [J].
Jiang, Huaqiang ;
Hu, Huosheng ;
Pan, Hong .
PROCEEDINGS OF THE 9TH WSEAS INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING, COMPUTATIONAL GEOMETRY AND ARTIFICIAL VISION (ISCGAV'09), 2009, :139-+
[19]   Recent Advances of Deep Learning for Sign Language Recognition [J].
Zheng, Lihong ;
Liang, Bin ;
Jiang, Ailian .
2017 INTERNATIONAL CONFERENCE ON DIGITAL IMAGE COMPUTING - TECHNIQUES AND APPLICATIONS (DICTA), 2017, :454-460
[20]   Review of Sign Language Recognition Based on Deep Learning [J].
Zhang Shujun ;
Zhang Qun ;
Li Hui .
JOURNAL OF ELECTRONICS & INFORMATION TECHNOLOGY, 2020, 42 (04) :1021-1032