CONTINUOUS SIGN LANGUAGE RECOGNITION VIA REINFORCEMENT LEARNING

Citations: 0
Authors
Zhang, Zhihao [1]
Pu, Junfu [1]
Zhuang, Liansheng [1]
Zhou, Wengang [1]
Li, Houqiang [1]
Affiliation
[1] Univ Sci & Technol China, EEIS Dept, CAS Key Lab Technol Geospatial Informat Proc & Ap, Hefei, Anhui, Peoples R China
Source
2019 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP) | 2019
Keywords
sign language recognition; reinforcement learning; self-critic;
DOI
10.1109/icip.2019.8802972
Chinese Library Classification
TB8 [Photographic technology];
Discipline classification code
0804;
Abstract
In this paper, we propose an approach that applies the Transformer with reinforcement learning (RL) to the continuous sign language recognition (CSLR) task. The Transformer has an encoder-decoder structure: the encoder network encodes the sign video into a context vector representation, and the decoder network generates the target sentence word by word based on that context vector. To avoid the intrinsic defects of supervised learning (SL) in this task, e.g., exposure bias and the non-differentiability of task metrics, we propose to train the Transformer directly on a non-differentiable metric, i.e., word error rate (WER), through RL. Moreover, a policy gradient algorithm with a baseline, which we call Self-critic REINFORCE, is employed to reduce variance during training. Experimental results on the RWTH-PHOENIX-Weather benchmark verify the effectiveness of our method and demonstrate that it achieves comparable performance.
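The training signal described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: it assumes the reward is the negative WER of a sampled sentence and that the baseline is the reward of the greedy decode, as in self-critical sequence training, which the abstract does not state explicitly. The function names `wer` and `self_critical_loss` and the example glosses are hypothetical.

```python
from typing import List, Sequence


def wer(reference: Sequence[str], hypothesis: Sequence[str]) -> float:
    """Word error rate: word-level edit distance divided by reference length."""
    if not reference:
        raise ValueError("reference must be non-empty")
    # prev[j] holds the edit distance between the reference prefix
    # processed so far and the first j hypothesis words.
    prev = list(range(len(hypothesis) + 1))
    for i, ref_word in enumerate(reference, start=1):
        curr = [i]
        for j, hyp_word in enumerate(hypothesis, start=1):
            cost = 0 if ref_word == hyp_word else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # substitution or match
        prev = curr
    return prev[-1] / len(reference)


def self_critical_loss(reference: Sequence[str],
                       sampled: Sequence[str],
                       greedy: Sequence[str],
                       sampled_log_probs: List[float]) -> float:
    """REINFORCE loss with a greedy-decode baseline (assumed form).

    sampled_log_probs are the per-word log-probabilities the model
    assigned to the sampled sentence.
    """
    reward = -wer(reference, sampled)    # lower WER means higher reward
    baseline = -wer(reference, greedy)   # assumption: greedy decode as baseline
    advantage = reward - baseline
    # Minimising this loss raises the probability of samples that beat the baseline.
    return -advantage * sum(sampled_log_probs)


if __name__ == "__main__":
    ref = ["rain", "in", "north"]
    loss = self_critical_loss(ref, ref, ["rain", "north"], [-0.5, -0.5, -0.5])
    print(loss)
```

In an actual training loop the log-probabilities would be differentiable tensors from the decoder, and the advantage would be treated as a constant when backpropagating.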
Pages: 285-289
Number of pages: 5