CONTINUOUS SIGN LANGUAGE RECOGNITION VIA REINFORCEMENT LEARNING

Times cited: 0

Authors:

Zhang, Zhihao [1]

Pu, Junfu [1]

Zhuang, Liansheng [1]

Zhou, Wengang [1]

Li, Houqiang [1]

Affiliations:

[1] Univ Sci & Technol China, EEIS Dept, CAS Key Lab Technol Geospatial Informat Proc & Ap, Hefei, Anhui, Peoples R China

Source:

2019 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP) | 2019

Keywords:

sign language recognition; reinforcement learning; self-critic

DOI:

10.1109/icip.2019.8802972

Chinese Library Classification (CLC) number:

TB8 [Photographic technology]

Discipline classification code:

0804

Abstract:

In this paper, we propose an approach that applies the Transformer with reinforcement learning (RL) to the continuous sign language recognition (CSLR) task. The Transformer has an encoder-decoder structure: the encoder network encodes the sign video into a context vector representation, and the decoder network generates the target sentence word by word based on that context vector. To avoid the intrinsic defects of supervised learning (SL) in this task, e.g., exposure bias and non-differentiable task metrics, we propose to train the Transformer directly on a non-differentiable metric, i.e., word error rate (WER), through RL. Moreover, a policy gradient algorithm with a baseline, which we call Self-critic REINFORCE, is employed to reduce variance during training. Experimental results on the RWTH-PHOENIX-Weather benchmark verify the effectiveness of our method and show that it achieves comparable performance.
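
The training signal described in the abstract can be illustrated with a small sketch. This is not the authors' implementation: it assumes the baseline is the reward of a greedily decoded sentence (as in self-critical sequence training), that the reward is the negative WER, and that the per-token log-probabilities of the sampled sentence are already available. The function names `edit_distance`, `wer`, and `self_critical_loss` are hypothetical and chosen only for illustration.

```python
def edit_distance(ref, hyp):
    """Levenshtein distance between two token lists."""
    prev = list(range(len(hyp) + 1))
    for i, r in enumerate(ref, start=1):
        curr = [i]
        for j, h in enumerate(hyp, start=1):
            cost = 0 if r == h else 1
            curr.append(min(prev[j] + 1,          # drop a reference token
                            curr[j - 1] + 1,      # extra hypothesis token
                            prev[j - 1] + cost))  # substitution or match
        prev = curr
    return prev[-1]


def wer(ref, hyp):
    """Word error rate: edit distance normalised by reference length."""
    return edit_distance(ref, hyp) / max(len(ref), 1)


def self_critical_loss(sample_log_probs, sampled, greedy, reference):
    """REINFORCE loss with a baseline for one sentence.

    Assumption: the baseline is the reward of the greedy decode.
    The reward is -WER, so a lower error rate means a higher reward.
    """
    reward_sampled = -wer(reference, sampled)
    reward_baseline = -wer(reference, greedy)
    advantage = reward_sampled - reward_baseline
    # Minimising this raises the log-probability of samples that
    # beat the baseline and lowers it for samples that do worse.
    return -advantage * sum(sample_log_probs)
```

In this sketch, a sampled sentence that matches the reference while the greedy decode misses a word gets a positive advantage, so gradient descent on the loss makes that sample more likely. Subtracting the baseline leaves the expected gradient unchanged but reduces its variance, which is the motivation the abstract gives.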

Pages: 285-289

Page count: 5
