CONTINUOUS SIGN LANGUAGE RECOGNITION VIA REINFORCEMENT LEARNING

被引:0
|
作者
Zhang, Zhihao [1 ]
Pu, Junfu [1 ]
Zhuang, Liansheng [1 ]
Zhou, Wengang [1 ]
Li, Houqiang [1 ]
机构
[1] Univ Sci & Technol China, EEIS Dept, CAS Key Lab Technol Geospatial Informat Proc & Ap, Hefei, Anhui, Peoples R China
来源
2019 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP) | 2019年
关键词
sign language recognition; reinforcement learning; self-critic;
D O I
10.1109/icip.2019.8802972
中图分类号
TB8 [摄影技术];
学科分类号
0804 ;
摘要
In this paper, we propose an approach to apply the Transformer with reinforcement learning (RL) for continuous sign language recognition (CSLR) task. The Transformer has an encoder-decoder structure, where the encoder network encodes the sign video into the context vector representation, while the decoder network generates the target sentence word by word based on the context vector. To avoid the intrinsic defects of supervised learning (SL) in our task, e.g., the exposure bias and non-differentiable task metrics issues, we propose to train the Transformer directly on non-differentiable metrics, i.e., word error rate (WER), through RL. Moreover, a policy gradient algorithm with baseline, which we call Self-critic REINFORCE, is employed to reduce variance while training. Experimental results on RWTH-PHOENIX-Weather benchmark verify the effectiveness of our method and demonstrate that our method achieves the comparable performance.
引用
收藏
页码:285 / 289
页数:5
相关论文
共 50 条
  • [1] Semantic Boundary Detection With Reinforcement Learning for Continuous Sign Language Recognition
    Wei, Chengcheng
    Zhao, Jian
    Zhou, Wengang
    Li, Houqiang
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2021, 31 (03) : 1138 - 1149
  • [2] Visual feature segmentation with reinforcement learning for continuous sign language recognition
    Fang, Yuchun
    Wang, Liangjun
    Lin, Shiquan
    Ni, Lan
    INTERNATIONAL JOURNAL OF MULTIMEDIA INFORMATION RETRIEVAL, 2023, 12 (02)
  • [3] Visual feature segmentation with reinforcement learning for continuous sign language recognition
    Yuchun Fang
    Liangjun Wang
    Shiquan Lin
    Lan Ni
    International Journal of Multimedia Information Retrieval, 2023, 12
  • [4] Boosting Continuous Sign Language Recognition via Cross Modality Augmentation
    Pu, Junfu
    Zhou, Wengang
    Hu, Hezhen
    Li, Houqiang
    MM '20: PROCEEDINGS OF THE 28TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, 2020, : 1497 - 1505
  • [5] Continuous Sign Language Recognition Based on Pseudo-supervised Learning
    Pei, Xiankun
    Guo, Dan
    Zhao, Ye
    PROCEEDINGS OF THE 2ND WORKSHOP ON MULTIMEDIA FOR ACCESSIBLE HUMAN COMPUTER INTERFACES (MAHCI '19), 2019, : 33 - 39
  • [6] Deep Learning Approaches for Continuous Sign Language Recognition: A Comprehensive Review
    Khan, Asma
    Jin, Seyong
    Lee, Geon-Hee
    Arzu, Gul E.
    Dang, L. Minh
    Nguyen, Tan N.
    Choi, Woong
    Moon, Hyeonjoon
    IEEE ACCESS, 2025, 13 : 55524 - 55544
  • [7] Self-Mutual Distillation Learning for Continuous Sign Language Recognition
    Hao, Aiming
    Min, Yuecong
    Chen, Xilin
    2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 11283 - 11292
  • [8] Pattern recognition considerations for continuous sign language recognition
    Sherry, G
    Foulds, R
    PROCEEDINGS OF THE IEEE 29TH ANNUAL NORTHEAST BIOENGINEERING CONFERENCE, 2003, : 291 - 293
  • [9] Continuous Sign Language Recognition Via Temporal Super-Resolution Network
    Zhu, Qidan
    Li, Jing
    Yuan, Fei
    Gan, Quan
    ARABIAN JOURNAL FOR SCIENCE AND ENGINEERING, 2023, 48 (08) : 10697 - 10711
  • [10] Continuous Sign Language Recognition Via Temporal Super-Resolution Network
    Qidan Zhu
    Jing Li
    Fei Yuan
    Quan Gan
    Arabian Journal for Science and Engineering, 2023, 48 : 10697 - 10711