Continuous Sign Language Recognition with Correlation Network

被引:43
|
作者
Hu, Lianyu [1 ]
Gao, Liqing [1 ]
Liu, Zekang [1 ]
Feng, Wei [1 ]
机构
[1] Tianjin Univ, Coll Intelligence & Comp, Tianjin 300350, Peoples R China
基金
中国国家自然科学基金;
关键词
D O I
10.1109/CVPR52729.2023.00249
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Human body trajectories are a salient cue to identify actions in the video. Such body trajectories are mainly conveyed by hands and face across consecutive frames in sign language. However, current methods in continuous sign language recognition (CSLR) usually process frames independently, thus failing to capture cross-frame trajectories to effectively identify a sign. To handle this limitation, we propose correlation network (CorrNet) to explicitly capture and leverage body trajectories across frames to identify signs. In specific, a correlation module is first proposed to dynamically compute correlation maps between the current frame and adjacent frames to identify trajectories of all spatial patches. An identification module is then presented to dynamically emphasize the body trajectories within these correlation maps. As a result, the generated features are able to gain an overview of local temporal movements to identify a sign. Thanks to its special attention on body trajectories, CorrNet achieves new state-of-the-art accuracy on four large-scale datasets, i.e., PHOENIX14, PHOENIX14-T, CSL-Daily, and CSL. A comprehensive comparison with previous spatial-temporal reasoning methods verifies the effectiveness of CorrNet. Visualizations demonstrate the effects of CorrNet on emphasizing human body trajectories across adjacent frames.
引用
收藏
页码:2529 / 2539
页数:11
相关论文
共 50 条
  • [1] Continuous Sign Language Recognition with Correlation Network
    Hu, Lianyu
    Gao, Liqing
    Liu, Zekang
    Feng, Wei
    arXiv, 2023,
  • [2] SLOWFAST NETWORK FOR CONTINUOUS SIGN LANGUAGE RECOGNITION
    Ahn, Junseok
    Jang, Youngjoon
    Chung, Joon Son
    2024 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, ICASSP 2024, 2024, : 3920 - 3924
  • [3] Iterative Alignment Network for Continuous Sign Language Recognition
    Pu, Junfu
    Zhou, Wengang
    Li, Houqiang
    2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 4160 - 4169
  • [4] Multiscale temporal network for continuous sign language recognition
    Zhu, Qidan
    Li, Jing
    Yuan, Fei
    Gan, Quan
    JOURNAL OF ELECTRONIC IMAGING, 2024, 33 (02)
  • [5] Self-Emphasizing Network for Continuous Sign Language Recognition
    Hu, Lianyu
    Gao, Liqing
    Liu, Zekang
    Feng, Wei
    THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 1, 2023, : 854 - 862
  • [6] Dynamical semantic enhancement network for continuous sign language recognition
    Wang, Suyang
    Guo, Leming
    Xue, Wanli
    MULTIMEDIA SYSTEMS, 2024, 30 (06)
  • [7] Selfie Continuous Sign Language Recognition using Neural Network
    Kumar, D. Anil
    Kishore, P. V. V.
    Sastry, A. S. C. S.
    Swamy, P. Reddy Gurunatha
    2016 IEEE ANNUAL INDIA CONFERENCE (INDICON), 2016,
  • [8] Continuous sign language recognition based on hierarchical memory sequence network
    Xue, Cuihong
    Jia, Jingli
    Yu, Ming
    Yan, Gang
    Guo, Yingchun
    Liu, Yuehao
    IET COMPUTER VISION, 2024, 18 (02) : 247 - 259
  • [9] Spatial-Temporal Enhanced Network for Continuous Sign Language Recognition
    Yin, Wenjie
    Hou, Yonghong
    Guo, Zihui
    Liu, Kailin
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 34 (03) : 1684 - 1695
  • [10] Dilated Convolutional Network with Iterative Optimization for Continuous Sign Language Recognition
    Pu, Junfu
    Zhou, Wengang
    Li, Houqiang
    PROCEEDINGS OF THE TWENTY-SEVENTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2018, : 885 - 891