Dilated Convolutional Network with Iterative Optimization for Continuous Sign Language Recognition

被引:0
|
作者
Pu, Junfu [1 ]
Zhou, Wengang [1 ]
Li, Houqiang [1 ]
机构
[1] Univ Sci & Technol China, EEIS Dept, CAS Key Lab Technol Geospatial Informat Proc & Ap, Hefei, Peoples R China
来源
PROCEEDINGS OF THE TWENTY-SEVENTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE | 2018年
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper presents a novel deep neural architecture with iterative optimization strategy for real-world continuous sign language recognition. Generally, a continuous sign language recognition system consists of visual input encoder for feature extraction and a sequence learning model to learn the correspondence between the input sequence and the output sentence-level labels. We use a 3D residual convolutional network (3D-ResNet) to extract visual features. After that, a stacked dilated convolutional network with Connectionist Temporal Classification (CTC) is applied for learning the mapping between the sequential features and the text sentence. The deep network is hard to train since the CTC loss has limited contribution to early CNN parameters. To alleviate this problem, we design an iterative optimization strategy to train our architecture. We generate pseudo-labels for video clips from sequence learning model with CTC, and fine-tune the 3D-ResNet with the supervision of pseudolabels for a better feature representation. We alternately optimize feature extractor and sequence learning model with iterative steps. Experimental results on RWTH-PHOENIX-Weather, a large real-world continuous sign language recognition benchmark, demonstrate the advantages and effectiveness of our proposed method.
引用
收藏
页码:885 / 891
页数:7
相关论文
共 50 条
  • [1] Iterative Alignment Network for Continuous Sign Language Recognition
    Pu, Junfu
    Zhou, Wengang
    Li, Houqiang
    2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 4160 - 4169
  • [2] Recurrent Convolutional Neural Networks for Continuous Sign Language Recognition by Staged Optimization
    Cui, Runpeng
    Liu, Hu
    Zhang, Changshui
    30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 1610 - 1618
  • [3] Continuous sign language recognition based on iterative alignment network and attention mechanism
    Xue, Cuihong
    Yu, Ming
    Yan, Gang
    Gao, Yang
    Liu, Yuehao
    MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 82 (11) : 17195 - 17212
  • [4] Continuous sign language recognition based on iterative alignment network and attention mechanism
    Cuihong Xue
    Ming Yu
    Gang Yan
    Yang Gao
    Yuehao Liu
    Multimedia Tools and Applications, 2023, 82 : 17195 - 17212
  • [5] SLOWFAST NETWORK FOR CONTINUOUS SIGN LANGUAGE RECOGNITION
    Ahn, Junseok
    Jang, Youngjoon
    Chung, Joon Son
    2024 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, ICASSP 2024, 2024, : 3920 - 3924
  • [6] Continuous Sign Language Recognition with Correlation Network
    Hu, Lianyu
    Gao, Liqing
    Liu, Zekang
    Feng, Wei
    2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR, 2023, : 2529 - 2539
  • [7] Continuous Sign Language Recognition with Correlation Network
    Hu, Lianyu
    Gao, Liqing
    Liu, Zekang
    Feng, Wei
    arXiv, 2023,
  • [8] Recognition Bangla Sign Language using Convolutional Neural Network
    Islalm, Md Shafiqul
    Rahman, Md Moklesur
    Rahman, Md. Hafizur
    Arifuzzaman, Md
    Sassi, Roberto
    Aktaruzzaman, Md
    2019 INTERNATIONAL CONFERENCE ON INNOVATION AND INTELLIGENCE FOR INFORMATICS, COMPUTING, AND TECHNOLOGIES (3ICT), 2019,
  • [9] Multiscale temporal network for continuous sign language recognition
    Zhu, Qidan
    Li, Jing
    Yuan, Fei
    Gan, Quan
    JOURNAL OF ELECTRONIC IMAGING, 2024, 33 (02)
  • [10] Indonesia Sign Language Recognition using Convolutional Neural Network
    Dwijayanti, Suci
    Hermawati
    Taqiyyah, Sahirah Inas
    Hikmarika, Hera
    Suprapto, Bhakti Yudho
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2021, 12 (10) : 415 - 422