An isolated Vietnamese Sign Language Recognition method using a fusion of Heatmap and Depth information based on Convolutional Neural Networks

被引:0
作者
Xuan-Phuoc Nguyen [1 ]
Thi-Huong Nguyen [2 ]
Duc-Tan Tran [3 ]
Tien-Son Bui [4 ]
Van-Toi Nguyen [3 ]
机构
[1] PHENIKAA Univ, Fac Comp Sci, Hanoi 12116, Vietnam
[2] Hanoi Coll High Technol, Hanoi, Vietnam
[3] PHENIKAA Univ, Fac Elect & Elect Engn, Hanoi 12116, Vietnam
[4] Hanoi Univ Ind, Hanoi, Vietnam
来源
2024 ASIA PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE, APSIPA ASC | 2024年
关键词
sign language recognition;
D O I
10.1109/APSIPAASC63619.2025.10848961
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In recent years, interest in sign language recognition has continuously increased. However, recognition methods for exploiting the combination of RGB and depth data are limited, especially applied to Vietnamese sign language. This paper presents an isolated Vietnamese sign language recognition method using a novel streams-enhanced 3D ConvNet. The experimental results demonstrate the superiority of the proposed method over other methods using variations from RGB, depth, and RGB-D data. The speed and accuracy of our method are better than those of previous methods.
引用
收藏
页数:6
相关论文
共 41 条
[1]  
Asghari-Esfeden S, 2020, IEEE WINT CONF APPL, P546, DOI 10.1109/WACV45572.2020.9093500
[2]   SkeleMotion: A New Representation of Skeleton Joint Sequences Based on Motion Information for 3D Action Recognition [J].
Caetano, Carlos ;
Sena, Jessica ;
Bremond, Francois ;
dos Santos, Jefersson A. ;
Schwartz, William Robson .
2019 16TH IEEE INTERNATIONAL CONFERENCE ON ADVANCED VIDEO AND SIGNAL BASED SURVEILLANCE (AVSS), 2019,
[3]   JOLO-GCN: Mining Joint-Centered Light-Weight Information for Skeleton-Based Action Recognition [J].
Cai, Jinmiao ;
Jiang, Nianjuan ;
Han, Xiaoguang ;
Jia, Kui ;
Lu, Jiangbo .
2021 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION WACV 2021, 2021, :2734-2743
[4]   Quo Vadis, Action Recognition? A New Model and the Kinetics Dataset [J].
Carreira, Joao ;
Zisserman, Andrew .
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :4724-4733
[5]   Channel-wise Topology Refinement Graph Convolution for Skeleton-Based Action Recognition [J].
Chen, Yuxin ;
Zhang, Ziqi ;
Yuan, Chunfeng ;
Li, Bing ;
Deng, Ying ;
Hu, Weiming .
2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, :13339-13348
[6]   PoTion: Pose MoTion Representation for Action Recognition [J].
Choutas, Vasileios ;
Weinzaepfel, Philippe ;
Revaud, Jerome ;
Schmid, Cordelia .
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, :7024-7033
[7]  
Courtney PG, 2015, IEEE COMP SEMICON
[8]  
Das Srijan, 2020, EUROPEAN C COMPUTER, P2
[9]  
Du Y, 2015, PROC CVPR IEEE, P1110, DOI 10.1109/CVPR.2015.7298714
[10]   X3D: Expanding Architectures for Efficient Video Recognition [J].
Feichtenhofer, Christoph .
2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, :200-210