Towards Indian Sign Language Sentence Recognition using INSIGNVID: Indian Sign Language Video Dataset

被引:0
作者
Mistree, Kinjal [1 ]
Thakor, Devendra [1 ]
Bhatt, Brijesh [2 ]
机构
[1] Uka Tarsadia Univ, CG Patel Inst Technol, Comp Engn Dept, Bardoli, India
[2] Dharmsinh Desai Univ, Dharmsinh Desai Inst Technol, Comp Engn Dept, Bardoli, India
关键词
Indian sign language; sign language recognition; pretrained models; transfer learning; vision-based approaches; GESTURE RECOGNITION;
D O I
10.14569/IJACSA.2021.0120881
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Sign language, a language used by Deaf community, is a fully visual language with its own grammar. The Deaf people find it very difficult to express their feelings to the other people, since the other people lack the knowledge of the sign language used by the Deaf community. Due to the differences in vocabulary and grammar of the sign languages, complete adoption of methods used for other international sign languages is not possible for Indian Sign Language (ISL) recognition. It is difficult to handle continuous sign language sentence recognition and translation into text as no large video dataset for ISL sentences is available. INSIGNVID - the first Indian Sign Language video dataset has been proposed and with this dataset as input, a novel approach is presented that converts video of ISL sentence in appropriate English sentence using transfer learning. The proposed approach gives promising results on our dataset with MobilNetV2 as pretrained model.
引用
收藏
页码:697 / 707
页数:11
相关论文
共 44 条
[1]  
Adithya V, 2013, 2013 IEEE CONFERENCE ON INFORMATION AND COMMUNICATION TECHNOLOGIES (ICT 2013), P1080
[2]  
Agrawal SC, 2012, 4TH INTERNATIONAL CONFERENCE ON INTELLIGENT HUMAN COMPUTER INTERACTION (IHCI 2012)
[3]   An efficient gesture based humanoid learning using wavelet descriptor and MFCC techniques [J].
Baranwal, Neha ;
Nandi, G. C. .
INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2017, 8 (04) :1369-1388
[4]  
Bhavsar Hemina, 2020, Emerging Technology Trends in Electronics, Communication and Networking: Third International Conference, ET2ECN 2020. Communications in Computer and Information Science (1214), P235, DOI 10.1007/978-981-15-7219-7_20
[5]   Antibacterial Effects and Mode of Action of Selected Essential Oils Components against Escherichia coli and Staphylococcus aureus [J].
Cesar Lopez-Romero, Julio ;
Gonzalez-Ros, Humberto ;
Borges, Anabela ;
Simoes, Manuel .
EVIDENCE-BASED COMPLEMENTARY AND ALTERNATIVE MEDICINE, 2015, 2015
[6]   Action recognition from depth sequences using weighted fusion of 2D and 3D auto-correlation of gradients features [J].
Chen, Chen ;
Zhang, Baochang ;
Hou, Zhenjie ;
Jiang, Junjun ;
Liu, Mengyuan ;
Yang, Yun .
MULTIMEDIA TOOLS AND APPLICATIONS, 2017, 76 (03) :4651-4669
[7]  
Er-Rady Adil, 2017, INT C ADV TECHN SIGN, P1
[8]  
Forster J, 2014, LREC 2014 - NINTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, P1911
[9]  
Forster Jens, 2012, LREC
[10]   Dynamic Hand Gesture Recognition and Novel Sentence Interpretation Algorithm for Indian Sign Language Using Microsoft Kinect Sensor [J].
Ghotkar, Archana S. ;
Kharate, Gajanan K. .
JOURNAL OF PATTERN RECOGNITION RESEARCH, 2015, 10 (01) :24-38