Sign language detection dataset: A resource for AI-based recognition systems

被引:0
作者
Garg, Bindu [1 ]
Kasar, Manisha [1 ]
Paygude, Priyanka [1 ]
Dhumane, Amol [2 ]
Ambala, Srinivas [3 ]
Rajpurohit, Jitendra [2 ]
Sharma, Abhay [4 ]
Meshram, Vidula [5 ]
Vats, Amber [1 ]
Kashyap, Achyut [1 ]
机构
[1] Bharati Vidyapeeth Deemed Univ, Coll Engn, Pune, India
[2] Symbiosis Inst Technol, Pune, India
[3] Pimpri Chinchwad Coll Engn, Pune, India
[4] Manipal Univ Jaipur, Jaipur, India
[5] Vishwakarma Inst Technol, Pune, India
关键词
American Sign Language; Deep Learning; Convolutional Neural Network; Sign Language Recognition;
D O I
10.1016/j.dib.2025.111703
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Sign language is a very important mode of communication among deaf and hard-of-hearing populations. Automatic sign language detection based on deep learning model is the theme of this study. Hand gestures are classified by the Convolutional Neural Network (CNN) model to different signs. For training purposes, there are 26,0 0 0 images available with 30 0 0 images for every alphabet letter such that there is complete representation of sign language gesture. Photos were taken in controlled lighting with a consistent black background to facilitate better feature extraction. The data contains varied participants of various age groups, skin types, and hand shapes to enhance generalization. Data collection was standardized through iPhone 15 Pro Max, black background cloth, tripod stand, and remote-controlled Drodcam app to maintain consistency in image quality and framing. For diversity and realism, three participants were involved in data collection, each providing 10 0 0 images per sign, resulting in a rich and diverse dataset. Preprocessing of data methods were used for achieving the best quality of data, such as resizing, conversion to grayscale, normalization, and augmentation. Different techniques of data augmentation like rotation, flipping, scaling, brightness change, and addition of Gaussian noise were used to introduce variations in hand gestures and make the model robust against various envi-ronmental conditions. The dataset was then partitioned into 70 % training, 15 % validation, and 15 % test sets for max-imizing model performance and ensuring good generaliza-tion. The dataset show high accuracy, reflecting the potential of the model for real-world usage, such as accessibility tools for the deaf community, educational tools, and real-time sign language recognition systems. (c) 2025 The Author(s). Published by Elsevier Inc. This is an open access article under the CC BY-NC license (http://creativecommons.org/licenses/by-nc/4.0/)
引用
收藏
页数:12
相关论文
共 13 条
[1]   Varying views of maxillary and mandibular aspects of teeth: A dataset [J].
Chaudhary, Shweta ;
Shah, Preetam ;
Paygude, Priyanka ;
Chiwhane, Shwetambari ;
Mahajan, Pratibha ;
Chavan, Prashant ;
Kasar, Manisha .
DATA IN BRIEF, 2024, 56
[2]   Image captioning by diffusion models: A survey [J].
Daneshfar, Fatemeh ;
Bartani, Ako ;
Lotfi, Pardis .
ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2024, 138
[3]  
Garg B., 2025, **DATA OBJECT**, pV1, DOI 10.17632/8fmvr9m98w.1
[4]   A novel framework for aspect based sentiment analysis using a hybrid BERT (HybBERT) model [J].
Goud, Anushree ;
Garg, Bindu .
MULTIMEDIA TOOLS AND APPLICATIONS, 2023,
[5]  
Karande Shital, 2024, Neural Computing and Applications, V36, P11919, DOI [10.1007/s00521-024-09670-6, 10.1007/s00521-024-09670-6]
[6]   EmoSense: Pioneering Facial Emotion Recognition with Precision Through Model Optimization and Face Emotion Constraints [J].
Kasar, M. ;
Kavimandan, P. ;
Suryawanshi, T. ;
Garg, B. .
INTERNATIONAL JOURNAL OF ENGINEERING, 2025, 38 (01) :35-45
[7]  
Orovwode H., 2023, INT C ART INT BIG DA, P1, DOI [DOI 10.1109/ICABCD59051.2023.10220456, 10.1109/icabcd59051.2023.10220456]
[8]   A dataset revolutionizing Indian bay leaf analysis [J].
Paygude, Priyanka ;
Thite, Sandip ;
Kumar, Ajay ;
Bhosle, Amol ;
Pawar, Rajendra ;
Mane, Renuka ;
Joshi, Rahul ;
Kasar, Manisha ;
Chavan, Prashant ;
Gayakwad, Milind .
DATA IN BRIEF, 2024, 57
[9]   Dried fish dataset for Indian seafood: A machine learning application [J].
Paygude, Priyanka ;
Gayakwad, Milind ;
Wategaonkar, Dhanashri ;
Pawar, Rajendra ;
Pujeri, Ramchandra ;
Joshi, Rahul .
DATA IN BRIEF, 2024, 55
[10]  
public.roboflow, About us