A comparative evaluation of machine learning and deep learning algorithms for question categorization of VQA datasets

被引:2
作者
Asudani, Deepak Suresh [1 ]
Nagwani, Naresh Kumar [1 ]
Singh, Pradeep [1 ]
机构
[1] Natl Inst Technol, Dept Comp Sci & Engn, Raipur, Chhattisgarh, India
关键词
Question Classification; Machine Learning; Deep Learning; SMOTE; BERT-based Transformers;
D O I
10.1007/s11042-023-17797-2
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Question classification primarily involves categorizing questions based on the type of answer, with less emphasis on the words or phrases used to form the query. Question classification is crucial in the Visual Question Answering (VQA) system, and the dataset's quality plays an essential role in the system's development. The available question categorization in the VQA and TDIUC datasets shows imbalance, and the VQA model trained on imbalanced datasets performs poorly in handling language-prior problems, failing to categorize questions, and predicting incorrect outcomes. Therefore, developing a better classification method for classifying questions into appropriate categories based on phrases is necessary. This paper examines the effectiveness of the synthetic minority oversampling technique (SMOTE) in addressing the class imbalance problem within the question classification task using the LSTM, selected machine learning models and BERT-based transformer model. The preprocessing and analysis module efficiently categorizes input question sets by identifying valuable phrases and obtaining an evenly distributed dataset based on question categories from both datasets. The performance evaluation of Naive Bayes, SVM, Random Forests, and XGBoost models shows that the XGBoost model outperforms other selected classifiers, and the LSTM model achieves higher accuracy but requires more computation time. The empirical assessment indicates that the BERT-based transformer model exceeds the traditional models employed for comparison. The ablation study also reveals that utilizing SMOTE techniques for question classification tasks achieves slightly improved accuracy at the expense of higher computation time and resources. It is concluded that the BERT-based transformer model efficiently and precisely performs question classification tasks.
引用
收藏
页码:57829 / 57859
页数:31
相关论文
共 66 条
[1]   Deep learning-based question answering: a survey [J].
Abdel-Nabi, Heba ;
Awajan, Arafat ;
Ali, Mostafa Z. .
KNOWLEDGE AND INFORMATION SYSTEMS, 2023, 65 (04) :1399-1485
[2]  
Abdullah M.I., 2023, 2023 IEEE 14 CONTR S, P93, DOI DOI 10.1109/ICSGRC57744.2023.10215477
[3]  
[Anonymous], 2010, J AM DENT ASSOC, V141, P658
[4]   VQA: Visual Question Answering [J].
Antol, Stanislaw ;
Agrawal, Aishwarya ;
Lu, Jiasen ;
Mitchell, Margaret ;
Batra, Dhruv ;
Zitnick, C. Lawrence ;
Parikh, Devi .
2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, :2425-2433
[5]   Impact of word embedding models on text analytics in deep learning environment: a review [J].
Asudani, Deepak Suresh ;
Nagwani, Naresh Kumar ;
Singh, Pradeep .
ARTIFICIAL INTELLIGENCE REVIEW, 2023, 56 (09) :10345-10425
[6]   Exploring the effectiveness of word embedding based deep learning model for improving email classification [J].
Asudani, Deepak Suresh ;
Nagwani, Naresh Kumar ;
Singh, Pradeep .
DATA TECHNOLOGIES AND APPLICATIONS, 2022, 56 (04) :483-505
[7]  
Banerjee S, 2012, P WORK QUEST ANSW CO
[8]   E-mail classification with machine learning and word embeddings for improved customer support [J].
Borg, Anton ;
Boldt, Martin ;
Rosander, Oliver ;
Ahlstrand, Jim .
NEURAL COMPUTING & APPLICATIONS, 2021, 33 (06) :1881-1902
[9]  
Bu Qiong, 2019, International Journal of Crowd Science, V3, P222, DOI [10.1108/ijcs-06-2019-0017, 10.1108/IJCS-06-2019-0017]
[10]   Review of artificial intelligence-based question-answering systems in healthcare [J].
Budler, Leona Cilar ;
Gosak, Lucija ;
Stiglic, Gregor .
WILEY INTERDISCIPLINARY REVIEWS-DATA MINING AND KNOWLEDGE DISCOVERY, 2023, 13 (02)