Semi-automated Software Requirements Categorisation using Machine Learning Algorithms

被引:0
作者
Talele, Pratvina [1 ]
Apte, Siddharth [1 ]
Phalnikar, Rashmi [1 ]
Talele, Harsha [2 ]
机构
[1] Dr Vishwanath Karad MIT World Peace Univ, Dept Comp Engn & Technol, Pune, Maharashtra, India
[2] Pimpri Chinchwad Coll Engn, Dept Comp Engn, Pune, Maharashtra, India
关键词
Natural Language Processing; Machine Learning; Software Engineering; Supervised Machine Learning; CLASSIFICATION;
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Requirement engineering is a mandatory phase of the Software development life cycle (SDLC) that includes defining and documenting system requirements in the Software Requirements Specification (SRS). As the complexity increases, it becomes difficult to categorise the requirements intofunctional andnon-functional requirements.Presently,the dearthofautomated techniques necessitates reliance on labour-intensive and time- consuming manual methods for this purpose. This research endeavours to address this gap by investigating and contrasting two prominent feature extraction techniques and their efficacy in automating the classification of requirements. Natural language processing methods are used in the text pre-processing phase, followed by the Term Frequency - Inverse Document Frequency (TF-IDF) and Word2Vec for feature extraction for further understanding. These features are used as input to the Machine Learning algorithms. This study compares existing machine learning algorithms and discusses their correctness in categorising the software requirements. In our study, we have assessed the algorithms Decision Tree (DT), Random Forest (RF), Logistic Regression (LR), Neural Network (NN), K-Nearest Neighbour (KNN) and Support Vector Machine (SVM) on the precision and accuracy parameters. The results obtained in this study showed that the TF-IDF feature selection algorithm performed better in categorising requirements than the Word2Vec algorithm, with an accuracy of 91.20% for the Support Vector Machine (SVM) and Random Forest algorithm as compared to 87.36% for the SVM algorithm. A 3.84% difference is seen between the two when applied to the publicly available PURE dataset. We believe these results will aid developers in building products that aid in requirement engineering.
引用
收藏
页码:1107 / 1114
页数:8
相关论文
共 25 条
[1]  
Althanoon AAA, 2021, Technium Romanian Journal of Applied Sciences and Technology, V3, P32, DOI 10.47577/technium.v3i11.5417
[2]   An end-to-end deep learning system for requirements classification using recurrent neural networks [J].
AlDhafer, Osamah ;
Ahmad, Irfan ;
Mahmood, Sajjad .
INFORMATION AND SOFTWARE TECHNOLOGY, 2022, 147
[3]  
Alrumaih H, 2018, 2018 21ST SAUDI COMPUTER SOCIETY NATIONAL COMPUTER CONFERENCE (NCC)
[4]  
Apte S., Lecture Notes in Networks and Systems, V719
[5]   Automatic Multi-class Non-Functional Software Requirements Classification Using Neural Networks [J].
Baker, Cody ;
Deng, Lin ;
Chakraborty, Suranjan ;
Dehlinger, Josh .
2019 IEEE 43RD ANNUAL COMPUTER SOFTWARE AND APPLICATIONS CONFERENCE (COMPSAC), VOL 2, 2019, :610-615
[6]  
Binkhonain M, 2019, Expert Systems with Applications X, V1, P100001, DOI [10.1016/j.eswax.2019.100001, 10.1016/j.eswax.2019.100001, DOI 10.1016/J.ESWAX.2019.100001]
[7]   Identification of non-functional requirements in textual specifications: A semi-supervised learning approach [J].
Casamayor, Agustin ;
Godoy, Daniela ;
Campo, Marcelo .
INFORMATION AND SOFTWARE TECHNOLOGY, 2010, 52 (04) :436-445
[8]  
cio, Fixing the Software Requirements Mess
[9]  
cnr, Natural Language Requirements Dataset
[10]   Software Requirements Classification Using Machine Learning Algorithms [J].
Dias Canedo, Edna ;
Cordeiro Mendes, Bruno .
ENTROPY, 2020, 22 (09)