A machine learning approach for hierarchical classification of software requirements

Cited by: 4
Authors
Binkhonain, Manal [1]
Zhao, Liping [2]
Affiliations
[1] King Saud Univ, Coll Comp & Informat Sci, Riyadh, Saudi Arabia
[2] Univ Manchester, Dept Comp Sci, Manchester M13 9PL, England
Source
MACHINE LEARNING WITH APPLICATIONS | 2023 / Volume 12
Keywords
Requirements engineering; Requirements classification; Machine learning; Hierarchical classification; Imbalanced classes; High Dimensional Data with Low Sample Size (HDLSS); Feature selection; Text classification
DOI
10.1016/j.mlwa.2023.100457
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Context: Classification of software requirements into different categories is a critically important task in requirements engineering (RE). Developing machine learning (ML) approaches for requirements classification has attracted great interest in the RE community since the 2000s. Objective: This paper aims to address two related problems that have been challenging real-world applications of ML approaches: class imbalance and high-dimensional data with low sample size (HDLSS). These problems can greatly degrade the classification performance of ML methods. Methods: The paper proposes HC4RC, a novel ML approach for multiclass classification of requirements. HC4RC addresses the aforementioned problems through semantic-role-based feature selection, dataset decomposition, and hierarchical classification. We experimentally compare the effectiveness of HC4RC with three closely related approaches, two of which are based on a traditional statistical classification model and one of which uses an advanced deep learning model. Results: Our experiment shows: (1) The class imbalance and HDLSS problems present a challenge to both traditional and advanced ML approaches. (2) The HC4RC approach is simple to use and can effectively address the class imbalance and HDLSS problems compared to similar approaches. Conclusion: This paper makes an important practical contribution to addressing the class imbalance and HDLSS problems in multiclass classification of software requirements.
Pages: 12