Siamese Neural Networks Method for Semantic Requirements Similarity Detection

被引:0
作者
Alnajem, Nojoom A. [1 ]
Binkhonain, Manal [1 ]
Hossain, M. Shamim [1 ]
机构
[1] King Saud Univ, Coll Comp & Informat Sci, Dept Software Engn, Riyadh, Saudi Arabia
来源
IEEE ACCESS | 2024年 / 12卷
关键词
Semantics; Transformers; Accuracy; Vectors; Software; Long short term memory; Requirements engineering; Computer architecture; Natural language processing; XML; Artificial intelligence; Neural networks; Artificial intelligence for requirements engineering; large language models; long short-term memory networks; requirements; requirements engineering; requirements similarity; semantic requirements similarity; Siamese neural networks; similarity; transformer models;
D O I
10.1109/ACCESS.2024.3469636
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Detecting semantic similarity between textual requirements is a crucial task for various natural language processing (NLP)-based requirements engineering (RE) applications. It is also challenging due to the nature of these requirements, which are written in natural language (NL), include domain knowledge, and often follow pre-defined templates that contain duplicated words. Recently, deep neural networks (DNNs) have shown promising results in measuring semantic similarity between texts. Siamese neural networks (SNNs), a class of DNNs, are widely used for measuring similarity between various data types, demonstrating their capability and independence of language and domain. Nevertheless, SNNs have a limited use in measuring semantic requirements similarity (SRS). In this paper, a novel metric-based learning method is proposed using SNNs that combines a sentence Transformer model (LLM) and long short-term memory (LSTM) networks with a backward network layer to measure semantic similarity between pairs of requirements. The proposed method is evaluated on an annotated SRS dataset that was built based on public datasets (i.e., PROMISE and PURE) and compared with other state-of-the-art methods (i.e., fine-tuning and zero-shot methods) using accuracy, precision, recall, and F1-score classification metrics. The results show that the proposed method achieved an accuracy of 95.42% and an F1-score of 95.71%, outperforming the state-of-the-art methods.
引用
收藏
页码:140932 / 140947
页数:16
相关论文
共 50 条
[21]   Using Semantic Information for Coreference Resolution with Neural Networks in Russian [J].
Azerkovich, Ilya .
ANALYSIS OF IMAGES, SOCIAL NETWORKS AND TEXTS (AIST 2019), 2020, 1086 :85-93
[22]   Signature Recognition using Siamese Neural Networks [J].
Krishna, Voruganti Ajay ;
Reddy, AtthapuramAkshay ;
Nagajyothi, D. .
2021 IEEE INTERNATIONAL CONFERENCE ON MOBILE NETWORKS AND WIRELESS COMMUNICATIONS (ICMNWC), 2021,
[23]   BINARY HASHING USING SIAMESE NEURAL NETWORKS [J].
Jose, Abin ;
Yan, Shen ;
Heisterklaus, Iris .
2017 24TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2017, :2916-2920
[24]   Neural Networks Merging Semantic and Non-semantic Features for Opinion Spam Detection [J].
Jiang, Chengzhi ;
Zhang, Xianguo .
NATURAL LANGUAGE PROCESSING AND CHINESE COMPUTING (NLPCC 2019), PT I, 2019, 11838 :583-595
[25]   Class-balanced siamese neural networks [J].
Berlemont, Samuel ;
Lefebvre, Gregoire ;
Duffner, Stefan ;
Garcia, Christophe .
NEUROCOMPUTING, 2018, 273 :47-56
[26]   SViG: A Similarity-Thresholded Approach for Vision Graph Neural Networks [J].
Elsharkawi, Ismael ;
Sharara, Hossam ;
Rafea, Ahmed .
IEEE ACCESS, 2025, 13 :19379-19387
[27]   The application of the connectionist method of semantic similarity for kazakh language [J].
Kalimoldayev, Maksat N. ;
Koibagarov, Kairat Ch. ;
Pak, Alexandr A. ;
Zharmagambetov, Arman S. .
2015 TWELVE INTERNATIONAL CONFERENCE ON ELECTRONICS COMPUTER AND COMPUTATION (ICECCO), 2015, :60-62
[28]   Sentence Embedding and Convolutional Neural Network for Semantic Textual Similarity Detection in Arabic Language [J].
Mahmoud, Adnen ;
Zrigui, Mounir .
ARABIAN JOURNAL FOR SCIENCE AND ENGINEERING, 2019, 44 (11) :9263-9274
[29]   Sentence Embedding and Convolutional Neural Network for Semantic Textual Similarity Detection in Arabic Language [J].
Adnen Mahmoud ;
Mounir Zrigui .
Arabian Journal for Science and Engineering, 2019, 44 :9263-9274
[30]   Neural sentence embedding models for semantic similarity estimation in the biomedical domain [J].
Blagec, Kathrin ;
Xu, Hong ;
Agibetov, Asan ;
Samwald, Matthias .
BMC BIOINFORMATICS, 2019, 20 (1)