Siamese Neural Networks Method for Semantic Requirements Similarity Detection

被引:0
|
作者
Alnajem, Nojoom A. [1 ]
Binkhonain, Manal [1 ]
Hossain, M. Shamim [1 ]
机构
[1] King Saud Univ, Coll Comp & Informat Sci, Dept Software Engn, Riyadh, Saudi Arabia
来源
IEEE ACCESS | 2024年 / 12卷
关键词
Semantics; Transformers; Accuracy; Vectors; Software; Long short term memory; Requirements engineering; Computer architecture; Natural language processing; XML; Artificial intelligence; Neural networks; Artificial intelligence for requirements engineering; large language models; long short-term memory networks; requirements; requirements engineering; requirements similarity; semantic requirements similarity; Siamese neural networks; similarity; transformer models;
D O I
10.1109/ACCESS.2024.3469636
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Detecting semantic similarity between textual requirements is a crucial task for various natural language processing (NLP)-based requirements engineering (RE) applications. It is also challenging due to the nature of these requirements, which are written in natural language (NL), include domain knowledge, and often follow pre-defined templates that contain duplicated words. Recently, deep neural networks (DNNs) have shown promising results in measuring semantic similarity between texts. Siamese neural networks (SNNs), a class of DNNs, are widely used for measuring similarity between various data types, demonstrating their capability and independence of language and domain. Nevertheless, SNNs have a limited use in measuring semantic requirements similarity (SRS). In this paper, a novel metric-based learning method is proposed using SNNs that combines a sentence Transformer model (LLM) and long short-term memory (LSTM) networks with a backward network layer to measure semantic similarity between pairs of requirements. The proposed method is evaluated on an annotated SRS dataset that was built based on public datasets (i.e., PROMISE and PURE) and compared with other state-of-the-art methods (i.e., fine-tuning and zero-shot methods) using accuracy, precision, recall, and F1-score classification metrics. The results show that the proposed method achieved an accuracy of 95.42% and an F1-score of 95.71%, outperforming the state-of-the-art methods.
引用
收藏
页码:140932 / 140947
页数:16
相关论文
共 50 条
  • [1] Measuring Semantic Similarity Between Sentences Using a Siamese Neural Network
    Ichida, Alexandre Yukio
    Meneguzzi, Felipe
    Ruiz, Duncan D.
    2018 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2018,
  • [2] Using Siamese BiLSTM Models for Identifying Text Semantic Similarity
    Fradelos, Georgios
    Perikos, Isidoros
    Hatzilygeroudis, Ioannis
    ARTIFICIAL INTELLIGENCE APPLICATIONS AND INNOVATIONS. AIAI 2023 IFIP WG 12.5 INTERNATIONAL WORKSHOPS, 2023, 677 : 381 - 392
  • [3] Plagiarism Detection of Multi-Threaded Programs via Siamese Neural Networks
    Tian, Zhenzhou
    Wang, Qing
    Gao, Cong
    Chen, Lingwei
    Wu, Dinghao
    IEEE ACCESS, 2020, 8 (08): : 160802 - 160814
  • [4] Siamese Neural Networks for Class Activity Detection
    Li, Hang
    Wang, Zhiwei
    Tang, Jiliang
    Ding, Wenbiao
    Liu, Zitao
    ARTIFICIAL INTELLIGENCE IN EDUCATION (AIED 2020), PT II, 2020, 12164 : 162 - 167
  • [5] Asymmetric Siamese Networks for Semantic Change Detection in Aerial Images
    Yang, Kunping
    Xia, Gui-Song
    Liu, Zicheng
    Du, Bo
    Yang, Wen
    Pelillo, Marcello
    Zhang, Liangpei
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60
  • [6] Modeling Functional Similarity in Source Code With Graph-Based Siamese Networks
    Mehrotra, Nikita
    Agarwal, Navdha
    Gupta, Piyush
    Anand, Saket
    Lo, David
    Purandare, Rahul
    IEEE TRANSACTIONS ON SOFTWARE ENGINEERING, 2022, 48 (10) : 3771 - 3789
  • [7] Exploiting Siamese Neural Networks on Short Text Similarity Tasks for Multiple Domains and Languages
    Andrioli de Souza, Joao Vitor
    Oliveira, Lucas Emanuel Silva E.
    Gumiel, Yohan Bonescki
    Carvalho, Deborah Ribeiro
    Cabral Moro, Claudia Maria
    COMPUTATIONAL PROCESSING OF THE PORTUGUESE LANGUAGE, PROPOR 2020, 2020, 12037 : 357 - 367
  • [8] Monolingual Sentence Similarity Measurement using Siamese Neural Networks for Sinhala and Tamil Languages
    Nilaxan, Satkunanantham
    Ranathunga, Surangika
    MORATUWA ENGINEERING RESEARCH CONFERENCE (MERCON 2021) / 7TH INTERNATIONAL MULTIDISCIPLINARY ENGINEERING RESEARCH CONFERENCE, 2021, : 567 - 572
  • [9] Unsupervised Mobile User Behavior Detection Based on Siamese Neural Networks
    Liu, Yao
    Liu, Lu
    Liu, Qiao
    Lan, Tian
    Bai, Xiaoyu
    Zhou, Le
    2023 IEEE INTERNATIONAL CONFERENCES ON INTERNET OF THINGS, ITHINGS IEEE GREEN COMPUTING AND COMMUNICATIONS, GREENCOM IEEE CYBER, PHYSICAL AND SOCIAL COMPUTING, CPSCOM IEEE SMART DATA, SMARTDATA AND IEEE CONGRESS ON CYBERMATICS,CYBERMATICS, 2024, : 518 - 523
  • [10] Building siamese attention-augmented recurrent convolutional neural networks for document similarity scoring
    Han, Sifei
    Shi, Lingyun
    Richie, Russell
    Tsui, Fuchiang R. Rich
    INFORMATION SCIENCES, 2022, 615 : 90 - 102