Analysis and safety engineering of fuzzy string matching algorithms

被引:5
作者
Pikies, Malgorzata [1 ]
Ali, Junade [1 ]
机构
[1] Cloudflare, London, England
关键词
String similarity; Fuzzy string matching; Safety engineering; Natural language processing; Binary classification; Neural network;
D O I
10.1016/j.isatra.2020.10.014
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In this paper we explore fuzzy string matching in an automatic ticket classification and processing system. We compare performance of the following string similarity algorithms: Longest Common Subsequence (LCS), Dice coefficient, Cosine Similarity, Levenshtein (edit) distance and Damerau distance. Through optimisation, we accomplished a 15% improvement in the ratio of false positives to true positive classifications over the existing approach used by a customer support system for free customers. To introduce greater safety; we compliment fuzzy string matching algorithms with a second layer Convolutional Neural Network (CNN) binary classifier, achieving an improved keyword classification ratio for two ticket categories by a relative 69% and 78%. Such an approach allows for classification to only be applied where a desired level of safety achieved, such as in instances where automated answers. (C) 2020 ISA. Published by Elsevier Ltd. All rights reserved.
引用
收藏
页码:1 / 8
页数:8
相关论文
共 50 条
  • [31] Labview technology application for safety engineering
    Zhou F.-J.
    Yue X.-G.
    Yue, Xiao-Guang, 2016, American Scientific Publishers (13) : 5258 - 5261
  • [32] Assessment of Safety Processes in Requirements Engineering
    Vilela, Jessyka
    Castro, Jaelson
    Martins, Luiz Eduardo G.
    Gorschek, Tony
    2018 IEEE 26TH INTERNATIONAL REQUIREMENTS ENGINEERING CONFERENCE (RE 2018), 2018, : 358 - 363
  • [33] Novel elegant fuzzy genetic algorithms in classification problems
    Venkatanareshbabu, K.
    Nisheel, S.
    Sakthivel, R.
    Muralitharan, K.
    SOFT COMPUTING, 2019, 23 (14) : 5583 - 5603
  • [34] Novel elegant fuzzy genetic algorithms in classification problems
    K. Venkatanareshbabu
    S. Nisheel
    R. Sakthivel
    K. Muralitharan
    Soft Computing, 2019, 23 : 5583 - 5603
  • [35] An innovative integrated modelling of safety data using multiple correspondence analysis and fuzzy discretization techniques
    Dhalmahapatra, Krantiraditya
    Shingade, Rohan
    Maiti, J.
    SAFETY SCIENCE, 2020, 130
  • [36] A Review: Applications of Fuzzy Theory in Rock Engineering
    Namin, Farhad Samimi
    Rouhani, Mohammad Matin
    INDIAN GEOTECHNICAL JOURNAL, 2025, 55 (01) : 408 - 425
  • [37] Safety Practices in Requirements Engineering: The Uni-REPM Safety Module
    Vilela, Jessyka
    Castro, Jaelson
    Martins, Luiz Eduardo G.
    Gorschek, Tony
    IEEE TRANSACTIONS ON SOFTWARE ENGINEERING, 2020, 46 (03) : 222 - 250
  • [38] Application of machine learning algorithms in the domain of financial engineering
    Liu, Xiang
    Salem, Sultan
    Bian, Lijun
    Seong, Jin-Taek
    Alshanbari, Huda M.
    ALEXANDRIA ENGINEERING JOURNAL, 2024, 95 : 94 - 100
  • [39] Universe-inspired algorithms for control engineering: A review
    Bernardo, Rodrigo M. C.
    Torres, Delfim F. M.
    Herdeiro, Carlos A. R.
    dos Santos, Marco P. Soares
    HELIYON, 2024, 10 (11)
  • [40] Shallow semantic labeling using two-phase feature-enhanced string matching
    Chan, Samuel W. K.
    EXPERT SYSTEMS WITH APPLICATIONS, 2009, 36 (06) : 9729 - 9736