Analysis and safety engineering of fuzzy string matching algorithms

Cited by: 5
Authors
Pikies, Malgorzata [1]
Ali, Junade [1]
Affiliation
[1] Cloudflare, London, England
Keywords
String similarity; Fuzzy string matching; Safety engineering; Natural language processing; Binary classification; Neural network
DOI
10.1016/j.isatra.2020.10.014
Chinese Library Classification number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
In this paper we explore fuzzy string matching in an automatic ticket classification and processing system. We compare the performance of the following string similarity algorithms: Longest Common Subsequence (LCS), Dice coefficient, Cosine Similarity, Levenshtein (edit) distance and Damerau distance. Through optimisation, we accomplished a 15% improvement in the ratio of false positive to true positive classifications over the existing approach used by a customer support system for free customers. To introduce greater safety, we complement the fuzzy string matching algorithms with a second-layer Convolutional Neural Network (CNN) binary classifier, improving the keyword classification ratio for two ticket categories by a relative 69% and 78%. Such an approach allows classification to be applied only where a desired level of safety is achieved, such as in instances involving automated answers. © 2020 ISA. Published by Elsevier Ltd. All rights reserved.
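For readers unfamiliar with the five similarity measures named in the abstract, the following is a minimal Python sketch of each, plus a toy keyword matcher. It is not the authors' implementation. Several choices are assumptions made for illustration: Dice and cosine similarity are computed over character bigrams, the Damerau distance is the optimal string alignment variant, and the helper `keyword_matches` with its `max_distance` threshold is hypothetical.

```python
from collections import Counter
from math import sqrt


def bigrams(s: str) -> list[str]:
    """Character bigrams (assumed tokenisation), e.g. 'abc' -> ['ab', 'bc']."""
    return [s[i:i + 2] for i in range(len(s) - 1)]


def dice_coefficient(a: str, b: str) -> float:
    """Dice coefficient over bigram multisets: 2 * overlap / total size."""
    x, y = Counter(bigrams(a)), Counter(bigrams(b))
    total = sum(x.values()) + sum(y.values())
    if total == 0:  # both strings shorter than two characters
        return 1.0 if a == b else 0.0
    return 2 * sum((x & y).values()) / total


def cosine_similarity(a: str, b: str) -> float:
    """Cosine of the angle between bigram count vectors."""
    x, y = Counter(bigrams(a)), Counter(bigrams(b))
    dot = sum(x[g] * y[g] for g in x)
    norm = sqrt(sum(v * v for v in x.values())) * sqrt(sum(v * v for v in y.values()))
    return dot / norm if norm else 0.0


def lcs_length(a: str, b: str) -> int:
    """Length of the longest common subsequence (row-by-row dynamic programming)."""
    prev = [0] * (len(b) + 1)
    for ca in a:
        cur = [0]
        for j, cb in enumerate(b, 1):
            cur.append(prev[j - 1] + 1 if ca == cb else max(prev[j], cur[j - 1]))
        prev = cur
    return prev[-1]


def levenshtein(a: str, b: str) -> int:
    """Edit distance counting insertions, deletions and substitutions."""
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, 1):
        cur = [i]
        for j, cb in enumerate(b, 1):
            cur.append(min(prev[j] + 1, cur[j - 1] + 1, prev[j - 1] + (ca != cb)))
        prev = cur
    return prev[-1]


def damerau_osa(a: str, b: str) -> int:
    """Damerau distance, optimal string alignment variant (an assumption):
    Levenshtein plus transposition of two adjacent characters."""
    d = [[0] * (len(b) + 1) for _ in range(len(a) + 1)]
    for i in range(len(a) + 1):
        d[i][0] = i
    for j in range(len(b) + 1):
        d[0][j] = j
    for i in range(1, len(a) + 1):
        for j in range(1, len(b) + 1):
            cost = a[i - 1] != b[j - 1]
            d[i][j] = min(d[i - 1][j] + 1, d[i][j - 1] + 1, d[i - 1][j - 1] + cost)
            if i > 1 and j > 1 and a[i - 1] == b[j - 2] and a[i - 2] == b[j - 1]:
                d[i][j] = min(d[i][j], d[i - 2][j - 2] + 1)
    return d[-1][-1]


def keyword_matches(text: str, keyword: str, max_distance: int = 1) -> bool:
    """Hypothetical matcher: True if any word is within max_distance edits."""
    return any(damerau_osa(w, keyword) <= max_distance for w in text.lower().split())
```

As a usage example, `keyword_matches("my pasword is broken", "password")` returns `True` because "pasword" is one insertion away from the keyword. Raising `max_distance` catches more misspellings at the cost of more false positives, which is the trade-off the abstract's second-layer classifier is meant to mitigate.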
Pages: 1-8
Number of pages: 8
Related papers
50 in total
  • [21] Fuzzy Clustering Algorithms - Review of the Applications
    Li, Jiamin
    Lewis, Harold W.
    2016 IEEE INTERNATIONAL CONFERENCE ON SMART CLOUD (SMARTCLOUD), 2016, : 282 - 288
  • [22] Safety engineering for autonomous vehicles
    Adler, Rasmus
    Feth, Patrik
    Schneider, Daniel
    2016 46TH ANNUAL IEEE/IFIP INTERNATIONAL CONFERENCE ON DEPENDABLE SYSTEMS AND NETWORKS WORKSHOPS (DSN-W), 2016, : 200 - 205
  • [23] Advances in process safety engineering
    Schupp, BA
    Pasman, HJ
    Lemkowitz, SM
    PROGRESS IN SAFETY SCIENCE AND TECHNOLOGY, VOL III, PTS A AND B, 2002, 3 : 16 - 26
  • [24] Integrating Industry 4.0 Technologies for Enhanced Safety Engineering: A Comprehensive Review and Analysis
    Hutchins, Savannah
    Jhaveri, Niral
    Duffy, Vincent G.
    HCI INTERNATIONAL 2023 LATE BREAKING PAPERS, HCII 2023,PT IV, 2023, 14057 : 43 - 58
  • [25] Market revenue prediction and error analysis of products based on fuzzy logic and artificial intelligence algorithms
    Zhao Jian
    Zhang Qingyuan
    Tian Liying
    Journal of Ambient Intelligence and Humanized Computing, 2020, 11 : 4011 - 4018
  • [26] Market revenue prediction and error analysis of products based on fuzzy logic and artificial intelligence algorithms
    Zhao, Jian
    Zhang, Qingyuan
    Tian, Liying
    JOURNAL OF AMBIENT INTELLIGENCE AND HUMANIZED COMPUTING, 2020, 11 (10) : 4011 - 4018
  • [27] Assessment of Name Based Algorithms for Land Administration Ontology Matching
    Zarembo, Imants
    Teilans, Artis
    Rausis, Aldis
    Buls, Jazeps
    ICTE IN REGIONAL DEVELOPMENT, 2014, 43 : 53 - 61
  • [28] Analysis on the Training Strategy of Compound Talents in Safety Engineering Against the Background of "Belt and Road"
    Xie, Chengyu
    Lu, Hao
    Shi, Dongping
    Jia, Nan
    PROCEEDINGS OF THE 2018 INTERNATIONAL WORKSHOP ON EDUCATION REFORM AND SOCIAL SCIENCES (ERSS 2018), 2018, 300 : 417 - 420
  • [29] The Comprehensive Analysis of the Effect of Chinese Word Segmentation on Fuzzy-Based Classification Algorithms for Agricultural Questions
    Zhao, Xinyue
    Huang, Jianing
    Zhang, Jing
    Song, Yunsheng
    INTERNATIONAL JOURNAL OF FUZZY SYSTEMS, 2024, 26 (08) : 2726 - 2749
  • [30] Grouping Test Results with the Common Root Cause Using String Similarity Algorithms
    Kramar, Vladimir T.
    Nurminen, Jukka K.
    Aalto, Tatu
    PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON INNOVATIONS IN COMPUTING RESEARCH (ICR'22), 2022, 1431 : 214 - 224