Computing the Expected Edit Distance from a String to a PFA

被引:1
作者
Calvo-Zaragoza, Jorge [1 ]
de la Higuera, Colin [2 ]
Oncina, Jose [1 ]
机构
[1] Univ Alicante, DLSI, Alicante, Spain
[2] Univ Nantes, LINA Lab, UMR 6241, Nantes, France
来源
Implementation and Application of Automata | 2016年 / 9705卷
关键词
Edit distance; Probabilistic finite state automata;
D O I
10.1007/978-3-319-40946-7_4
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
In a number of fields one is to compare a witness string with a distribution. One possibility is to compute the probability of the string for that distribution. Another, giving a more global view, is to compute the expected edit distance from a string randomly drawn to the witness string. This number is often used to measure the performance of a prediction, the goal then being to return the median string, or the string with smallest expected distance. To be able to measure this, computing the distance between a hypothesis and that distribution is necessary. This paper proposes two solutions for computing this value, when the distribution is defined with a probabilistic finite state automaton. The first is exact but has a cost which can be exponential in the length of the input string, whereas the second is a FPRAS.
引用
收藏
页码:39 / 50
页数:12
相关论文
共 50 条
  • [21] MinJoin plus plus : a fast algorithm for string similarity joins under edit distance
    Karpov, Nikolai
    Zhang, Haoyu
    Zhang, Qin
    VLDB JOURNAL, 2024, 33 (02) : 281 - 299
  • [22] Efficient Algorithms for Computing the Inner Edit Distance of a Regular Language via Transducers
    Kari, Lila
    Konstantinidis, Stavros
    Kopecki, Steffen
    Yang, Meng
    ALGORITHMS, 2018, 11 (11)
  • [23] Markov edit distance
    Wei, J
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2004, 26 (03) : 311 - 321
  • [24] Burst Edit Distance
    Boneh, Itai
    Golan, Shay
    Levy, Avivit
    Porat, Ely
    Shalom, B. Riva
    STRING PROCESSING AND INFORMATION RETRIEVAL, SPIRE 2024, 2025, 14899 : 41 - 56
  • [25] Efficiently Supporting Edit Distance Based String Similarity Search Using B+-Trees
    Lu, Wei
    Du, Xiaoyong
    Hadjieleftheriou, Marios
    Ooi, Beng Chin
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2014, 26 (12) : 2983 - 2996
  • [27] The maximum edit distance from hereditary graph properties
    Alon, Noga
    Stav, Uri
    JOURNAL OF COMBINATORIAL THEORY SERIES B, 2008, 98 (04) : 672 - 697
  • [28] Computing the Edit-Distance between a Regular Language and a Context-Free Language
    Han, Yo-Sub
    Ko, Sang-Ki
    Salomaa, Kai
    DEVELOPMENTS IN LANGUAGE THEORY (DLT 2012), 2012, 7410 : 85 - 96
  • [29] A Fully Compressed Algorithm for Computing the Edit Distance of Run-Length Encoded Strings
    Kuan-Yu Chen
    Kun-Mao Chao
    Algorithmica, 2013, 65 : 354 - 370
  • [30] A Fully Compressed Algorithm for Computing the Edit Distance of Run-Length Encoded Strings
    Chen, Kuan-Yu
    Chao, Kun-Mao
    ALGORITHMICA, 2013, 65 (02) : 354 - 370