Computing the Expected Edit Distance from a String to a PFA

被引:1
作者
Calvo-Zaragoza, Jorge [1 ]
de la Higuera, Colin [2 ]
Oncina, Jose [1 ]
机构
[1] Univ Alicante, DLSI, Alicante, Spain
[2] Univ Nantes, LINA Lab, UMR 6241, Nantes, France
来源
Implementation and Application of Automata | 2016年 / 9705卷
关键词
Edit distance; Probabilistic finite state automata;
D O I
10.1007/978-3-319-40946-7_4
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
In a number of fields one is to compare a witness string with a distribution. One possibility is to compute the probability of the string for that distribution. Another, giving a more global view, is to compute the expected edit distance from a string randomly drawn to the witness string. This number is often used to measure the performance of a prediction, the goal then being to return the median string, or the string with smallest expected distance. To be able to measure this, computing the distance between a hypothesis and that distribution is necessary. This paper proposes two solutions for computing this value, when the distribution is defined with a probabilistic finite state automaton. The first is exact but has a cost which can be exponential in the length of the input string, whereas the second is a FPRAS.
引用
收藏
页码:39 / 50
页数:12
相关论文
共 50 条
[21]   MinJoin plus plus : a fast algorithm for string similarity joins under edit distance [J].
Karpov, Nikolai ;
Zhang, Haoyu ;
Zhang, Qin .
VLDB JOURNAL, 2024, 33 (02) :281-299
[22]   Efficient Algorithms for Computing the Inner Edit Distance of a Regular Language via Transducers [J].
Kari, Lila ;
Konstantinidis, Stavros ;
Kopecki, Steffen ;
Yang, Meng .
ALGORITHMS, 2018, 11 (11)
[23]   Markov edit distance [J].
Wei, J .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2004, 26 (03) :311-321
[24]   Burst Edit Distance [J].
Boneh, Itai ;
Golan, Shay ;
Levy, Avivit ;
Porat, Ely ;
Shalom, B. Riva .
STRING PROCESSING AND INFORMATION RETRIEVAL, SPIRE 2024, 2025, 14899 :41-56
[25]   Efficiently Supporting Edit Distance Based String Similarity Search Using B+-Trees [J].
Lu, Wei ;
Du, Xiaoyong ;
Hadjieleftheriou, Marios ;
Ooi, Beng Chin .
IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2014, 26 (12) :2983-2996
[27]   The maximum edit distance from hereditary graph properties [J].
Alon, Noga ;
Stav, Uri .
JOURNAL OF COMBINATORIAL THEORY SERIES B, 2008, 98 (04) :672-697
[28]   Computing the Edit-Distance between a Regular Language and a Context-Free Language [J].
Han, Yo-Sub ;
Ko, Sang-Ki ;
Salomaa, Kai .
DEVELOPMENTS IN LANGUAGE THEORY (DLT 2012), 2012, 7410 :85-96
[29]   A Fully Compressed Algorithm for Computing the Edit Distance of Run-Length Encoded Strings [J].
Kuan-Yu Chen ;
Kun-Mao Chao .
Algorithmica, 2013, 65 :354-370
[30]   A Fully Compressed Algorithm for Computing the Edit Distance of Run-Length Encoded Strings [J].
Chen, Kuan-Yu ;
Chao, Kun-Mao .
ALGORITHMICA, 2013, 65 (02) :354-370