Computing the Expected Edit Distance from a String to a PFA

被引:1
|
作者
Calvo-Zaragoza, Jorge [1 ]
de la Higuera, Colin [2 ]
Oncina, Jose [1 ]
机构
[1] Univ Alicante, DLSI, Alicante, Spain
[2] Univ Nantes, LINA Lab, UMR 6241, Nantes, France
来源
Implementation and Application of Automata | 2016年 / 9705卷
关键词
Edit distance; Probabilistic finite state automata;
D O I
10.1007/978-3-319-40946-7_4
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
In a number of fields one is to compare a witness string with a distribution. One possibility is to compute the probability of the string for that distribution. Another, giving a more global view, is to compute the expected edit distance from a string randomly drawn to the witness string. This number is often used to measure the performance of a prediction, the goal then being to return the median string, or the string with smallest expected distance. To be able to measure this, computing the distance between a hypothesis and that distribution is necessary. This paper proposes two solutions for computing this value, when the distribution is defined with a probabilistic finite state automaton. The first is exact but has a cost which can be exponential in the length of the input string, whereas the second is a FPRAS.
引用
收藏
页码:39 / 50
页数:12
相关论文
共 50 条
  • [1] Computing the Expected Edit Distance from a String to a Probabilistic Finite-State Automaton
    Calvo-Zaragoza, Jorgc
    Oncina, Jose
    de la Higucra, Colin
    INTERNATIONAL JOURNAL OF FOUNDATIONS OF COMPUTER SCIENCE, 2017, 28 (05) : 603 - 621
  • [2] Computing the Shortest String and the Edit-Distance for Parsing Expression Languages
    Cheon, Hyunjoon
    Han, Yo-Sub
    DEVELOPMENTS IN LANGUAGE THEORY, DLT 2020, 2020, 12086 : 43 - 54
  • [3] Classes of cost functions for string edit distance
    S. V. Rice
    H. Bunke
    T. A. Nartker
    Algorithmica, 1997, 18 : 271 - 280
  • [4] Classes of cost functions for string edit distance
    Rice, SV
    Bunke, H
    Nartker, TA
    ALGORITHMICA, 1997, 18 (02) : 271 - 280
  • [5] Bounded Occurrence Edit Distance: A New Metric for String Similarity Joins with Edit Distance Constraints
    Komatsu, Tomoki
    Okuta, Ryosuke
    Narisawa, Kazuyuki
    Shinohara, Ayumi
    SOFSEM 2014: THEORY AND PRACTICE OF COMPUTER SCIENCE, 2014, 8327 : 363 - 374
  • [6] The String Edit Distance Matching Problem With Moves
    Cormode, Graham
    Muthukrishnan, S.
    ACM TRANSACTIONS ON ALGORITHMS, 2007, 3 (01)
  • [7] How hard is computing the edit distance?
    Pighizzini, G
    INFORMATION AND COMPUTATION, 2001, 165 (01) : 1 - 13
  • [8] Computing the edit distance of a regular language
    Konstantinidis, Stavros
    INFORMATION AND COMPUTATION, 2007, 205 (09) : 1307 - 1316
  • [9] Edit distance for a run-length-encoded string and an uncompressed string
    Liu, J. J.
    Huang, G. S.
    Wang, Y. L.
    Lee, R. C. T.
    INFORMATION PROCESSING LETTERS, 2007, 105 (01) : 12 - 16
  • [10] An algorithm for string edit distance allowing substring reversals
    Arslan, Abdullah N.
    BIBE 2006: SIXTH IEEE SYMPOSIUM ON BIOINFORMATICS AND BIOENGINEERING, PROCEEDINGS, 2006, : 220 - +