Learning Probabilistic Languages by k-Testable Machines

被引:2
作者
Chu, Wenjing [1 ]
Bonsangue, Marcello [1 ]
机构
[1] Leiden Univ, Leiden Inst Adv Comp Sci, Leiden, Netherlands
来源
2020 INTERNATIONAL SYMPOSIUM ON THEORETICAL ASPECTS OF SOFTWARE ENGINEERING (TASE 2020) | 2020年
关键词
automata; k-testable language; probabilistic finite automata; passive learning; INFERENCE; MODEL;
D O I
10.1109/TASE49443.2020.00026
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
A k-testable machine is a finite automaton which recognizes a language L by only seeing a window of size k of each string in L. In this paper we use k-testable machines to recognize probabilistic languages and propose a novel algorithm to learn them. We work in the context of passive learning as our algorithm is based on a finite sample of strings belonging to the target language equipped with frequencies. Because our algorithm learns a probabilistic automaton, the resulting language is less sensitive to noise threshold than Garcia's algorithm. When compared with the ALERCIA learning algorithm, our method provides a better result in the case of the target language being a k-testable language. In fact, in this case, for the given window k we can learn at the limit the target language exactly.
引用
收藏
页码:129 / 136
页数:8
相关论文
共 23 条
[1]   INDUCTIVE INFERENCE - THEORY AND METHODS [J].
ANGLUIN, D ;
SMITH, CH .
COMPUTING SURVEYS, 1983, 15 (03) :237-269
[2]  
Angluin D, 1988, Identifying languages from stochastic examples
[3]  
[Anonymous], 2015, INT J RES COMPUTER A
[4]  
[Anonymous], 1963, Information Theory and Coding
[5]  
Carrasco R. C., 1994, Grammatical Inference and Applications. Second International Colloquium, ICGI-94 Proceedings, P139
[6]   Learning deterministic regular grammars from stochastic samples in polynomial time [J].
Carrasco, RC ;
Oncina, J .
RAIRO-INFORMATIQUE THEORIQUE ET APPLICATIONS-THEORETICAL INFORMATICS AND APPLICATIONS, 1999, 33 (01) :1-19
[7]  
Coste F., 2016, Topics in Grammatical Inference, P215, DOI 10.1007/978-3-662-48395-4_8
[8]  
De la Higuera C., 2010, Artificial intelligence techniques
[9]   INFERENCE OF KAPPA-TESTABLE LANGUAGES IN THE STRICT SENSE AND APPLICATION TO SYNTACTIC PATTERN-RECOGNITION [J].
GARCIA, P ;
VIDAL, E .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 1990, 12 (09) :920-925
[10]  
Garcia P., 1990, Algorithmic Learning Theory, P325