Characteristic sets for polynomial grammatical inference

被引:71
作者
DelaHiguera, C
机构
[1] Dept. d'Informatique Fondamentale, LIRMM, 34 392 Montpellier Cedex 5
关键词
exact identification; grammatical inference; polynomial learning;
D O I
10.1023/A:1007353007695
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
When concerned about efficient grammatical inference two issues are relevant: the first one is to determine the quality of the result, and the second is to try to use polynomial time and space. A typical idea to deal with the first point is to say that an algorithm performs well if it infers in the limit the correct language. The second point has led to debate about how to define polynomial time: the main definitions of polynomial inference have been proposed by Pitt and Angluin. We return in this paper to a definition proposed by Gold that requires a characteristic set of strings to exist for each grammar, and this set to be polynomial in the size of the grammar or automaton that is to be learned, where the size of the sample is the sum of the lengths of all strings it includes. The learning algorithm must also infer correctly as soon as the characteristic set is included in the data. We first show that this definition corresponds to a notion of teachability as defined by Goldman and Mathias. By adapting their teacher/learner model to grammatical inference we prove that languages given by context-free grammars, simple deterministic grammars, linear grammars and nondeterministic finite automata are not identifiable in the limit from polynomial time and data.
引用
收藏
页码:125 / 138
页数:14
相关论文
共 27 条
[1]  
Angluin D., 1988, Machine Learning, V2, P319, DOI 10.1007/BF00116828
[2]   WHEN WONT MEMBERSHIP QUERIES HELP [J].
ANGLUIN, D ;
KHARITONOV, M .
JOURNAL OF COMPUTER AND SYSTEM SCIENCES, 1995, 50 (02) :336-355
[3]  
[Anonymous], 1979, Computers and Intractablity: A Guide to the Theoryof NP-Completeness
[4]  
Anthony M., 1992, Proceedings of the Fifth Annual ACM Workshop on Computational Learning Theory, P311, DOI 10.1145/130385.130420
[5]  
CASTELLANOS A, 1994, LECT NOTES ARTIF INT, V862, P93
[6]  
FREIVALDS R, 1989, LECT NOTES ARTIF INT, V397, P1
[7]   INFERENCE OF KAPPA-TESTABLE LANGUAGES IN THE STRICT SENSE AND APPLICATION TO SYNTACTIC PATTERN-RECOGNITION [J].
GARCIA, P ;
VIDAL, E .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 1990, 12 (09) :920-925
[8]  
Garcia P., 1994, INT J PATTERN RECOGN, V4, P667
[9]   COMPLEXITY OF AUTOMATON IDENTIFICATION FROM GIVEN DATA [J].
GOLD, EM .
INFORMATION AND CONTROL, 1978, 37 (03) :302-320
[10]   LANGUAGE IDENTIFICATION IN LIMIT [J].
GOLD, EM .
INFORMATION AND CONTROL, 1967, 10 (05) :447-&