Semantic separator learning and its applications in unsupervised Chinese text parsing

Cited by: 0
Authors
Yuming Wu
Xiaodong Luo
Zhen Yang
Affiliations
[1] Chinese Academy of Sciences, Key Laboratory of Intelligent Information Processing, Institute of Computing Technology
[2] Graduate University of the Chinese Academy of Sciences
[3] China Telecom Corporation Limited Shanghai Branch
[4] Shanghai Research Institute of China Telecom Corporation Limited
Source
Frontiers of Computer Science | 2013 / Vol. 7
Keywords
semantic separator; separator learning; unsupervised text parsing
DOI
Not available
Abstract
Grammar learning has long been a bottleneck problem. In this paper, we propose a method of semantic separator learning, a special case of grammar learning. The method is based on the hypothesis that some classes of words, called semantic separators, split a sentence into several constituents. The semantic separators are represented by words together with their part-of-speech tags and other information, so that rich semantic information can be involved. In the method, we first identify the semantic separators with the help of noun phrase boundaries, called subseparators. Next, the argument classes of the separators are learned from a corpus by generalizing argument instances in a hypernym space. Finally, to evaluate the learned semantic separators, we use them in unsupervised Chinese text parsing. Experiments on a manually labeled test set show that the proposed method outperforms previous methods of unsupervised text parsing.
Pages: 55-68
Number of pages: 13