Automatic extraction of semantic relationships for WordNet by means of pattern learning from Wikipedia

被引:0
作者
Ruiz-Casado, M [1 ]
Alfonseca, E [1 ]
Castells, P [1 ]
机构
[1] Univ Autonoma Madrid, Dept Comp Sci, Madrid 28049, Spain
来源
NATURAL LANGUAGE PROCESSING AND INFORMATION SYSTEMS, PROCEEDINGS | 2005年 / 3513卷
关键词
ONTOLOGY; WEB;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper describes an automatic approach to identify lexical patterns which represent semantic relationships between concepts, from an on-line 14 encyclopedia. Next, these patterns can be applied to extend existing ontologies or semantic networks with new relations. The experiments have been performed with the Simple English Wikipedia and WordNet 1.7. A new algorithm has been devised for automatically generalising the lexical patterns found in the encyclopedia entries. We have found general patterns for the hyperonymy, hyponymy, 14 holonymy and meronymy relations and, using them, we have extracted more than 1 1200 new relationships that did not appear in WordNet originally. The precision of these relationships ranges between 0.61 and 0.69, depending on the relation.
引用
收藏
页码:67 / 79
页数:13
相关论文
共 34 条
  • [1] Alfonseca E, 2002, LECT NOTES ARTIF INT, V2473, P1
  • [2] Alfonseca E., 2003, WRAETLIC USER GUIDE
  • [3] ALFONSECA E, 2002, LANGUAGE RESOURCES E
  • [4] Alfonseca E., 2002, P 1 INT C GEN WORDN
  • [5] [Anonymous], P 14 EUR C ART INT
  • [6] Berland Matthew., 1999, P ACL 99
  • [7] The Semantic Web - A new form of Web content that is meaningful to computers will unleash a revolution of new possibilities
    Berners-Lee, T
    Hendler, J
    Lassila, O
    [J]. SCIENTIFIC AMERICAN, 2001, 284 (05) : 34 - +
  • [8] CIMIANO P, 2004, P LREC 2004
  • [9] DEBONI M, 2002, P 1 INT C GEN WORDN
  • [10] DEGEN W, 2001, P INT C FORM ONT INF