WORD SENSE DISAMBIGUATION USING WORD ONTOLOGY AND CONCEPT DISTRIBUTION

被引:1
作者
Hung, Jason C. [1 ]
Yang, Che-Yu [2 ]
机构
[1] Overseas Chinese Inst Technol, Dept Informat Technol, Taichung 407, Taiwan
[2] China Univ Technol, Dept Informat Management, Hsinchu 300, Taiwan
关键词
word sense disambiguation; semantic relatedness; semantic similarity; natural language processing; wordnet;
D O I
10.1080/02533839.2009.9671494
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
This paper presents a method of word sense disambiguation that assigns a target word the sense that is most related to the senses of its neighbor words. We explore the use of measures of relatedness between word senses based on a novel hybrid approach. First, we investigate how to "literally" and "regularly" express a "concept". We apply set algebra to Wordnet's synsets cooperating with Wordnet's word ontology. In this way we establish regular rules for constructing various representations (lexical notations) of a concept using Boolean operators and word forms in various synset(s) defined in Wordnet. Then we establish a formal mechanism for quantifying and estimating the semantic relatedness between concepts - we facilitate "concept distribution statistics" to determine the degree of semantic relatedness between two lexically expressed concepts. Human languages have words that can mean different things in different contexts, such words with multiple meanings are potentially "ambiguous". The process of "deciding which of several meanings of a term is intended in a given context" is known as "Word Sense Disambiguation (WSD)". The proposed method is not supervised, and does not require any manually created sense-tagged training examples. The experimental results showed good performance on Semcor, a subset of the Brown corpus. We observe that measures of semantic relatedness are useful sources of information for word sense disambiguation.
引用
收藏
页码:153 / 168
页数:16
相关论文
共 23 条
[1]  
AGIRRE E, 1996, P 16 INT C COMP LING, P16
[2]  
[Anonymous], 2001, NAACL 2001
[3]  
[Anonymous], WORDNET ELECT LEXICA
[4]   Semantic feature selection using WordNet [J].
Chua, S ;
Kulathuramaiyer, N .
IEEE/WIC/ACM INTERNATIONAL CONFERENCE ON WEB INTELLIGENCE (WI 2004), PROCEEDINGS, 2004, :166-172
[5]  
Fellbaum C, 1998, ELECT LEXICAL DATABA
[6]  
JIANG JJ, 1997, P ROCLING 10 1997 IN, P128
[7]   I don't believe in word senses [J].
Kilgarriff, A .
COMPUTERS AND THE HUMANITIES, 1997, 31 (02) :91-113
[8]  
KIM SB, 2004, P 27 ANN INT ACM SIG, P25
[9]  
LEACOCK C, 1998, WORDNET LEXICAL REFE, P110
[10]   INFORMATION-RETRIEVAL BASED ON CONCEPTUAL DISTANCE IN IS-A HIERARCHIES [J].
LEE, JH ;
KIM, MH ;
LEE, YJ .
JOURNAL OF DOCUMENTATION, 1993, 49 (02) :188-207