Co-occurrence graph-based context adaptation: a new unsupervised approach to word sense disambiguation

Cited by: 1
Authors
Rahmani, Saeed [1]
Fakhrahmad, Seyed Mostafa [1]
Sadreddini, Mohammad Hadi [1]
Affiliations
[1] Shiraz Univ, Comp Sci & Engn Dept, Shiraz, Iran
DOI
10.1093/llc/fqz048
Chinese Library Classification (CLC) number
C [Social Sciences, General];
Subject classification code
03; 0303
Abstract
Word sense disambiguation (WSD) is the task of selecting the correct sense for an ambiguous word in its context. Since WSD is one of the most challenging tasks in various text processing systems, improving its accuracy can be very beneficial. In this article, we propose a new unsupervised method based on a co-occurrence graph built from a monolingual corpus, without any dependency on the structure and properties of the language itself. In the proposed method, the context of an ambiguous word is represented as a sub-graph extracted from a large word co-occurrence graph built from a corpus. Most of the words are connected in this graph. To clarify the exact sense of an ambiguous word, its senses and their relations are added to the context graph, and various similarity functions are applied to the senses and the context graph. In the disambiguation process, we select the senses with the highest similarity to the context graph. Unlike other WSD methods, the proposed method does not use any language-dependent resources (e.g. WordNet); it uses only a monolingual corpus. Therefore, the proposed method can be employed for other languages. Moreover, increasing the size of the corpus can improve the accuracy of WSD. Experimental results on English and Persian datasets show that the proposed method is competitive with existing supervised and unsupervised WSD approaches.
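The general idea in the abstract can be illustrated with a small Python sketch. This is not the authors' implementation: the sentence-level co-occurrence window, the representation of each sense as a short list of cue words, and the similarity function (the total weight of edges linking sense words to context words) are all assumptions made for illustration, and every function name here is hypothetical.

```python
from collections import Counter
from itertools import combinations


def build_cooccurrence_graph(sentences):
    """Weighted graph: edge (a, b) counts the sentences containing both words.

    Assumption: co-occurrence is measured per sentence; the abstract does not
    specify the window.
    """
    graph = Counter()
    for sentence in sentences:
        words = sorted(set(sentence.lower().split()))
        for a, b in combinations(words, 2):  # pairs come out in sorted order
            graph[(a, b)] += 1
    return graph


def induced_subgraph(graph, nodes):
    """Keep only the edges whose two endpoints are both in `nodes`."""
    nodes = set(nodes)
    return {pair: w for pair, w in graph.items()
            if pair[0] in nodes and pair[1] in nodes}


def sense_similarity(graph, sense_words, context_words):
    """Assumed similarity: total weight of edges joining a sense word to a
    context word, inside the sub-graph induced by both word sets."""
    sense_words, context_words = set(sense_words), set(context_words)
    sub = induced_subgraph(graph, sense_words | context_words)
    total = 0
    for (a, b), weight in sub.items():
        if (a in sense_words and b in context_words) or \
           (b in sense_words and a in context_words):
            total += weight
    return total


def disambiguate(graph, senses, context_words):
    """Return the sense label with the highest similarity to the context."""
    return max(senses,
               key=lambda label: sense_similarity(graph, senses[label], context_words))


# Toy corpus and hand-written sense cue words (both hypothetical).
corpus = [
    "the bank approved the loan and the money arrived",
    "she deposited money at the bank",
    "the river bank was covered in water and mud",
    "fishing by the river near the water",
]
senses = {
    "bank/finance": ["money", "loan"],
    "bank/river": ["river", "water"],
}
graph = build_cooccurrence_graph(corpus)
context = ["river", "fishing", "mud"]  # words surrounding the ambiguous "bank"
print(disambiguate(graph, senses, context))
```

In this toy setting the only resource is the raw corpus, which mirrors the abstract's claim of language independence; how sense descriptions are actually obtained and which similarity functions are used are details of the full paper that the sketch does not attempt to reproduce.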
Pages: 449-471
Page count: 23