Co-occurrence graph-based context adaptation: a new unsupervised approach to word sense disambiguation

Cited by: 1
Authors
Rahmani, Saeed [1]
Fakhrahmad, Seyed Mostafa [1]
Sadreddini, Mohammad Hadi [1]
Affiliations
[1] Shiraz Univ, Comp Sci & Engn Dept, Shiraz, Iran
Keywords
DOI
10.1093/llc/fqz048
Chinese Library Classification number
C [Social Sciences, General]
Discipline classification code
03; 0303
Abstract
Word sense disambiguation (WSD) is the task of selecting the correct sense for an ambiguous word in its context. Since WSD is one of the most challenging tasks in various text processing systems, improving its accuracy can be very beneficial. In this article, we propose a new unsupervised method based on a co-occurrence graph created from a monolingual corpus, without any dependency on the structure and properties of the language itself. In the proposed method, the context of an ambiguous word is represented as a sub-graph extracted from a large word co-occurrence graph built from a corpus. Most of the words are connected in this graph. To clarify the exact sense of an ambiguous word, its senses and relations are added to the context graph, and various similarity functions are employed based on the senses and context graph. In the disambiguation process, we select the senses with the highest similarity to the context graph. In contrast to other WSD methods, the proposed method does not use any language-dependent resources (e.g. WordNet); it uses only a monolingual corpus. Therefore, the proposed method can be employed for other languages. Moreover, by increasing the size of the corpus, it is possible to enhance the accuracy of WSD. Experimental results on English and Persian datasets show that the proposed method is competitive with existing supervised and unsupervised WSD approaches.
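The general idea in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not specify how the graph is weighted, how senses are represented, or which similarity functions are used. The sketch assumes sentence-level co-occurrence counts as edge weights, represents each sense by a hand-picked list of cue words (hypothetical, for illustration only), and scores a sense by the average edge weight between its cue words and the context words.

```python
from collections import Counter
from itertools import combinations


def build_cooccurrence_graph(sentences):
    """Count how often two distinct words share a sentence (undirected edges)."""
    graph = Counter()
    for sentence in sentences:
        words = sorted(set(sentence.lower().split()))
        for a, b in combinations(words, 2):
            graph[(a, b)] += 1
    return graph


def edge_weight(graph, a, b):
    """Weight of the undirected edge a-b; 0 if the words never co-occur."""
    return graph[tuple(sorted((a, b)))]


def sense_score(graph, sense_words, context_words):
    """Assumed similarity: total edge weight between sense cue words and
    context words, averaged over the number of cue words."""
    total = sum(edge_weight(graph, s, c)
                for s in sense_words for c in context_words)
    return total / len(sense_words)


def disambiguate(graph, senses, context_words):
    """Return the name of the sense most similar to the context."""
    return max(senses,
               key=lambda name: sense_score(graph, senses[name], context_words))


# Toy monolingual corpus (hypothetical data).
corpus = [
    "the bank approved the loan",
    "she deposited money at the bank",
    "money and loan interest",
    "the river bank was muddy",
    "fishing on the river shore",
    "water flowed along the shore",
]
graph = build_cooccurrence_graph(corpus)

# Hypothetical sense inventory for the ambiguous word "bank".
senses = {
    "finance": ["money", "loan"],
    "river": ["river", "shore", "water"],
}

# Context of the ambiguous word, with the target word itself removed.
sentence = "he sat on the bank fishing near the water"
context = set(sentence.split()) - {"bank"}
print(disambiguate(graph, senses, context))
```

In this toy setting, enlarging the corpus adds edges and raises counts, which is one way to read the abstract's remark that a larger corpus can improve accuracy. A real system would also need to handle very frequent words such as "the", which co-occur with almost everything.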
Pages: 449-471
Number of pages: 23
Related papers
50 in total
  • [41] Land Cover Mapping with Higher Order Graph-Based Co-Occurrence Model
    Zhao, Wenzhi
    Emery, William J.
    Bo, Yanchen
    Chen, Jiage
    REMOTE SENSING, 2018, 10 (11)
  • [42] A Graph-based Approach to Person Name Disambiguation in Web
    Emami, Hojjat
    ACM TRANSACTIONS ON MANAGEMENT INFORMATION SYSTEMS, 2019, 10 (02)
  • [43] WIKHDRANK: AN UNSUPERVISED APPROACH FOR ENTITY LINKING BASED ON INSTANCE CO-OCCURRENCE
    Fernandez, Norberto
    Fisteus, Jesus A.
    Sanchez, Luis
    Fuentes-Lorenzo, Damaris
    INTERNATIONAL JOURNAL OF INNOVATIVE COMPUTING INFORMATION AND CONTROL, 2012, 8 (11) : 7519 - 7541
  • [44] Text Clustering Algorithm Based on the Graph Structures of Semantic Word Co-occurrence
    Jin, Chun-Xia
    Bai, Qiu-Chan
    2016 INTERNATIONAL CONFERENCE ON INFORMATION SYSTEM AND ARTIFICIAL INTELLIGENCE (ISAI 2016), 2016 : 497 - 502
  • [45] Unsupervised Multimodal Word Discovery Based on Double Articulation Analysis With Co-Occurrence Cues
    Taniguchi, Akira
    Murakami, Hiroaki
    Ozaki, Ryo
    Taniguchi, Tadahiro
    IEEE TRANSACTIONS ON COGNITIVE AND DEVELOPMENTAL SYSTEMS, 2023, 15 (04) : 1825 - 1840
  • [46] A New Word Sense Disambiguation System Based on Deduction
    Fakhrahmad, S. M.
    Rezapour, A. R.
    Jahromi, M. Zolghadri
    Sadreddini, M. H.
    WORLD CONGRESS ON ENGINEERING, WCE 2011, VOL II, 2011 : 1276 - 1281
  • [47] Vector-Based Unsupervised Word Sense Disambiguation for Large Number of Contexts
    Papp, Gyula
    TEXT, SPEECH AND DIALOGUE, PROCEEDINGS, 2009, 5729 : 109 - 115
  • [48] A graph-based approach to context matching
    Olaru, Andrei
    Florea, Adina Magda
    Scalable Computing, 2010, 11 (04) : 393 - 400
  • [49] A GRAPH-BASED APPROACH TO CONTEXT MATCHING
    Olaru, Andrei
    Florea, Adina Magda
    SCALABLE COMPUTING-PRACTICE AND EXPERIENCE, 2010, 11 (04) : 393 - 399
  • [50] Using Context Information for Knowledge-Based Word Sense Disambiguation
    Simov, Kiril
    Osenova, Petya
    Popov, Alexander
    ARTIFICIAL INTELLIGENCE: METHODOLOGY, SYSTEMS, AND APPLICATIONS, AIMSA 2016, 2016, 9883 : 130 - 139