Hindi Word Sense Disambiguation Using Cosine Similarity

Authors
Sarika, D. K. [1]
Sharma, Dilip Kumar [1]
Affiliations
[1] GLA Univ, Dept Comp Engn & Applicat, Mathura, India
Keywords
Word sense disambiguation; Natural language processing; Ambiguity; Hindi WordNet; Cosine similarity
DOI
10.1007/978-981-10-0135-2_76
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Hindi is a regional language of India. Most people access, retrieve, and share documents in Hindi. Like all natural languages, Hindi is ambiguous, which hinders the effective use of information technology. Resolving this ambiguity requires a Hindi word sense disambiguation (HWSD) system. In this paper, we present a supervised HWSD method based on cosine similarity, in which weighted vectors are created for the test query and for the sense knowledge data of the ambiguous word. Experiments on a dataset of 90 ambiguous Hindi words show that the method outperforms Lesk's algorithm, a well-known algorithm for word sense disambiguation (WSD). We obtained an overall average precision of 78.99% and an average recall of 72.58%.
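The vector comparison described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not specify the weighting scheme, so raw term frequency is assumed here, and the function names (`to_vector`, `cosine_similarity`, `disambiguate`) and the toy sense data for the ambiguous word "सोना" (gold / sleep) are hypothetical.

```python
import math
from collections import Counter


def to_vector(tokens):
    """Build a sparse term-weight vector (assumption: raw term frequency)."""
    return Counter(tokens)


def cosine_similarity(a, b):
    """Cosine of the angle between two sparse vectors stored as dicts."""
    dot = sum(weight * b[term] for term, weight in a.items() if term in b)
    norm_a = math.sqrt(sum(w * w for w in a.values()))
    norm_b = math.sqrt(sum(w * w for w in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0  # an empty vector is treated as similar to nothing
    return dot / (norm_a * norm_b)


def disambiguate(context_tokens, sense_tokens):
    """Return the sense whose vector is closest to the query context."""
    query = to_vector(context_tokens)
    scores = {
        sense: cosine_similarity(query, to_vector(tokens))
        for sense, tokens in sense_tokens.items()
    }
    best = max(scores, key=scores.get)
    return best, scores


# Toy, made-up sense knowledge data for the ambiguous word "सोना".
senses = {
    "gold": ["धातु", "आभूषण", "महंगा", "पीला"],
    "sleep": ["नींद", "रात", "आराम", "बिस्तर"],
}
context = ["सोना", "महंगा", "आभूषण"]

best, scores = disambiguate(context, senses)
print(best, scores)
```

With this toy data the context shares two terms with the "gold" sense and none with the "sleep" sense, so "gold" receives the higher score. A real system would draw the sense vectors from annotated data or Hindi WordNet glosses and would likely apply stop-word removal and a more informative weighting.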
Pages: 801-808
Number of pages: 8