Role of Karaka Relations in Hindi Word Sense Disambiguation

被引:11
|
作者
Singh, Satyendr
Siddiqui, Tanveer J.
机构
[1] Department of Electronics and Communication, University of Allahabad, Allahabad
关键词
Case Markers; Hindi Word Sense Disambiguation; Karaka Relations; Supervised Word Sense Disambiguation; Vibhakti;
D O I
10.4018/JITR.2015070102
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Karakas are an important constituent of Hindi language. Karaka relations express syntactico-semantic or semantico-syntactic relationship between verbs and nouns or pronouns in a sentence. They capture certain level of semantics closer to thematic relations, but different from it. A vibhakti is assigned to each karaka, in Paninian grammar. This paper investigates the role of karaka relations in Hindi Word Sense Disambiguation (WSD) by utilizing vibhaktis. Two supervised WSD algorithms were used for disambiguation. The first algorithm is based on conditional probability of co-occurring words and the second algorithm is Naive Bayes (NB) classifier. The first algorithm utilizes various heuristics for analyzing the role of karakas in Hindi WSD. The authors obtained an improvement of 14.86% in precision by utilizing content words, vibhaktis and phrases containing them in context vector over the context vector of content words after dropping vibhaktis. A gain of 6.91% in precision was observed by using content words and vibhaktis in context vector over the context vector of content words after dropping vibhaktis of similar context window size. The authors obtained maximum precision of 50.73% by extracting vibhaktis in a perpendicular to 3 window using WSD algorithm based on conditional probability of co-occurring words. They obtained maximum precision of 56.56% by extracting vibhaktis in a +/- 4 window using NB classifier.
引用
收藏
页码:21 / 42
页数:22
相关论文
共 50 条
  • [1] Role of Semantic Relations in Hindi Word Sense Disambiguation
    Singh, Satyendr
    Siddiqui, Tanveer J.
    PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON INFORMATION AND COMMUNICATION TECHNOLOGIES, ICICT 2014, 2015, 46 : 240 - 248
  • [2] Role of Genetic Algorithm in Optimization of Hindi Word Sense Disambiguation
    Bhatia, Surbhi
    Kumar, Ankit
    Khan, Mohammed Mutillah
    IEEE ACCESS, 2022, 10 : 75693 - 75707
  • [3] Word Sense Disambiguation of Hindi Text using Fuzzified Semantic Relations and Fuzzy Hindi WordNet
    Jain, Goonjan
    Lobiyal, D. K.
    2019 9TH INTERNATIONAL CONFERENCE ON CLOUD COMPUTING, DATA SCIENCE & ENGINEERING (CONFLUENCE 2019), 2019, : 494 - 497
  • [4] An Innovative Method for Hindi Word Sense Disambiguation
    Mishra B.K.
    Jain S.
    SN Computer Science, 4 (6)
  • [5] Understanding the role of conceptual relations in Word Sense Disambiguation
    Fernandez-Amoros, David
    Heradio, Ruben
    EXPERT SYSTEMS WITH APPLICATIONS, 2011, 38 (08) : 9506 - 9516
  • [6] A Literature Survey on Word Sense Disambiguation for the Hindi Language
    Gujjar, Vinto
    Mago, Neeru
    Kumari, Raj
    Patel, Shrikant
    Chintalapudi, Nalini
    Battineni, Gopi
    INFORMATION, 2023, 14 (09)
  • [7] Word Sense Disambiguation in English to Hindi Machine Translation
    Kolte, S. G.
    Bhirud, S. G.
    SOFTWARE AND COMPUTER APPLICATIONS, 2011, 9 : 285 - 290
  • [8] Hindi Word Sense Disambiguation Using Cosine Similarity
    Sarika, D. K.
    Sharma, Dilip Kumar
    PROCEEDINGS OF INTERNATIONAL CONFERENCE ON ICT FOR SUSTAINABLE DEVELOPMENT ICT4SD 2015, VOL 2, 2016, 409 : 801 - 808
  • [9] Hindi Word Sense Disambiguation Using Semantic Relatedness Measure
    Singh, Satyendr
    Singh, Vivek Kumar
    Siddiqui, Tanveer J.
    MULTI-DISCIPLINARY TRENDS IN ARTIFICIAL INTELLIGENCE, 2013, 8271 : 247 - 256
  • [10] Unsupervised Hindi Word Sense Disambiguation based on Network Agglomeration
    Jain, Amita
    Lobiyal, D. K.
    2015 2ND INTERNATIONAL CONFERENCE ON COMPUTING FOR SUSTAINABLE GLOBAL DEVELOPMENT (INDIACOM), 2015, : 195 - 200