A Supervised Approach for Word Sense Disambiguation based on Arabic Diacritics

被引:0
|
作者
Alrakaf, Alaa Abdullah [1 ]
Rahman, Sk. Md. Mizanur [1 ]
机构
[1] King Saud Univ, Coll Comp & Informat Sci, Riyadh, Saudi Arabia
来源
2016 5TH INTERNATIONAL CONFERENCE ON INFORMATICS, ELECTRONICS AND VISION (ICIEV) | 2016年
关键词
Arabic natural language processing; Machine learning; Machine translation; Naive Bayes Classifier; word sense disambiguation;
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Since the last two decades' Arabic natural language processing (ANLP) has become increasingly much more important. One of the key issues related to ANLP is ambiguity. In Arabic language different pronunciation of one word may have a different meaning. Furthermore, ambiguity also has an impact on the effectiveness and efficiency of Machine Translation (MT). The issue of ambiguity has limited the usefulness and accuracy of the translation from Arabic to English. The lack of Arabic resources makes ambiguity problem more complicated. Additionally, the orthographic level of representation cannot specify the exact meaning of the word. This paper looked at the diacritics of Arabic language and used them to disambiguate an ambiguous word. The proposed approach of word sense disambiguation used Diacritizer application to Diacritize Arabic text. Then find the most accurate sense of an ambiguous word using Naive Bayes Classifier. Our system gets 91% precision, and 12.11% error rate. This experimental study proves that using Arabic Diacritics with Naive Bayes Classifier enhances the accuracy of choosing the appropriate sense for ambiguous Arabic words.
引用
收藏
页码:1015 / 1021
页数:7
相关论文
共 50 条
  • [21] A novel approach to word sense disambiguation in Bengali language using supervised methodology
    Alok Ranjan Pal
    Diganta Saha
    Niladri Sekhar Dash
    Sudip Kumar Naskar
    Antara Pal
    Sādhanā, 2019, 44
  • [22] Word sense disambiguation based on word sense clustering
    Anaya-Sanchez, Henry
    Pons-Porrata, Aurora
    Berlanga-Llavori, Rafael
    ADVANCES IN ARTIFICIAL INTELLIGENCE - IBERAMIA-SBIA 2006, PROCEEDINGS, 2006, 4140 : 472 - 481
  • [23] A novel approach to word sense disambiguation in Bengali language using supervised methodology
    Pal, Alok Ranjan
    Saha, Diganta
    Dash, Niladri Sekhar
    Naskar, Sudip Kumar
    Pal, Antara
    SADHANA-ACADEMY PROCEEDINGS IN ENGINEERING SCIENCES, 2019, 44 (08):
  • [24] Supervised Word Sense Disambiguation: Facing Current Challenges
    Martiez Iraola, David
    PROCESAMIENTO DEL LENGUAJE NATURAL, 2005, (34):
  • [25] Word sense disambiguation by semi-supervised learning
    Niu, ZY
    Ji, DH
    Tan, CL
    Yang, LP
    COMPUTATIONAL LINGUISTICS AND INTELLIGENT TEXT PROCESSING, 2005, 3406 : 238 - 241
  • [26] Assamese Word Sense Disambiguation using Supervised Learning
    Borah, Pranjal Protim
    Talukdar, Gitimoni
    Baruah, Arup
    2014 INTERNATIONAL CONFERENCE ON CONTEMPORARY COMPUTING AND INFORMATICS (IC3I), 2014, : 946 - 950
  • [27] Enhancing Arabic Word Sense Disambiguation with Ensemble BERT-Based Models
    Djaidri, Asma
    Aliane, Hassina
    Azzoune, Hamid
    ARABIC LANGUAGE PROCESSING: FROM THEORY TO PRACTICE, ICALP 2023, PT I, 2025, 2339 : 261 - 272
  • [28] An Improved Approach to Word Sense Disambiguation
    Sachdeva, Pradeep
    Verma, Surbhi
    Singh, Sandeep Kumar
    2014 IEEE INTERNATIONAL SYMPOSIUM ON SIGNAL PROCESSING AND INFORMATION TECHNOLOGY (ISSPIT), 2014, : 235 - 240
  • [29] Stacking of BERT and CNN Models for Arabic Word Sense Disambiguation
    Saidi, Rakia
    Jarray, Fethi
    ACM TRANSACTIONS ON ASIAN AND LOW-RESOURCE LANGUAGE INFORMATION PROCESSING, 2023, 22 (11)
  • [30] Word Sense Disambiguation Based on Semi-Supervised Convolutional Neural Networks
    Zhang C.
    Tang L.
    Gao X.
    Xinan Jiaotong Daxue Xuebao/Journal of Southwest Jiaotong University, 2022, 57 (01): : 11 - 17and27