Generating Sense Inventories for Ambiguous Arabic Words

被引:2
|
作者
Alian, Marwah [1 ]
Awajan, Arafat [1 ,2 ]
机构
[1] Princess Sumaya Univ Technol, King Hussein Sch Comp Sci, Amman, Jordan
[2] Mutah Univ, Informat Technol Coll, Comp Sci Dept, Amman, Jordan
关键词
Word sense induction; word sense disambiguation; arabic text; sense inventory; DISAMBIGUATION;
D O I
10.34028/iajit/18/3A/8
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The process of selecting the appropriate meaning of an ambigous word according to its context is known as word sense disambiguation. In this research, we generate a number of Arabic sense inventories based on an unsupervised approach and different pre-trained embeddings, such as Aravec, Fasttext, and Arabic-News embeddings. The resulted inventories from the pre-trained embeddings are evaluated to investigate their efficiency in Arabic word sense disambiguation and sentence similarity. The sense inventories are generated using an unsupervised approach that is based on a graph- based word sense inductionalgorithm. Results show that the AravecTwitter inventory achieves the best accuracy of 0.47 for 50 neighbors and a close accuracy to the Fasttext inventory for 200 neighbors while it provides similar accuracy to the Arabic-News inventory for 100neighbors. The experiment of replacing ambiguous words with their sense vectors is tested for sentence similarity using all sense inventories and the results show that using Aravec-Twitter sense inventoryprovides a better correlation value.
引用
收藏
页码:446 / 451
页数:6
相关论文
共 50 条
  • [1] Sense Inventories for Arabic Texts
    Alian, Marwah
    Awajan, Arafat
    2020 21ST INTERNATIONAL ARAB CONFERENCE ON INFORMATION TECHNOLOGY (ACIT), 2020,
  • [2] Arabic word sense disambiguation using sense inventories
    Alian M.
    Awajan A.
    International Journal of Information Technology, 2023, 15 (2) : 735 - 744
  • [3] Sense Disambiguation "Ambiguous Sensation"? Evaluating Sense Inventories for verbal WSD in Hungarian
    Kuti, Judit
    Heja, Eniko
    Sass, Balint
    LREC 2010 - SEVENTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2010, : H23 - H29
  • [4] One Classifier for All Ambiguous Words: Overcoming Data Sparsity by Utilizing Sense Correlations Across Words
    Choubey, Prafulla Kumar
    Huang, Ruihong
    PROCEEDINGS OF THE 12TH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION (LREC 2020), 2020, : 5978 - 5985
  • [5] Exploring the Impact of Stop Words and Particles on Arabic Word Sense Disambiguation
    Djaidri, Asma
    Aliane, Hassina
    Azzoune, Hamid
    ARABIC LANGUAGE PROCESSING: FROM THEORY TO PRACTICE, ICALP 2023, PT II, 2025, 2340 : 30 - 40
  • [6] Making Sense of Number Words and Arabic Digits: Does Order Count More?
    Sella, Francesco
    Lucangeli, Daniela
    Cohen Kadosh, Roi
    Zorzi, Marco
    CHILD DEVELOPMENT, 2020, 91 (05) : 1456 - 1470
  • [7] Rhythms of Arabic words and Fibonacci words
    Benkoudad, Imad-Eddine
    Azizi, Abdelmalek
    El Amrani, Mouhammed
    JOURNAL OF KING SAUD UNIVERSITY-COMPUTER AND INFORMATION SCIENCES, 2021, 33 (08) : 955 - 962
  • [8] Connect-the-Dots: Bridging Semantics between Words and Definitions via Aligning Word Sense Inventories
    Yao, Wenlin
    Pan, Xiaoman
    Jin, Lifeng
    Chen, Jianshu
    Yu, Dian
    Yu, Dong
    2021 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2021), 2021, : 7741 - 7751
  • [9] WORDS AND MASKS - SENSE OF WORDS
    MEITINGER, S
    CRITIQUE, 1978, 34 (378) : 1034 - 1042
  • [10] PROCESSING AMBIGUOUS WORDS IN CONTEXT
    TABOSSI, P
    BULLETIN OF THE PSYCHONOMIC SOCIETY, 1989, 27 (06) : 492 - 492