Comparative study of word embeddings models and their usage in Arabic language applications

被引:0
|
作者
Suleiman, Dima [1 ,2 ]
Awajan, Arafat [1 ]
机构
[1] Princess Sumaya Univ Technol, King Hussein Fac Comp Sci, Dept Comp Sci, Amman, Jordan
[2] Univ Jordan, Dept Informat Technol, Amman, Jordan
来源
2018 19TH INTERNATIONAL ARAB CONFERENCE ON INFORMATION TECHNOLOGY (ACIT) | 2018年
关键词
word embeddings; deep learning; sentiment analysis; word2vec; Glove; semantic similarity; CBOW; Skip-grant;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Word embeddings is the representation of the text using vectors such that the words that have similar syntax and semantic will have similar vector representation. Representing words using vectors is very crucial for most of natural language processing applications. In natural language, when using neural network for processing, the words vectors will be fed as input to the network. In this paper, a comparative study of several word embeddings models is conducted including Glove and the two approaches of word2vec model called CBOW and Skip-gram. Furthermore, this study surveying most of the state-of-art of using word embeddings in Arabic language applications such as sentiment analysis, semantic similarity, short answer grading, information retrieval, paraphrase identification, plagiarism detection and Textual Entailment.
引用
收藏
页码:64 / 70
页数:7
相关论文
共 50 条
  • [1] A Comparative Study of Pre-trained Word Embeddings for Arabic Sentiment Analysis
    Zouidine, Mohamed
    Khalil, Mohammed
    2022 IEEE 46TH ANNUAL COMPUTERS, SOFTWARE, AND APPLICATIONS CONFERENCE (COMPSAC 2022), 2022, : 1243 - 1248
  • [2] Pretrained Transformer Language Models Versus Pretrained Word Embeddings for the Detection of Accurate Health Information on Arabic Social Media: Comparative Study
    Albalawi, Yahya
    Nikolov, Nikola S.
    Buckley, Jim
    JMIR FORMATIVE RESEARCH, 2022, 6 (06)
  • [3] A Comparative Study of Word Embedding Models for Arabic Text Processing
    Assiri, Fatmah
    Alghamdi, Nuha
    INTERNATIONAL JOURNAL OF COMPUTER SCIENCE AND NETWORK SECURITY, 2022, 22 (08): : 399 - 403
  • [4] A Comparative Study of Word Embedding Models for Arabic Text Processing
    Assiri, Fatmah
    Alghamdi, Nuha
    INTERNATIONAL JOURNAL OF COMPUTER SCIENCE AND NETWORK SECURITY, 2022, 22 (09): : 399 - 403
  • [5] Word Embeddings for Arabic Sentiment Analysis
    Altowayan, A. Aziz
    Tao, Lixin
    2016 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2016, : 3820 - 3825
  • [6] The Impact of Arabic Diacritization on Word Embeddings
    Abbache, Mohamed
    Abbache, Ahmed
    Xu, Jingwen
    Meziane, Farid
    Wen, Xianbin
    ACM TRANSACTIONS ON ASIAN AND LOW-RESOURCE LANGUAGE INFORMATION PROCESSING, 2023, 22 (06)
  • [7] Arabic Text Classification Based on Word and Document Embeddings
    El Mahdaouy, Abdelkader
    Gaussier, Eric
    El Alaoui, Said Ouatik
    PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON ADVANCED INTELLIGENT SYSTEMS AND INFORMATICS 2016, 2017, 533 : 32 - 41
  • [8] Utility of word embeddings from large language models in medical diagnosis
    Yazdani, Shahram
    Henry, Ronald Claude
    Byrne, Avery
    Henry, Isaac Claude
    JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, 2025, 32 (03) : 526 - 534
  • [9] Comprehensive Evaluation of Word Embeddings for Highly Inflectional Language
    Drozda, Pawel
    Sopyla, Krzysztof
    Lewalski, Juliusz
    ADVANCES IN COMPUTATIONAL COLLECTIVE INTELLIGENCE (ICCCI 2021), 2021, 1463 : 597 - 607
  • [10] A Comparative Study of Deep Learning Approaches for Arabic Language Processing
    Mohamed, Mahmoud
    Alosman, Khaled
    JORDAN JOURNAL OF ELECTRICAL ENGINEERING, 2025, 11 (01): : 18 - 34