Word Embeddings for Arabic Sentiment Analysis

被引:0
作者
Altowayan, A. Aziz [1 ]
Tao, Lixin [1 ]
机构
[1] Pace Univ, Dept Comp Sci, New York, NY 10038 USA
来源
2016 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA) | 2016年
关键词
sentiment; word embeddings;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Manual feature extraction is a challenging and time consuming task, especially in a Morphologically Rich Language (MRL) such as Arabic. In this paper, we rely on word embeddings as the main source of features for opinion mining in Arabic text such as tweets, consumer reviews, and news articles. First, we compile a large Arabic corpus from various sources to learn word representations. Second, we train and generate word vectors (embeddings) from the corpus. Third, we use the embeddings in our feature representation for training several binary classifiers to detect subjectivity and sentiment in both Standard Arabic and Dialectal Arabic. We compare our results with other methods in literature; our approach-with no hand-crafted features-achieves a slightly better accuracy than the top hand-crafted methods. To reproduce our results and for further work, we publish the data and code used in our experiments.
引用
收藏
页码:3820 / 3825
页数:6
相关论文
共 50 条
  • [41] Bias in Word Embeddings
    Papakyriakopoulos, Orestis
    Hegelich, Simon
    Serrano, Juan Carlos Medina
    Marco, Fabienne
    FAT* '20: PROCEEDINGS OF THE 2020 CONFERENCE ON FAIRNESS, ACCOUNTABILITY, AND TRANSPARENCY, 2020, : 446 - 457
  • [42] Analyzing Distances in Word Embeddings and Their Relation with Seme Analysis
    Gijon Agudo, Manuel
    Vilalta Arias, Armand
    Garcia-Gasulla, Dario
    ARTIFICIAL INTELLIGENCE RESEARCH AND DEVELOPMENT, 2019, 319 : 407 - 416
  • [43] isiZulu Word Embeddings
    Dlamini, Sibonelo
    Jembere, Edgar
    Pillay, Anban
    van Niekerk, Brett
    2021 CONFERENCE ON INFORMATION COMMUNICATIONS TECHNOLOGY AND SOCIETY (ICTAS), 2021, : 121 - 126
  • [44] Word Embeddings for the Analysis of Ideological Placement in Parliamentary Corpora
    Rheault, Ludovic
    Cochrane, Christopher
    POLITICAL ANALYSIS, 2020, 28 (01) : 112 - 133
  • [45] HieNN-DWE: A hierarchical neural network with dynamic word embeddings for document level sentiment classification
    Liu, Fagui
    Zheng, Lailei
    Zheng, Jingzhong
    NEUROCOMPUTING, 2020, 403 : 21 - 32
  • [46] Detecting Arabic Offensive Language in Microblogs Using Domain-Specific Word Embeddings and Deep Learning
    Aljuhani, Khulood O.
    Alyoubi, Khaled H.
    Alotaibi, Fahd S.
    TEHNICKI GLASNIK-TECHNICAL JOURNAL, 2022, 16 (03): : 394 - 400
  • [47] Biomedical Word Sense Disambiguation with Word Embeddings
    Antunes, Rui
    Matos, Sergio
    11TH INTERNATIONAL CONFERENCE ON PRACTICAL APPLICATIONS OF COMPUTATIONAL BIOLOGY & BIOINFORMATICS, 2017, 616 : 273 - 279
  • [48] Getting into bed with embeddings? A comparison of collocations and word embeddings for corpus-assisted discourse analysis
    Batchelor, Jordan
    APPLIED CORPUS LINGUISTICS, 2024, 4 (03):
  • [49] A Comparative Analysis of Word Embeddings Techniques for Italian News Categorization
    Rollo, Federica
    Bonisoli, Giovanni
    Po, Laura
    IEEE ACCESS, 2024, 12 : 25536 - 25552
  • [50] Analysis of Literal and Metaphorical Senses Based on Diachronic Word Embeddings
    Jia, Yuxiang
    Zheng, Yi
    Zan, Hongying
    Wang, Zhimin
    2017 INTERNATIONAL CONFERENCE ON ASIAN LANGUAGE PROCESSING (IALP), 2017, : 346 - 349