Word Embeddings for Arabic Sentiment Analysis

被引:0
|
作者
Altowayan, A. Aziz [1 ]
Tao, Lixin [1 ]
机构
[1] Pace Univ, Dept Comp Sci, New York, NY 10038 USA
来源
2016 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA) | 2016年
关键词
sentiment; word embeddings;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Manual feature extraction is a challenging and time consuming task, especially in a Morphologically Rich Language (MRL) such as Arabic. In this paper, we rely on word embeddings as the main source of features for opinion mining in Arabic text such as tweets, consumer reviews, and news articles. First, we compile a large Arabic corpus from various sources to learn word representations. Second, we train and generate word vectors (embeddings) from the corpus. Third, we use the embeddings in our feature representation for training several binary classifiers to detect subjectivity and sentiment in both Standard Arabic and Dialectal Arabic. We compare our results with other methods in literature; our approach-with no hand-crafted features-achieves a slightly better accuracy than the top hand-crafted methods. To reproduce our results and for further work, we publish the data and code used in our experiments.
引用
收藏
页码:3820 / 3825
页数:6
相关论文
共 50 条
  • [21] Using Word Embeddings for Ontology-Driven Aspect-Based Sentiment Analysis
    de Kok, Sophie
    Frasincar, Flavius
    PROCEEDINGS OF THE 35TH ANNUAL ACM SYMPOSIUM ON APPLIED COMPUTING (SAC'20), 2020, : 834 - 842
  • [22] Weakly supervised topic sentiment joint model with word embeddings
    Fu, Xianghua
    Sun, Xudong
    Wu, Haiying
    Cui, Laizhong
    Huang, Joshua Zhexue
    KNOWLEDGE-BASED SYSTEMS, 2018, 147 : 43 - 54
  • [23] Evaluating Quality of Word Embeddings with Sentiment Polarity Identification Task
    Indurthi, Vijayasaradhi
    Oota, Subba Reddy
    SEMANTIC WEB CHALLENGES, SEMWEBEVAL 2018, 2018, 927 : 232 - 237
  • [24] Generating Word Embeddings from an Extreme Learning Machine for Sentiment Analysis and Sequence Labeling Tasks
    Lauren, Paula
    Qu, Guangzhi
    Yang, Jucheng
    Watta, Paul
    Huang, Guang-Bin
    Lendasse, Amaury
    COGNITIVE COMPUTATION, 2018, 10 (04) : 625 - 638
  • [25] Persian Sentiment Analysis without Training Data Using Cross-Lingual Word Embeddings
    Aliramezani, Mohammad
    Doostmohammadi, Ehsan
    Bokaei, Mohammad Hadi
    Sameti, Hossien
    2020 10TH INTERNATIONAL SYMPOSIUM ON TELECOMMUNICATIONS (IST), 2020, : 78 - 82
  • [26] Generating Word Embeddings from an Extreme Learning Machine for Sentiment Analysis and Sequence Labeling Tasks
    Paula Lauren
    Guangzhi Qu
    Jucheng Yang
    Paul Watta
    Guang-Bin Huang
    Amaury Lendasse
    Cognitive Computation, 2018, 10 : 625 - 638
  • [27] Unlock big Data Emotions: Weighted Word Embeddings for sentiment Classification
    Dai, Xiangfeng
    Prout, Bob
    2016 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2016, : 3833 - 3838
  • [28] Comparative study of word embeddings models and their usage in Arabic language applications
    Suleiman, Dima
    Awajan, Arafat
    2018 19TH INTERNATIONAL ARAB CONFERENCE ON INFORMATION TECHNOLOGY (ACIT), 2018, : 64 - 70
  • [29] Arabic Quran Verses Authentication Using Deep Learning and Word Embeddings
    Touati-Hamad, Zineb
    Laouar, Mohamed Ridda
    Bendib, Issam
    Hakak, Saqib
    INTERNATIONAL ARAB JOURNAL OF INFORMATION TECHNOLOGY, 2022, 19 (04) : 681 - 688
  • [30] Multi-domain sentiment analysis with mimicked and polarized word embeddings for human-robot interaction
    Atzeni, Mattia
    Recupero, Diego Reforgiato
    FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2020, 110 : 984 - 999