Constructing automatic domain-specific sentiment lexicon using KNN search via terms discrimination vectors

Cited by: 4
Authors
Alqasemi F. [1]
Abdelwahab A. [1]
Abdelkader H. [1]
Affiliation
[1] Information Systems Department, Menoufia University, Menoufia
Keywords
KNN; lexicon-based SA; natural language processing; sentiment analysis; sentiment lexicon; sentiment seeds
DOI
10.1080/1206212X.2017.1409477
Abstract
Textual web content is a valuable source of knowledge for decision-makers, as are text analytics applications. Sentiment analysis (SA) is a text mining field in which text is analyzed to recognize the writer's implied opinion. This paper presents a new approach to automatically constructing an Arabic sentiment lexicon. The popular KNN search algorithm is used for this purpose, with the cosine distance between seed terms and corpus terms employed in the KNN search query. Lexicon terms are generated starting from sentiment seeds, and the seed terms are augmented via an Arabic-specific NLP-based algorithm, which helps to improve the seed selection process. The term discrimination vector (TDV) is the main input to the KNN query. TDV components are computed for each corpus term, and each TDV is composed of four term-weighting techniques. According to the experimental results, TDV achieved better results than the traditional TF-IDF method at a lower computational cost. The constructed lexicons also outperformed premade lexicons in accuracy. © 2017 Informa UK Limited, trading as Taylor & Francis Group.
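The KNN step described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract does not name the four term-weighting techniques, the value of k, or how polarity is assigned, so the four-component vectors below are made-up placeholder values, the English terms stand in for Arabic corpus terms, and the rule that a neighbor inherits its seed's polarity is an assumption. The function names are hypothetical.

```python
import math


def cosine_distance(u, v):
    """Return 1 - cosine similarity; 1.0 if either vector is all zeros."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0 or norm_v == 0:
        return 1.0
    return 1.0 - dot / (norm_u * norm_v)


def knn_terms(query_vector, term_vectors, k):
    """Return the k terms whose vectors are closest to query_vector."""
    ranked = sorted(
        term_vectors,
        key=lambda term: cosine_distance(query_vector, term_vectors[term]),
    )
    return ranked[:k]


def build_lexicon(seeds, term_vectors, k):
    """Expand a seed lexicon with each seed's k nearest corpus terms.

    Assumption: a neighbor inherits the polarity of the seed that
    retrieved it; the first seed to claim a term wins.
    """
    lexicon = dict(seeds)
    candidates = {t: v for t, v in term_vectors.items() if t not in seeds}
    for seed, polarity in seeds.items():
        for term in knn_terms(term_vectors[seed], candidates, k):
            lexicon.setdefault(term, polarity)
    return lexicon


# Placeholder 4-component vectors (illustrative values, not real TDVs).
term_vectors = {
    "good": [0.9, 0.1, 0.8, 0.2],
    "excellent": [0.8, 0.2, 0.9, 0.1],
    "bad": [0.1, 0.9, 0.2, 0.8],
    "terrible": [0.2, 0.8, 0.1, 0.9],
    "table": [0.5, 0.5, 0.5, 0.5],
}
seeds = {"good": "positive", "bad": "negative"}

lexicon = build_lexicon(seeds, term_vectors, k=1)
print(lexicon)
```

With k=1, each seed pulls in only its single closest term, so a term that is not strongly aligned with either seed (here "table") stays out of the lexicon. A larger k widens coverage but admits more loosely related terms.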
Pages: 127-137
Page count: 10