Aspect Extraction in Domain Lexicon Generation: A New Frequency-Based Approach

被引:0
作者
Zayet, Tasnim M. A. [1 ]
Ismail, Maizatul Akmar [1 ]
Varathan, Kasturi Dewi [1 ]
机构
[1] Univ Malaya, Fac Comp Sci & Informat Technol, Dept Informat Syst, Kuala Lumpur 50603, Malaysia
来源
IEEE ACCESS | 2024年 / 12卷
关键词
Feature extraction; Data mining; Frequency-domain analysis; Social networking (online); Sentiment analysis; Semantics; Accuracy; Statistical analysis; Text processing; Text mining; Aspect; domain lexicon; frequency-based; sentiment analysis; statistical; word extraction; context; FEATURE-SELECTION; SENTIMENT;
D O I
10.1109/ACCESS.2024.3442930
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Domain sentimental lexicon building become an attractive field in recent years. This is due to the increased number of users' generated data through the internet besides the different sentiments of opinion words in different contexts. Domain lexicons mainly consist of opinion pairs and their associated sentiment. Any opinion pair is formed by a domain word and one of its associated opinion words. Therefore, to generate a domain lexicon from a domain corpus, domain word extraction is needed with their associated opinion words. One of the traditional approaches is frequency-based approaches. However, the ambiguity problem is a big concern of these approaches. This paper introduced a frequency-based equation that considers the context of the words for domain word extraction. The equation was tested on five Amazon reviews datasets and it proved its efficiency over other used frequency-based equations in terms of recall and precision. Therefore, more related lexicons to the domains were generated.
引用
收藏
页码:138972 / 138984
页数:13
相关论文
共 49 条
  • [41] A review of clustering techniques and developments
    Saxena, Amit
    Prasad, Mukesh
    Gupta, Akshansh
    Bharill, Neha
    Patel, Om Prakash
    Tiwari, Aruna
    Er, Meng Joo
    Ding, Weiping
    Lin, Chin-Teng
    [J]. NEUROCOMPUTING, 2017, 267 : 664 - 681
  • [42] Siddiqi S., 2015, Int J Comput Appl, V109, P18, DOI DOI 10.5120/19161-0607
  • [43] Deep Learning Based Abstractive Text Summarization: Approaches, Datasets, Evaluation Measures, and Challenges
    Suleiman, Dima
    Awajan, Arafat
    [J]. MATHEMATICAL PROBLEMS IN ENGINEERING, 2020, 2020
  • [44] In vitro evaluation of frictional force of a novel elastic bendable orthodontic wire
    Takada, Megumi
    Nakajima, Akira
    Kuroda, Shingo
    Horiuchi, Shinya
    Shimizu, Noriyoshi
    Tanaka, Eiji
    [J]. ANGLE ORTHODONTIST, 2018, 88 (05) : 602 - 610
  • [45] RETRACTED: Using the Ship-Gram Model for Japanese Keyword Extraction Based on News Reports (Retracted Article)
    Teng, Miao
    [J]. COMPLEXITY, 2021, 2021
  • [46] Helmholtz principle based supervised and unsupervised feature selection methods for text mining
    Tutkan, Melike
    Ganiz, Murat Can
    Akyokus, Selim
    [J]. INFORMATION PROCESSING & MANAGEMENT, 2016, 52 (05) : 885 - 910
  • [47] The Construction of Sentiment Lexicon Based on Context-Dependent Part-of-Speech Chunks for Semantic Disambiguation
    Yin, Fulian
    Wang, Yanyan
    Liu, Jianbo
    Lin, Lisha
    [J]. IEEE ACCESS, 2020, 8 (08): : 63359 - 63367
  • [48] Yusof Nor Nadiah, 2018, International Journal of Machine Learning and Computing, V8, DOI 10.18178/ijmlc.2018.8.4.719
  • [49] Investigating transportation research based on social media analysis: a systematic mapping review
    Zayet, Tasnim M. A.
    Ismail, Maizatul Akmar
    Varathan, Kasturi Dewi
    Noor, Rafidah M. D.
    Chua, Hui Na
    Lee, Angela
    Low, Yeh Ching
    Singh, Sheena Kaur Jaswant
    [J]. SCIENTOMETRICS, 2021, 126 (08) : 6383 - 6421