Aspect Extraction in Domain Lexicon Generation: A New Frequency-Based Approach

Cited by: 0
Authors
Zayet, Tasnim M. A. [1]
Ismail, Maizatul Akmar [1]
Varathan, Kasturi Dewi [1]
Affiliations
[1] Univ Malaya, Fac Comp Sci & Informat Technol, Dept Informat Syst, Kuala Lumpur 50603, Malaysia
Source
IEEE ACCESS | 2024 / Vol. 12
Keywords
Feature extraction; Data mining; Frequency-domain analysis; Social networking (online); Sentiment analysis; Semantics; Accuracy; Statistical analysis; Text processing; Text mining; Aspect; domain lexicon; frequency-based; sentiment analysis; statistical; word extraction; context; FEATURE-SELECTION; SENTIMENT
DOI
10.1109/ACCESS.2024.3442930
Chinese Library Classification number
TP [Automation technology, computer technology]
Subject classification code
0812
Abstract
Building domain sentiment lexicons has become an attractive field in recent years, owing to the growing volume of user-generated data on the internet and to the fact that opinion words carry different sentiments in different contexts. Domain lexicons mainly consist of opinion pairs and their associated sentiment. Each opinion pair is formed by a domain word and one of its associated opinion words. Therefore, generating a domain lexicon from a domain corpus requires extracting domain words together with their associated opinion words. Frequency-based approaches are among the traditional methods for this task; however, ambiguity is a major concern with them. This paper introduces a frequency-based equation that considers the context of words for domain word extraction. The equation was tested on five Amazon review datasets and outperformed the other frequency-based equations it was compared against in terms of recall and precision. As a result, the generated lexicons were more relevant to their domains.
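To make the idea of a frequency-based approach concrete, the following is a minimal sketch of a generic document-frequency baseline for picking candidate domain words. It is not the equation proposed in the paper, which this record does not reproduce. The function names, the stopword list, the threshold `min_doc_ratio`, and the toy reviews are all assumptions made for illustration.

```python
import re
from collections import Counter

# Hypothetical, deliberately tiny stopword list (an assumption for this sketch).
STOPWORDS = {"the", "a", "an", "is", "it", "and", "but", "this", "was", "of", "to", "in"}


def tokenize(text):
    """Lowercase the text and split it into alphabetic tokens."""
    return re.findall(r"[a-z]+", text.lower())


def frequent_terms(reviews, min_doc_ratio=0.5):
    """Return the terms that occur in at least min_doc_ratio of the reviews."""
    doc_freq = Counter()
    for review in reviews:
        # Count each term once per review (document frequency, not raw count).
        doc_freq.update(set(tokenize(review)) - STOPWORDS)
    threshold = min_doc_ratio * len(reviews)
    return sorted(term for term, df in doc_freq.items() if df >= threshold)


# Toy reviews invented for illustration only.
reviews = [
    "The battery is great but the screen is dim",
    "Battery life is poor",
    "Great screen and great battery",
    "The charger broke",
]
candidates = frequent_terms(reviews)
print(candidates)  # ['battery', 'great', 'screen']
```

In this toy run the opinion word "great" passes the threshold alongside "battery" and "screen", which shows one limitation of using frequency alone: it cannot tell a domain word from a frequently used opinion word without additional information such as part of speech or context.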
Pages: 138972-138984
Page count: 13
    Kambhampati, S
    20TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING, PROCEEDINGS, 2004, : 387 - 398