Effectiveness of Normalization Over Processing of Textual Data Using Hybrid Approach Sentiment Analysis

Cited by: 1
Authors
Johal, Sukhnandan Kaur [1]
Mohana, Rajni [2]
Affiliations
[1] Thapar Inst Engn & Technol, Dept CSED, Patiala, Punjab, India
[2] Jaypee Univ Informat Technol, Waknaghat, Himachal Pradesh, India
Funding
National Science Foundation (USA);
Keywords
Informal Text; Natural Language Processing; Normalization; Opinion Mining; Sentiment Analysis; SentiStrength;
DOI
10.4018/IJGHPC.2020070103
Chinese Library Classification number
TP301 [Theory, Methods];
Discipline classification code
081202;
Abstract
Various natural language processing tasks feed into computerized decision support systems. Among these, sentiment analysis is gaining increasing attention. Most sentiment analysis relies on social media content, and this web content is highly un-normalized, which hinders the performance of decision support systems. Improving that performance requires processing the data efficiently. This article proposes a novel method for normalizing web data during the pre-processing phase, aimed at obtaining better results across different natural language processing tasks. The technique is applied here to data for sentiment analysis. The performance of different learning models is analysed using precision, recall, F-measure, and fallout for normalized and un-normalized sentiment analysis. The results show that after normalization some documents shift their polarity, i.e. from negative to positive. Experimental results show that normalized data processing outperforms un-normalized data processing, with better accuracy.
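The four evaluation measures named in the abstract have standard definitions for a binary polarity task. The sketch below is illustrative only and is not taken from the article: the label strings, the function names, and the choice of "positive" as the target class are all assumptions, and a zero denominator is mapped to 0.0 purely for convenience.

```python
from dataclasses import dataclass
from typing import Dict, Sequence


@dataclass
class Confusion:
    """Counts for one target class (hypothetical helper, not from the article)."""
    tp: int  # predicted target, gold target
    fp: int  # predicted target, gold other
    fn: int  # predicted other, gold target
    tn: int  # predicted other, gold other


def confusion(gold: Sequence[str], predicted: Sequence[str],
              target: str = "positive") -> Confusion:
    """Tally the confusion counts for the assumed target label."""
    if len(gold) != len(predicted):
        raise ValueError("gold and predicted must have the same length")
    c = Confusion(0, 0, 0, 0)
    for g, p in zip(gold, predicted):
        if p == target and g == target:
            c.tp += 1
        elif p == target:
            c.fp += 1
        elif g == target:
            c.fn += 1
        else:
            c.tn += 1
    return c


def _ratio(numerator: float, denominator: float) -> float:
    """Return numerator / denominator, or 0.0 when the denominator is zero."""
    return numerator / denominator if denominator else 0.0


def metrics(c: Confusion) -> Dict[str, float]:
    """Precision, recall, F-measure (F1) and fallout (false positive rate)."""
    precision = _ratio(c.tp, c.tp + c.fp)
    recall = _ratio(c.tp, c.tp + c.fn)
    f_measure = _ratio(2 * precision * recall, precision + recall)
    fallout = _ratio(c.fp, c.fp + c.tn)
    return {"precision": precision, "recall": recall,
            "f_measure": f_measure, "fallout": fallout}


if __name__ == "__main__":
    # Toy labels invented for illustration; they are not data from the article.
    gold = ["positive", "positive", "negative", "negative", "positive", "negative"]
    pred = ["positive", "negative", "negative", "positive", "positive", "negative"]
    print(metrics(confusion(gold, pred)))
```

Running the same function once on predictions from un-normalized text and once on predictions from normalized text would give the kind of side-by-side comparison the abstract describes; lower fallout is better, while higher values are better for the other three.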
Pages: 43-56
Page count: 14