Effectiveness of Normalization Over Processing of Textual Data Using Hybrid Approach Sentiment Analysis

Cited by: 1
Authors
Johal, Sukhnandan Kaur [1 ]
Mohana, Rajni [2 ]
Affiliations
[1] Thapar Inst Engn & Technol, Dept CSED, Patiala, Punjab, India
[2] Jaypee Univ Informat Technol, Waknaghat, Himachal Pradesh, India
Keywords
Informal Text; Natural Language Processing; Normalization; Opinion Mining; Sentiment Analysis; Sentistrength;
DOI
10.4018/IJGHPC.2020070103
Chinese Library Classification number
TP301 [Theory, Methods];
Discipline classification code
081202;
Abstract
Various natural language processing tasks feed into computerized decision support systems. Among these, sentiment analysis is gaining increasing attention. Most sentiment analysis relies on social media content, which is highly un-normalized, and this hinders the performance of decision support systems. To improve performance, the data must be processed efficiently. This article proposes a novel method for normalizing web data during the pre-processing phase, aimed at improving results across different natural language processing tasks. The technique is applied here to data for sentiment analysis. The performance of different learning models is analysed using precision, recall, F-measure, and fallout for normalized and un-normalized sentiment analysis. Results show that after normalization some documents shift their polarity, i.e. from negative to positive. Experimental results show that normalized data processing outperforms un-normalized data processing, with better accuracy.
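The four evaluation measures named in the abstract can be computed from a confusion matrix for one target class. The sketch below is illustrative only and is not the authors' evaluation code: the function name `binary_metrics`, the label strings, and the toy data are assumptions made for this example.

```python
def binary_metrics(y_true, y_pred, positive="positive"):
    """Return precision, recall, F-measure, and fallout for one class.

    Hypothetical helper: treats `positive` as the target class and
    every other label as the negative class.
    """
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == positive and p == positive)
    fp = sum(1 for t, p in pairs if t != positive and p == positive)
    fn = sum(1 for t, p in pairs if t == positive and p != positive)
    tn = sum(1 for t, p in pairs if t != positive and p != positive)

    # Guard every ratio against an empty denominator.
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    f_measure = (
        2 * precision * recall / (precision + recall)
        if (precision + recall)
        else 0.0
    )
    # Fallout is the false positive rate: FP / (FP + TN).
    fallout = fp / (fp + tn) if (fp + tn) else 0.0

    return {
        "precision": precision,
        "recall": recall,
        "f_measure": f_measure,
        "fallout": fallout,
    }


# Toy labels, invented for illustration (not data from the paper).
gold = ["positive", "positive", "negative", "negative", "negative"]
pred = ["positive", "negative", "positive", "negative", "negative"]
metrics = binary_metrics(gold, pred)
print(metrics)
```

Running the same function on predictions made from normalized and from un-normalized text would give the kind of side-by-side comparison the abstract describes.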
Pages: 43 - 56
Page count: 14
Related papers
50 in total
  • [21] A Hybrid Multilingual Fuzzy-Based Approach to the Sentiment Analysis Problem Using SentiWordNet
    Madani, Youness
    Erritali, Mohammed
    Jamaa, Bengourram
    Sailhan, Francoise
    INTERNATIONAL JOURNAL OF UNCERTAINTY FUZZINESS AND KNOWLEDGE-BASED SYSTEMS, 2020, 28 (03) : 361 - 390
  • [22] Sentiment Analysis of Top Colleges in India Using Twitter Data
    Mamgain, Nehal
    Mehta, Ekta
    Mittal, Ankush
    Bhatt, Gaurav
    2016 INTERNATIONAL CONFERENCE ON COMPUTATIONAL TECHNIQUES IN INFORMATION AND COMMUNICATION TECHNOLOGIES (ICCTICT), 2016,
  • [23] An Approach to Sentiment Analysis on Unstructured Data in Big Data Environment
    Borikar, Dilipkumar A.
    Chandak, Manoj B.
    SMART TRENDS IN INFORMATION TECHNOLOGY AND COMPUTER COMMUNICATIONS, SMARTCOM 2016, 2016, 628 : 169 - 176
  • [24] A Survey of Sentiment Analysis of Internet Textual Data and Application to Pakistani YouTube User Comments
    Rani, Mehwish
    Latif, Seemab
    Tahir, Muhaammad Ali
    Mumtaz, Rafia
    2021 INTERNATIONAL CONFERENCE ON DIGITAL FUTURES AND TRANSFORMATIVE TECHNOLOGIES (ICODT2), 2021,
  • [25] Sentiment Groups as Features of a Classification Model Using a Spanish Sentiment Lexicon: A Hybrid Approach
    Gutierrez, Ernesto
    Cervantes, Ofelia
    Baez-Lopez, David
    Alfredo Sanchez, J.
    PATTERN RECOGNITION (MCPR 2015), 2015, 9116 : 258 - 268
  • [26] Tehran stock exchange prediction using sentiment analysis of online textual opinions
    Ghahfarrokhi, Arezoo
    Shamsfard, Mehrnoush
    INTELLIGENT SYSTEMS IN ACCOUNTING FINANCE & MANAGEMENT, 2020, 27 (01) : 22 - 37
  • [27] Sentiment analysis on cross-domain textual data using classical and deep learning approaches
    Paramesha, K.
    Gururaj, H. L.
    Nayyar, Anand
    Ravishankar, K. C.
    MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 82 (20) : 30759 - 30782
  • [28] Sentiment analysis on cross-domain textual data using classical and deep learning approaches
    K. Paramesha
    H. L. Gururaj
    Anand Nayyar
    K. C. Ravishankar
    Multimedia Tools and Applications, 2023, 82 : 30759 - 30782
  • [29] A Data Augmentation Approach to Sentiment Analysis of MOOC Reviews
    Li, Guangmin
    Zhou, Long
    Tong, Qiang
    Ding, Yi
    Qi, Xiaolin
    Liu, Hang
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2024, 15 (08) : 1258 - 1264
  • [30] Sentiment Analysis Using Lexicon Based Approach
    Singh, Vijendra
    Singh, Gurdeep
    Rastogi, Priyanka
    Deswal, Devanshi
    2018 FIFTH INTERNATIONAL CONFERENCE ON PARALLEL, DISTRIBUTED AND GRID COMPUTING (IEEE PDGC), 2018, : 13 - 18