An Automated Machine Learning Approach for Sentiment Classification of Bengali E-Commerce Sites

Cited by: 4
Authors
Sarowar, Md Golam [1]
Rahman, Mushfiqur [1]
Ali, Md Nawab Yousuf [1]
Rakib, Omor Faruk [1]
Affiliations
[1] East West Univ, Dept Comp Sci & Engn, Dhaka, Bangladesh
Source
2019 IEEE 5TH INTERNATIONAL CONFERENCE FOR CONVERGENCE IN TECHNOLOGY (I2CT) | 2019
Keywords
E-commerce sites; K Nearest Neighbor (KNN); Support Vector Machine (SVM); Bangla StopWords Database; Random Forest;
DOI
10.1109/i2ct45611.2019.9033741
Chinese Library Classification (CLC) number
T [Industrial Technology];
Discipline classification code
08;
Abstract
E-commerce reviews and comments about specific products disclose consumers' perceptions and attitudes. These attitudes are most useful to new customers who are interested in a product. Meanwhile, an ever-increasing number of reviews and comments is stored daily, and the number of people buying goods online is growing considerably. A record of the user emotions associated with each product benefits both makers and customers. However, as the volume of data on e-commerce websites grows, manual analysis has become almost impossible, and analysis at this scale is hard to imagine without an automated machine learning approach. Machine learning approaches for this task therefore need to improve. To this end, this work proposes a hybrid machine learning approach that combines different machine learning techniques. It also contributes its own Bangla StopWords database of approximately 900 words. Input data are first tokenized using the Python NLTK library and filtered using the StopWords database. The data are then converted from strings to numerical form using the TF-IDF (Term Frequency-Inverse Document Frequency) information retrieval mechanism and finally used for training with K-nearest neighbor (KNN) combined with Support Vector Machine (SVM). Our proposed approach, a KNN-based SVM with normalization and StopWords filtering, demonstrates superiority in a comparative study against Principal Component Analysis (PCA) with a Convolutional Neural Network (CNN), Random Forest, and Logistic Regression.
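The preprocessing steps named in the abstract (tokenization, stop-word filtering, TF-IDF weighting) can be illustrated with a minimal, standard-library-only sketch. It is not the authors' implementation. The four-word stop-word set stands in for their roughly 900-word database, whitespace splitting stands in for NLTK tokenization, the toy reviews and labels are invented for illustration, and all function names are hypothetical. Only a cosine-similarity KNN vote is shown; the SVM stage is omitted because the abstract does not say how KNN and SVM are combined.

```python
import math
from collections import Counter

# Hypothetical stand-in for the authors' ~900-word Bangla stop-word database.
STOPWORDS = {"এই", "এবং", "টি", "খুব"}

def tokenize(text):
    """Split on whitespace (a stand-in for NLTK) and drop stop words."""
    return [tok for tok in text.split() if tok not in STOPWORDS]

def fit_idf(docs):
    """Smoothed inverse document frequency: log((1 + n) / (1 + df)) + 1."""
    n = len(docs)
    df = Counter(term for doc in docs for term in set(doc))
    return {term: math.log((1 + n) / (1 + count)) + 1 for term, count in df.items()}

def tfidf(doc, idf):
    """L2-normalized sparse TF-IDF vector; terms unseen in training are ignored."""
    tf = Counter(tok for tok in doc if tok in idf)
    vec = {term: count * idf[term] for term, count in tf.items()}
    norm = math.sqrt(sum(w * w for w in vec.values()))
    return {term: w / norm for term, w in vec.items()} if norm else {}

def cosine(a, b):
    """Dot product of two L2-normalized sparse vectors."""
    return sum(w * b.get(term, 0.0) for term, w in a.items())

def knn_predict(query, train_vecs, labels, k=3):
    """Majority vote among the k training vectors most similar to the query."""
    ranked = sorted(zip(train_vecs, labels),
                    key=lambda pair: cosine(query, pair[0]), reverse=True)
    votes = Counter(label for _, label in ranked[:k])
    return votes.most_common(1)[0][0]

# Toy training data, invented for illustration only.
REVIEWS = [
    ("পণ্যটি খুব ভালো", "positive"),
    ("ভালো মান এবং দ্রুত ডেলিভারি", "positive"),
    ("খুব খারাপ পণ্য", "negative"),
    ("খারাপ মান এবং দেরি ডেলিভারি", "negative"),
]
TRAIN_DOCS = [tokenize(text) for text, _ in REVIEWS]
LABELS = [label for _, label in REVIEWS]
IDF = fit_idf(TRAIN_DOCS)
TRAIN_VECS = [tfidf(doc, IDF) for doc in TRAIN_DOCS]

def predict(text, k=3):
    """Run the full sketch pipeline on one raw review string."""
    return knn_predict(tfidf(tokenize(text), IDF), TRAIN_VECS, LABELS, k)
```

Calling `predict` on a raw review string runs tokenization, filtering, TF-IDF weighting, and the KNN vote in sequence. A real system would replace the toy data with the labeled review corpus and the full stop-word database.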
Pages: 5