The Effects of Features Selection Methods on Spam Review Detection Performance

Cited by: 10
Authors
Etaiwi, Wael [1]
Awajan, Arafat [1]
Affiliation
[1] Princess Sumaya Univ Technol, Amman, Jordan
Source
2017 INTERNATIONAL CONFERENCE ON NEW TRENDS IN COMPUTING SCIENCES (ICTCS) | 2017
Keywords
spam reviews; feature selection; machine learning; spam detection
DOI
10.1109/ICTCS.2017.50
Chinese Library Classification (CLC) number
TP301 [Theory, Methods]
Discipline classification code
081202
Abstract
Online reviews have become a valuable source of information about the overall opinion on products and services, and they may affect decision-making processes such as purchasing a product or service. Fake reviews are considered spam reviews and may have a great impact on online marketplace behavior. Extracting useful features from review text using Natural Language Processing (NLP) is not straightforward, and the choice of features affects overall performance and results. Many types of features can be used for this task, such as Bag-of-Words, linguistic features, word counts, and n-gram features. In this paper, we investigate the effects of two feature selection methods on spam review detection: Bag-of-Words and word counts. Different machine learning algorithms were applied, such as Support Vector Machine, Decision Tree, Naive Bayes, and Random Forest. Experiments were conducted on a labeled, balanced dataset of hotel reviews. Performance is evaluated using several measures, such as precision, recall, and accuracy.
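The two feature types and the evaluation measures named in the abstract can be illustrated with a minimal standard-library sketch. It is not the authors' implementation: the tokenizer, the function names, and the toy reviews are all hypothetical, and "word counts" is assumed here to mean the number of tokens in a review, which the abstract does not define.

```python
from collections import Counter

PUNCT = ".,!?;:\"'()"


def tokenize(text):
    """Lowercase, split on whitespace, and strip surrounding punctuation (assumed scheme)."""
    tokens = (t.strip(PUNCT) for t in text.lower().split())
    return [t for t in tokens if t]


def build_vocabulary(reviews):
    """Sorted list of the distinct tokens seen across all reviews."""
    return sorted({tok for review in reviews for tok in tokenize(review)})


def bag_of_words(review, vocabulary):
    """One term-frequency entry per vocabulary word."""
    counts = Counter(tokenize(review))
    return [counts[word] for word in vocabulary]


def word_count(review):
    """Single feature: number of tokens in the review (assumed reading of 'word counts')."""
    return [len(tokenize(review))]


def evaluate(y_true, y_pred, positive=1):
    """Precision, recall, and accuracy, treating `positive` as the spam label."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == positive and p == positive)
    fp = sum(1 for t, p in pairs if t != positive and p == positive)
    fn = sum(1 for t, p in pairs if t == positive and p != positive)
    tn = sum(1 for t, p in pairs if t != positive and p != positive)
    return {
        "precision": tp / (tp + fp) if tp + fp else 0.0,
        "recall": tp / (tp + fn) if tp + fn else 0.0,
        "accuracy": (tp + tn) / len(pairs) if pairs else 0.0,
    }


# Toy data, invented for illustration only.
reviews = ["Great hotel, great staff!", "Worst stay ever."]
vocabulary = build_vocabulary(reviews)
bow_features = [bag_of_words(r, vocabulary) for r in reviews]
count_features = [word_count(r) for r in reviews]
scores = evaluate([1, 1, 0, 0], [1, 0, 0, 1])
```

Either feature matrix could then be passed to a classifier such as the ones listed in the abstract. The sketch stops at feature extraction and scoring, so that the difference between the two representations stays visible.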
Pages: 116-120
Number of pages: 5