Handling Imbalanced Dataset Using SVM and k-NN Approach

Cited by: 9
Authors
Wah, Yap Bee [1]
Abd Rahman, Hezlin Aryani [1]
He, Haibo [2, 3]
Bulgiba, Awang [4]
Affiliations
[1] Univ Teknol MARA Malaysia, Fac Comp & Math Sci, Shah Alam 40450, Malaysia
[2] Univ Rhode Isl, Dept Elect Comp & Biomed Engn, Kingston, RI 02881 USA
[3] Julius Ctr Univ Malaya, Kuala Lumpur, Malaysia
[4] Univ Malaya, Fac Med, Dept Social & Prevent Med, Kuala Lumpur 50603, Malaysia
Source
ADVANCES IN INDUSTRIAL AND APPLIED MATHEMATICS | 2016 / Vol. 1750
Keywords
data mining; classification; imbalanced data; SVM; k-NN
DOI
10.1063/1.4954536
Chinese Library Classification number
O29 [Applied Mathematics]
Discipline classification code
070104
Abstract
Data mining classification methods are affected when the data are imbalanced, that is, when one class of a two-class dependent variable is larger than the other. Many new methods have been developed to handle imbalanced datasets. For binary classification tasks, Support Vector Machine (SVM) is one of the methods reported to give high accuracy in predictive modeling compared with other techniques such as Logistic Regression and Discriminant Analysis. The strength of SVM lies in the robustness of its algorithm and its capability to integrate with kernel-based learning, which results in a more flexible analysis and an optimized solution. Another popular approach to handling imbalanced data is random sampling, such as random undersampling, random oversampling and synthetic sampling. Applying Nearest Neighbours techniques in the sampling approach has been seen as having a bigger advantage than other methods, as it can handle both structured and non-structured data. Some studies implement an ensemble of SVM and Nearest Neighbours with good results. This paper discusses the various methods for handling imbalanced data and illustrates the use of SVM and k-Nearest Neighbours (k-NN) on a real dataset.
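The abstract names random undersampling, random oversampling and k-NN but does not show them. The following is a minimal sketch of those three ideas, not the authors' implementation. The function names, the toy two-feature dataset and the choice of k = 3 are all assumptions made for illustration; the SVM step and synthetic sampling are left out.

```python
import random
from collections import Counter


def _group_by_class(X, y):
    """Collect feature rows under their class label."""
    by_class = {}
    for features, label in zip(X, y):
        by_class.setdefault(label, []).append(features)
    return by_class


def random_undersample(X, y, seed=0):
    """Randomly keep only as many rows per class as the smallest class has."""
    rng = random.Random(seed)
    by_class = _group_by_class(X, y)
    n_min = min(len(rows) for rows in by_class.values())
    X_bal, y_bal = [], []
    for label, rows in by_class.items():
        for features in rng.sample(rows, n_min):
            X_bal.append(features)
            y_bal.append(label)
    return X_bal, y_bal


def random_oversample(X, y, seed=0):
    """Randomly duplicate rows until every class matches the largest class."""
    rng = random.Random(seed)
    by_class = _group_by_class(X, y)
    n_max = max(len(rows) for rows in by_class.values())
    X_bal, y_bal = [], []
    for label, rows in by_class.items():
        extra = [rng.choice(rows) for _ in range(n_max - len(rows))]
        for features in rows + extra:
            X_bal.append(features)
            y_bal.append(label)
    return X_bal, y_bal


def knn_predict(X_train, y_train, query, k=3):
    """Majority vote among the k training rows closest to `query`
    (squared Euclidean distance)."""
    def sq_dist(row):
        return sum((a - b) ** 2 for a, b in zip(row, query))

    nearest = sorted(zip(X_train, y_train), key=lambda pair: sq_dist(pair[0]))[:k]
    votes = Counter(label for _, label in nearest)
    return votes.most_common(1)[0][0]


# Hypothetical toy data (not from the paper): 20 majority rows (class 0)
# near the origin and 4 minority rows (class 1) near (3, 3).
X = [(i * 0.1, j * 0.1) for i in range(5) for j in range(4)]
X += [(3.0, 3.0), (3.1, 2.9), (2.9, 3.1), (3.2, 3.0)]
y = [0] * 20 + [1] * 4

X_under, y_under = random_undersample(X, y, seed=1)
X_over, y_over = random_oversample(X, y, seed=1)
```

Calling `knn_predict(X_under, y_under, (3.0, 3.0), k=3)` classifies a new point against the balanced training set. Undersampling discards majority rows, while oversampling keeps every row and repeats minority rows; which trade-off is preferable depends on the dataset.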
Pages: 8
Related papers
50 in total
  • [1] Classification of motor imagery EEG signals using SVM, k-NN and ANN
    Aruna Tyagi
    Vijay Nehra
    CSI Transactions on ICT, 2016, 4 (2-4) : 135 - 139
  • [2] Traffic Sign Detection Based On HOG and PHOG Using Binary SVM And k-NN
    Sugiharto, Aris
    Harjoko, Agus
    2016 3RD INTERNATIONAL CONFERENCE ON INFORMATION TECHNOLOGY, COMPUTER, AND ELECTRICAL ENGINEERING (ICITACEE), 2016, : 317 - 321
  • [3] Partial Discharge Localization through k-NN and SVM
    Sekatane, Permit Mathuhu
    Bokoro, Pitshou
    ENERGIES, 2023, 16 (21)
  • [4] Offline handwritten Gurmukhi character recognition: k-NN vs. SVM classifier
    Garg A.
    Jindal M.K.
    Singh A.
    International Journal of Information Technology, 2021, 13 (6) : 2389 - 2396
  • [5] Fast k-NN classification using the cluster-space approach
    Jia, XP
    Richards, JA
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2005, 2 (02) : 225 - 228
  • [6] Monitoring Baby State While Sleeping Using K-NN and M-SVM Classifiers
    Nosseir, Ann
    El Araby, Omar
    PROCEEDINGS OF 2019 8TH INTERNATIONAL CONFERENCE ON SOFTWARE AND INFORMATION ENGINEERING (ICSIE 2019), 2019, : 263 - 267
  • [7] K-NN: ESTIMATING AN ADEQUATE VALUE FOR PARAMETER K
    Borsato, Bruno
    Plastino, Alexandre
    Merschmann, Luiz
    ICEIS 2008: PROCEEDINGS OF THE TENTH INTERNATIONAL CONFERENCE ON ENTERPRISE INFORMATION SYSTEMS, VOL AIDSS: ARTIFICIAL INTELLIGENCE AND DECISION SUPPORT SYSTEMS, 2008, : 459 - +
  • [8] Defining the Features of EMG Signals on the Forearm of the Hand Using SVM, RF, k-NN Classification Algorithms
    Turgunov, Adilbek
    Zohirov, Kudratjon
    Ganiyev, Alisher
    Sharopova, Barno
    2020 INFORMATION COMMUNICATION TECHNOLOGIES CONFERENCE (ICTC), 2020, : 260 - 264
  • [9] Optimizing HAR Systems: Comparative Analysis of Enhanced SVM and k-NN Classifiers
    Shdefat, Ahmed Younes
    Mostafa, Nour
    Al-Arnaout, Zakwan
    Kotb, Yehia
    Alabed, Samer
    INTERNATIONAL JOURNAL OF COMPUTATIONAL INTELLIGENCE SYSTEMS, 2024, 17 (01)
  • [10] Robust Classification of Primary Brain Tumor in Computer Tomography Images Using K-NN and Linear SVM
    Sundararaj, G. Kharmega
    Balamurugan, V.
    2014 INTERNATIONAL CONFERENCE ON CONTEMPORARY COMPUTING AND INFORMATICS (IC3I), 2014, : 1315 - 1319