A distance-based kernel for classification via Support Vector Machines

被引:7
作者
Amaya-Tejera, Nazhir [1 ]
Gamarra, Margarita [1 ]
Velez, Jorge I. [2 ]
Zurek, Eduardo [1 ]
机构
[1] Univ Norte, Dept Comp Sci, Barranquilla, Colombia
[2] Univ Norte, Dept Ind Engn, Barranquilla, Colombia
来源
FRONTIERS IN ARTIFICIAL INTELLIGENCE | 2024年 / 7卷
关键词
support vector machines (SVMs); classification; distance-based kernel; kernel method; machine learning; supervised learning; MODEL; SETS; SVM;
D O I
10.3389/frai.2024.1287875
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Support Vector Machines (SVMs) are a type of supervised machine learning algorithm widely used for classification tasks. In contrast to traditional methods that split the data into separate training and testing sets, here we propose an innovative approach where subsets of the original data are randomly selected to train the model multiple times. This iterative training process aims to identify a representative data subset, leading to improved inferences about the population. Additionally, we introduce a novel distance-based kernel specifically designed for binary-type features based on a similarity matrix that efficiently handles both binary and multi-class classification problems. Computational experiments on publicly available datasets of varying sizes demonstrate that our proposed method significantly outperforms existing approaches in terms of classification accuracy. Furthermore, the distance-based kernel achieves superior performance compared to other well-known kernels from the literature and those used in previous studies on the same datasets. These findings validate the effectiveness of our proposed classification method and distance-based kernel for SVMs. By leveraging random subset selection and a unique kernel design, we achieve notable improvements in classification accuracy. These results have significant implications for diverse classification problems in Machine Learning and data analysis.
引用
收藏
页数:15
相关论文
共 45 条
  • [1] Alotaibi FS, 2019, INT J ADV COMPUT SC, V10, P261
  • [2] Awad M, 2016, International Journal of Network Security & Its Applications, V8, P17
  • [3] Borg I., 2013, Applied Multidimensional Scaling SpringerBriefs in Statistics, P7
  • [4] Boser B. E., 1992, Proceedings of the Fifth Annual ACM Workshop on Computational Learning Theory, P144, DOI 10.1145/130385.130401
  • [5] Support vector machine classification for large data sets via minimum enclosing ball clustering
    Cervantes, Jair
    Li, Xiaoou
    Yu, Wen
    Li, Kang
    [J]. NEUROCOMPUTING, 2008, 71 (4-6) : 611 - 619
  • [6] Cherkassky V, 1997, IEEE Trans Neural Netw, V8, P1564, DOI 10.1109/TNN.1997.641482
  • [7] SUPPORT-VECTOR NETWORKS
    CORTES, C
    VAPNIK, V
    [J]. MACHINE LEARNING, 1995, 20 (03) : 273 - 297
  • [8] Deza M. M., 2013, Encyclopedia of Distances, P3
  • [9] Cascades of Evolutionary Support Vector Machines
    Dudzik, Wojciech
    Nalepa, Jakub
    Kawulok, Michal
    [J]. PROCEEDINGS OF THE 2022 GENETIC AND EVOLUTIONARY COMPUTATION CONFERENCE COMPANION, GECCO 2022, 2022, : 240 - 243
  • [10] Decision boundary clustering for efficient local SVM
    Fayed, Hatem A.
    Atiya, Amir F.
    [J]. APPLIED SOFT COMPUTING, 2021, 110