Rule extraction from support vector machines A review

被引:133
作者
Barakat, Nahla [1 ]
Bradley, Andrew P. [2 ]
机构
[1] German Univ Technol Oman, Dept Appl Informat Technol, Muscat, Oman
[2] Univ Queensland, Sch Informat Technol & Elect Engn ITEE, St Lucia, Qld 4072, Australia
关键词
Machine learning; Data mining; Knowledge discovery; Information extraction; Pattern recognition applications; SVMs; NEURAL-NETWORKS; CLASSIFICATION; PREDICTION; AREA;
D O I
10.1016/j.neucom.2010.02.016
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Over the last decade support vector machine classifiers (SVMs) have demonstrated superior generalization performance to many other classification techniques in a variety of application areas However SVMs have an inability to provide an explanation or comprehensible justification for the solutions they reach It has been shown that the black-box nature of techniques like artificial neural networks (ANNs) is one of the main obstacles impeding their practical application Therefore techniques for rule extraction from ANNs and recently from SVMs were introduced to ameliorate this problem and aid in the explanation of their classification decisions In this paper we conduct a formal review of the area of rule extraction from SVMs The review provides a historical perspective for this area of research and conceptually groups and analyzes the various techniques In particular we propose two alternative groupings the first is based on the SVM (model) components utilized for rule extraction while the second is based on the rule extraction approach The aim is to provide a better understanding of the topic in addition to summarizing the main features of individual algorithms The analysis is then followed by a comparative evaluation of the algorithms salient features and relative performance as measured by a number of metrics It is concluded that there is no one algorithm that can be favored in general However methods that are kernel independent produce the most comprehensible rule set and have the highest fidelity to the SVM should be preferred In addition a specific method can be preferred if the context of the requirements of a specific application so that appropriate tradeoffs may be made The paper concludes by highlighting potential research directions such as the need for rule extraction methods in the case of SVM incremental and active learning and other application domains where special types of SVMs are utilized (C) 2010 Elsevier B V All rights reserved
引用
收藏
页码:178 / 190
页数:13
相关论文
共 68 条
  • [1] A.Moschitti, 2006, JOINT C P 10 C COMP
  • [2] Survey and critique of techniques for extracting rules from trained artificial neural networks
    Andrews, R
    Diederich, J
    Tickle, AB
    [J]. KNOWLEDGE-BASED SYSTEMS, 1995, 8 (06) : 373 - 389
  • [3] [Anonymous], 2000, ICML, DOI DOI 10.1007/978-3-540-44871-6_130
  • [4] [Anonymous], P INT C MACH LEARN I
  • [5] [Anonymous], 2002, Learning with kernels: Support vector machines, regularization, optimization, and beyond
  • [6] [Anonymous], 2004, P 13 ACM INT C INF K
  • [7] [Anonymous], P INT C MACH LEARN I
  • [8] [Anonymous], 1984, OLSHEN STONE CLASSIF, DOI 10.2307/2530946
  • [9] [Anonymous], 1997, MACHINE LEARNING, MCGRAW-HILL SCIENCE/ENGINEERING/MATH
  • [10] [Anonymous], 1974, J PHILOS