A Deep Neural Network Based Quasi-Linear Kernel for Support Vector Machines

Cited by: 14
Authors
Li, Weite [1, 2]
Zhou, Bo [1]
Chen, Benhui [3]
Hu, Jinglu [1]
Affiliations
[1] Waseda Univ, Grad Sch Informat Prod & Syst, Kitakyushu, Fukuoka 8080135, Japan
[2] Univ Elect Sci & Technol China, Sch Elect Engn, Chengdu 611731, Sichuan, Peoples R China
[3] Dali Univ, Sch Math & Comp Sci, Dali, Yunnan Province, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
deep neural network; support vector machine; data-dependent kernel; multilayer gated bilinear classifier
DOI
10.1587/transfun.E99.A.2558
Chinese Library Classification number
TP3 [Computing technology, computer technology]
Discipline classification code
0812
Abstract
This paper proposes a deep quasi-linear kernel for support vector machines (SVMs). The deep quasi-linear kernel is constructed from a pre-trained deep neural network. To this end, a multilayer gated bilinear classifier is first designed to mimic the functionality of the pre-trained deep neural network, with its gate control signals generated by that network. A deep quasi-linear kernel is then derived by applying an SVM formulation to the multilayer gated bilinear classifier. In this way, the parameters of the multilayer gated bilinear classifier, which are a duplicate but independent copy of the parameters of the pre-trained deep neural network, can be further optimized implicitly through SVM optimization. Experimental results on different data sets show that SVMs with the proposed deep quasi-linear kernel can take advantage of pre-trained deep neural networks and outperform SVMs with RBF kernels.
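As a rough illustration of the idea in the abstract, the sketch below builds a gated kernel from the activation pattern of a single ReLU layer. The kernel form K(x, x') = (1 + x·x') (g(x)·g(x')), the function names, and the random weights standing in for a pre-trained network are all assumptions made for illustration; the multilayer kernel derived in the paper may differ.

```python
import numpy as np

def gate_signals(X, W, b):
    """Binary gate vector per sample: 1 where a ReLU unit is active, else 0."""
    return (X @ W + b > 0).astype(float)

def quasi_linear_kernel(X1, X2, W, b):
    """Assumed single-layer form: (1 + x.x') * (g(x).g(x')).

    The first factor is a linear kernel with a bias term; the second
    counts the hidden units that are active for both inputs.
    """
    G1 = gate_signals(X1, W, b)
    G2 = gate_signals(X2, W, b)
    linear_part = 1.0 + X1 @ X2.T
    gate_part = G1 @ G2.T
    return linear_part * gate_part

# Hypothetical data and weights; a real use would take W and b
# from a network trained beforehand.
rng = np.random.default_rng(0)
X = rng.normal(size=(6, 4))
W = rng.normal(size=(4, 8))
b = rng.normal(size=8)

K = quasi_linear_kernel(X, X, W, b)
```

Because both factors are positive semidefinite kernels, their elementwise product is as well, so a matrix like `K` could be handed to any SVM solver that accepts a precomputed kernel, for example scikit-learn's `SVC(kernel="precomputed")`.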
Pages: 2558-2565
Number of pages: 8