A Kernel Perspective for the Decision Boundary of Deep Neural Networks

被引:1
作者
Zhang, Yifan [1 ]
Liao, Shizhong [1 ]
机构
[1] Tianjin Univ, Coll Intelligence & Comp, Tianjin 300350, Peoples R China
来源
2020 IEEE 32ND INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE (ICTAI) | 2020年
基金
中国国家自然科学基金;
关键词
deep neural network; kernel method; generalization ability; gradient descent; decision boundary;
D O I
10.1109/ICTAI50040.2020.00105
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Deep learning has achieved great success in many fields, but they still lack theoretical understandings. Although some recent theoretical and experimental results have investigated the representation power of deep learning, little effort has been devoted to analyzing the generalization ability of deep learning. In this paper, we analyze deep neural networks from a kernel perspective and use kernel methods to investigate the effect of the implicit regularization introduced by gradient descent on the generalization ability. Firstly, we argue that the multi-layer nonlinear feature transformation in deep neural networks is equivalent to a kernel feature mapping and analyze our point from the perspective of the unique mathematical advantages of kernel methods and the method of constructing multi-layer kernel machines, respectively. Secondly, using the representer theorem, we analyze the decision boundary of deep neural networks and prove that the last hidden layers of deep neural networks converge to nonlinear SVMs. Systematical experiments demonstrate that the decision boundaries of neural networks converge to those of nonlinear SVMs.
引用
收藏
页码:653 / 660
页数:8
相关论文
共 25 条
  • [1] Anthony Martin, 2002, Neural Network Learning-Theoretical Foundations
  • [2] THEORY OF REPRODUCING KERNELS
    ARONSZAJN, N
    [J]. TRANSACTIONS OF THE AMERICAN MATHEMATICAL SOCIETY, 1950, 68 (MAY) : 337 - 404
  • [3] Bartlett P. L., 2003, Journal of Machine Learning Research, V3, P463, DOI 10.1162/153244303321897690
  • [4] Belkin M., 2018, PR MACH LEARN RES, P541
  • [5] Bohn B, 2019, J MACH LEARN RES, V20
  • [6] Cho Y., 2009, Advances in Neural Information Processing Systems, P342
  • [7] Gao X., 2019, ARXIV180805385V3
  • [8] A fast learning algorithm for deep belief nets
    Hinton, Geoffrey E.
    Osindero, Simon
    Teh, Yee-Whye
    [J]. NEURAL COMPUTATION, 2006, 18 (07) : 1527 - 1554
  • [9] Jaakkola T. S., ARXIV150805133V2
  • [10] Koppel A, 2019, J MACH LEARN RES, V20