Generalization Error in Deep Learning

被引:58
作者
Jakubovitz, Daniel [1 ]
Giryes, Raja [1 ]
Rodrigues, Miguel R. D. [2 ]
机构
[1] Tel Aviv Univ, Sch Elect Engn, Tel Aviv, Israel
[2] UCL, Dept Elect & Elect Engn, London, England
来源
COMPRESSED SENSING AND ITS APPLICATIONS | 2019年
关键词
SAMPLE COMPLEXITY; SPARSE;
D O I
10.1007/978-3-319-73074-5_5
中图分类号
O29 [应用数学];
学科分类号
070104 ;
摘要
Deep learning models have lately shown great performance in various fields such as computer vision, speech recognition, speech translation, and natural language processing. However, alongside their state-of-the-art performance, it is still generally unclear what is the source of their generalization ability. Thus, an important question is what makes deep neural networks able to generalize well from the training set to new data. In this chapter, we provide an overview of the existing theory and bounds for the characterization of the generalization error of deep neural networks, combining both classical and more recent theoretical and empirical results.
引用
收藏
页码:153 / 193
页数:41
相关论文
共 66 条
  • [51] Neyshabur B., 2015, C LEARN THEOR, P1376
  • [52] Neyshabur B., 2018, INT C LEARN REPR ICL
  • [53] Neyshabur B, 2017, ADV NEUR IN, V30
  • [54] A Survey on Transfer Learning
    Pan, Sinno Jialin
    Yang, Qiang
    [J]. IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2010, 22 (10) : 1345 - 1359
  • [55] Papyan V, 2017, J MACH LEARN RES, V18, P1
  • [56] Poggio Tomaso, 2017, [International Journal of Automation and Computing, 国际自动化与计算杂志], V14, P503
  • [57] Schmidt Ludwig, 2018, NEURIPS, P5019
  • [58] Convergence radius and sample complexity of ITKM algorithms for dictionary learning
    Schnass, Karin
    [J]. APPLIED AND COMPUTATIONAL HARMONIC ANALYSIS, 2018, 45 (01) : 22 - 58
  • [59] Shalev-Shwartz S., 2014, UNDERSTANDING MACHIN, DOI 10.1017/CBO9781107298019
  • [60] Sokolic J, 2017, PR MACH LEARN RES, V54, P1094