Model complexity of deep learning: a survey

被引:238
作者
Hu, Xia [1 ]
Chu, Lingyang [2 ]
Pei, Jian [1 ]
Liu, Weiqing [3 ]
Bian, Jiang [3 ]
机构
[1] Simon Fraser Univ, Sch Comp Sci, Burnaby, BC, Canada
[2] McMaster Univ, Dept Comp & Software, Hamilton, ON, Canada
[3] Microsoft Res, Beijing, Peoples R China
基金
加拿大自然科学与工程研究理事会;
关键词
Deep learning; Deep neural network; Model complexity; Expressive capacity; DECISION TREE COMPLEXITY; NEURAL-NETWORKS; VC-DIMENSION; BOUNDS; SELECTION; ACCURACY; TIME;
D O I
10.1007/s10115-021-01605-0
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Model complexity is a fundamental problem in deep learning. In this paper, we conduct a systematic overview of the latest studies on model complexity in deep learning. Model complexity of deep learning can be categorized into expressive capacity and effective model complexity. We review the existing studies on those two categories along four important factors, including model framework, model size, optimization process, and data complexity. We also discuss the applications of deep learning model complexity including understanding model generalization, model optimization, and model selection and design. We conclude by proposing several interesting future directions.
引用
收藏
页码:2585 / 2619
页数:35
相关论文
共 100 条
[1]  
Adams R A., 2003, Sobolev Spaces, V2nd edn
[2]  
Allen-Zhu Z, 2019, ADV NEUR IN, V32
[3]  
[Anonymous], 2012, LECT POLYTOPES
[4]  
Arora R., 2018, INT C LEARN REPR
[5]  
Arora S, 2009, COMPUTATIONAL COMPLEXITY: A MODERN APPROACH, P1, DOI 10.1017/CBO9780511804090
[6]  
Ba LJ, 2014, ADV NEUR IN, V27
[7]   Statistical inference, Occam's razor, and statistical mechanics on the space of probability distributions [J].
Balasubramanian, V .
NEURAL COMPUTATION, 1997, 9 (02) :349-368
[8]   UNIVERSAL APPROXIMATION BOUNDS FOR SUPERPOSITIONS OF A SIGMOIDAL FUNCTION [J].
BARRON, AR .
IEEE TRANSACTIONS ON INFORMATION THEORY, 1993, 39 (03) :930-945
[9]  
Bartlett P. L., 2003, Journal of Machine Learning Research, V3, P463, DOI 10.1162/153244303321897690
[10]  
Bartlett PL, 2019, J MACH LEARN RES, V20, P1