On the rate of convergence of image classifiers based on convolutional neural networks

Cited by: 9
Authors
Kohler, Michael [1 ]
Krzyzak, Adam [2 ]
Walter, Benjamin [1 ]
Affiliations
[1] Tech Univ Darmstadt, Fachbereich Math, Schlossgartenstr 7, D-64289 Darmstadt, Germany
[2] Concordia Univ, Dept Comp Sci & Software Engn, 1455 Maisonneuve Blvd West, Montreal, PQ H3G 1M8, Canada
Funding
Natural Sciences and Engineering Research Council of Canada
Keywords
Curse of dimensionality; Convolutional neural networks; Image classification; Rate of convergence
DOI
10.1007/s10463-022-00828-4
Chinese Library Classification
O21 [Probability Theory and Mathematical Statistics]; C8 [Statistics]
Discipline Classification Codes
020208; 070103; 0714
Abstract
Image classifiers based on convolutional neural networks are defined, and the rate of convergence of the misclassification risk of the estimates towards the optimal misclassification risk is analyzed. Under suitable assumptions on the smoothness and structure of the a posteriori probability, a rate of convergence is derived that is independent of the dimension of the image. This proves that in image classification the curse of dimensionality can be circumvented by convolutional neural networks. Furthermore, the obtained result gives an indication of why convolutional neural networks are able to outperform standard feedforward neural networks in image classification. Our classifiers are compared with various other classification methods on simulated data, and the performance of our estimates is also tested on real images.
Pages: 1085-1108
Page count: 24
References
35 items in total (first 10 shown)
[1] Anonymous (1993). Lecture Notes in Statistics.
[2] Bartlett, P. L. (2019). Journal of Machine Learning Research, 20, 1.
[3] Bauer, B., & Kohler, M. (2019). On deep learning as a remedy for the curse of dimensionality in nonparametric regression. Annals of Statistics, 47(4), 2261-2285.
[4] Chang, L.-B., Borenstein, E., Zhang, W., & Geman, S. (2017). Maximum likelihood features for generative image models. Annals of Applied Statistics, 11(3), 1275-1308.
[5] Cover, T. M. (1968). Proceedings of the Hawaii International Conference on System Sciences, p. 413.
[6] Devroye, L. (1982). Necessary and sufficient conditions for the pointwise convergence of nearest neighbor regression function estimates. Zeitschrift für Wahrscheinlichkeitstheorie und Verwandte Gebiete, 61(4), 467-481.
[7] Devroye, L. (1996). A Probabilistic Theory of Pattern Recognition, Vol. 31. DOI: 10.1007/978-1-4612-0711-5.
[8] Du (2018). arXiv:1811.03804.
[9] Eckle, K., & Schmidt-Hieber, J. (2019). A comparison of deep networks with ReLU activation function and linear spline-type methods. Neural Networks, 110, 232-242.
[10] Glorot, X. (2010). Proceedings of the 13th International Conference on Artificial Intelligence and Statistics, p. 249.