HOW CONVOLUTIONAL NEURAL NETWORKS DEAL WITH ALIASING

被引：13

作者：

Ribeiro, Antonio H. ^{[1
,2
]}

Schon, Thomas B. ^{[1
]}

机构：

[1] Uppsala Univ, Dept Informat Technol, Uppsala, Sweden

[2] Univ Fed Minas Gerais, Dept Comp Sci, Belo Horizonte, MG, Brazil

来源：

2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021) | 2021年

关键词：

Convolutional neural networks; aliasing; deep learning; downsampling; image classification;

D O I：

10.1109/ICASSP39728.2021.9414627

中图分类号：

O42 [声学];

学科分类号：

070206 ; 082403 ;

摘要：

The convolutional neural network (CNN) remains an essential tool in solving computer vision problems. Standard convolutional architectures consist of stacked layers of operations that progressively downscale the image. Aliasing is a well-known side-effect of downsampling that may take place: it causes high-frequency components of the original signal to become indistinguishable from its low-frequency components. While downsampling takes place in the max-pooling layers or in the strided-convolutions in these models, there is no explicit mechanism that prevents aliasing from taking place in these layers. Due to the impressive performance of these models, it is natural to suspect that they, somehow, implicitly deal with this distortion. The question we aim to answer in this paper is simply: "how and to what extent do CNNs counteract aliasing?" We explore the question by means of two examples: In the first, we assess the CNNs capability of distinguishing oscillations at the input, showing that the redundancies in the intermediate channels play an important role in succeeding at the task; In the second, we show that an image classifier CNN while, in principle, capable of implementing anti-aliasing filters, does not prevent aliasing from taking place in the intermediate layers.

引用

页码：2755 / 2759

页数：5

共 17 条

[1] Azulay A, 2019, J MACH LEARN RES, V20
[2] Decoding by linear programming
Candes, EJ
Tao, T
[J]. IEEE TRANSACTIONS ON INFORMATION THEORY, 2005, 51 (12) : 4203 - 4215
[3] Compressed sensing
Donoho, DL
[J]. IEEE TRANSACTIONS ON INFORMATION THEORY, 2006, 52 (04) : 1289 - 1306
[4] Impact of Aliasing on Deep CNN-Based End-to-End Acoustic Models
Gong, Yuan
Poellabauer, Christian
[J]. 19TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2018), VOLS 1-6: SPEECH RESEARCH FOR EMERGING MARKETS IN MULTILINGUAL SOCIETIES, 2018, : 2698 - 2702
[5] He Kaiming, 2015, C COMP VIS PATT REC
[6] Henaff O. J., 2016, ICLR
[7] Kingma DP, 2014, ADV NEUR IN, V27
[8] ImageNet Classification with Deep Convolutional Neural Networks
Krizhevsky, Alex
Sutskever, Ilya
Hinton, Geoffrey E.
[J]. COMMUNICATIONS OF THE ACM, 2017, 60 (06) : 84 - 90
[9] Le Cun Y., 1990, Proceedings. 10th International Conference on Pattern Recognition (Cat. No.90CH2898-5), P35, DOI 10.1109/ICPR.1990.119325
[10] Madry A., 2018, Towards deep learning models resistant to adversarial attacks

← 1 2 →