Novel convolutional neural networks for efficient classification of rotated and scaled images

被引:2
作者
Tarasiuk, Pawel [1 ]
Szczepaniak, Piotr S. [1 ]
机构
[1] Lodz Univ Technol, Inst Informat Technol, Ul Wolczanska 215, PL-90924 Lodz, Poland
关键词
Deep learning; Convolutional neural networks; Invariance to rotation and scale; Efficient deep learning applications; Optimization of deep learning architectures;
D O I
10.1007/s00521-021-06645-9
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper presents a novel method for improving the invariance of convolutional neural networks (CNNs) to selected geometric transformations in order to obtain more efficient image classifiers. A common strategy employed to achieve this aim is to train the network using data augmentation. Such a method alone, however, increases the complexity of the neural network model, as any change in the rotation or size of the input image results in the activation of different CNN feature maps. This problem can be resolved by the proposed novel convolutional neural network models with geometric transformations embedded into the network architecture. The evaluation of the proposed CNN model is performed on the image classification task with the use of diverse representative data sets. The CNN models with embedded geometric transformations are compared to those without the transformations, using different data augmentation setups. As the compared approaches use the same amount of memory to store the parameters, the improved classification score means that the proposed architecture is more optimal.
引用
收藏
页码:10519 / 10532
页数:14
相关论文
共 37 条
[1]  
Azulay A, 2019, J MACH LEARN RES, V20
[2]   SegNet: A Deep Convolutional Encoder-Decoder Architecture for Image Segmentation [J].
Badrinarayanan, Vijay ;
Kendall, Alex ;
Cipolla, Roberto .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2017, 39 (12) :2481-2495
[3]   Learning Rotation-Invariant Convolutional Neural Networks for Object Detection in VHR Optical Remote Sensing Images [J].
Cheng, Gong ;
Zhou, Peicheng ;
Han, Junwei .
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2016, 54 (12) :7405-7415
[4]  
Chicco D, 2021, METHODS MOL BIOL, V2190, P73, DOI 10.1007/978-1-0716-0826-5_3
[5]  
Ciresan D., 2011, IJCAI P INT JOINT C, P1237, DOI DOI 10.5591/978-1-57735-516-8/IJCAI11-210
[6]  
Codella NCF, 2018, I S BIOMED IMAGING, P168, DOI 10.1109/ISBI.2018.8363547
[7]  
Conneau A, 2017, 15TH CONFERENCE OF THE EUROPEAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (EACL 2017), VOL 1: LONG PAPERS, P1107
[8]   Rotation-invariant convolutional neural networks for galaxy morphology prediction [J].
Dieleman, Sander ;
Willett, Kyle W. ;
Dambre, Joni .
MONTHLY NOTICES OF THE ROYAL ASTRONOMICAL SOCIETY, 2015, 450 (02) :1441-1459
[9]   Predicting Depth, Surface Normals and Semantic Labels with a Common Multi-Scale Convolutional Architecture [J].
Eigen, David ;
Fergus, Rob .
2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, :2650-2658
[10]   CcNet: A cross-connected convolutional network for segmenting retinal vessels using multi-scale features [J].
Feng, Shouting ;
Zhuo, Zhongshuo ;
Pan, Daru ;
Tian, Qi .
NEUROCOMPUTING, 2020, 392 :268-276