BAG OF GROUPS OF CONVOLUTIONAL FEATURES MODEL FOR VISUAL OBJECT RECOGNITION

被引：0

作者：

Singh, Jaspreet ^{[1
]}

Singh, Chandan ^{[1
]}

机构：

[1] Punjabi Univ, Dept Comp Sci, Patiala 147002, Punjab, India

来源：

2021 IEEE 31ST INTERNATIONAL WORKSHOP ON MACHINE LEARNING FOR SIGNAL PROCESSING (MLSP) | 2021年

关键词：

Rotation; equivariance; invariance; classification; MOMENTS; SCALE;

D O I：

10.1109/MLSP52302.2021.9596432

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Deep convolutional neural networks (CNNs) are only equivariant to translation. Recently, equivariant CNNs are proposed for the task of image classification which are not only equivariant to translation but also to other affine geometric transformations. Moreover, CNNs and equivariant CNNs require a large amount of labeled training data to generalize its parameters which also limit their application areas. We propose a bag of groups of convolutional features (BoGCFs) model for the CNNs and group-equivariant CNNs (G-CNNs)[1], which preserves the fundamental property of equivariance of G-CNNs and generate the global invariant features by dividing the convolutional feature maps of the deeper layers of the network into groups. The proposed model for CNNs and G-CNNs, referred as CNN-BoGCFs and G-CNN-BoGCFs, performs significantly high when trained on a small amount of labeled data for image classification. The proposed method is evaluated using rotated MNIST, SIMPLIcity and Oxford flower 17 datasets.

引用

页数：6

共 22 条

[1]

[Anonymous], 2008, COMPUT VIS IMAGE UND

[2] Aggregating Deep Convolutional Features for Image Retrieval [J].

Babenko, Artem ;

Lempitsky, Victor .

2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, :1269-1277

[3] Roto-Translation Covariant Convolutional Networks for Medical Image Analysis [J].

Bekkers, Erik J. ;

Lafarge, Maxime W. ;

Veta, Mitko ;

Eppenhof, Koen A. J. ;

Pluim, Josien P. W. ;

Duits, Remco .

MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION - MICCAI 2018, PT I, 2018, 11070 :440-448

[4]

Bekkers Erik J, 2019, ARXIV PREPRINT ARXIV

[5]

Chen W.-Y., 2019, The Fractional Laplacian

[6]

Cohen TS, 2016, PR MACH LEARN RES, V48

[7]

Csurka G., 2004, P WORKSH STAT LEARN, V1, P1

[8] Distinctive image features from scale-invariant keypoints [J].

Lowe, DG .

INTERNATIONAL JOURNAL OF COMPUTER VISION, 2004, 60 (02) :91-110

[9]

Nilsback M.-E., 2006, IEEE C COMP VIS PATT, V2, P1447

[10] Multiresolution gray-scale and rotation invariant texture classification with local binary patterns [J].

Ojala, T ;

Pietikäinen, M ;

Mäenpää, T .

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2002, 24 (07) :971-987

← 1 2 3 →