Impact of fully connected layers on performance of convolutional neural networks for image classification

被引：271

作者：

Basha, S. H. Shabbeer ^{[1
]}

Dubey, Shiv Ram ^{[1
]}

Pulabaigari, Viswanath ^{[1
]}

Mukherjee, Snehasis ^{[1
]}

机构：

[1] Indian Inst Informat Technol Sri City, Sri City 517646, Andhra Pradesh, India

来源：

NEUROCOMPUTING | 2020年 / 378卷

关键词：

Convolutional neural networks; Fully connected layers; Image classification; Shallow vs deep CNNs; Wider vs deeper datasets;

D O I：

10.1016/j.neucom.2019.10.008

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

The Convolutional Neural Networks (CNNs), in domains like computer vision, mostly reduced the need for handcrafted features due to its ability to learn the problem-specific features from the raw input data. However, the selection of dataset-specific CNN architecture, which mostly performed by either experience or expertise is a time-consuming and error-prone process. To automate the process of learning a CNN architecture, this paper attempts at finding the relationship between Fully Connected (FC) layers with some of the characteristics of the datasets. The CNN architectures, and recently datasets also, are categorized as deep, shallow, wide, etc. This paper tries to formalize these terms along with answering the following questions. (i) What is the impact of deeper/shallow architectures on the performance of the CNN w.r.t. FC layers?, (ii) How the deeper/wider datasets influence the performance of CNN w.r.t. FC layers?, and (iii) Which kind of architecture (deeper/shallower) is better suitable for which kind of (deeper/wider) datasets. To address these findings, we have performed experiments with four CNN architectures having different depths. The experiments are conducted by varying the number of FC layers. We used four widely used datasets including CIFAR-10, CIFAR-100, Tiny ImageNet, and CRCHistoPhenotypes to justify our findings in the context of image classification problem. (C) 2019 Elsevier B.V. All rights reserved.

引用

页码：112 / 119

页数：8

共 27 条

[1]

[Anonymous], 2016, P BRIT MACH VIS C

[2]

[Anonymous], TINY IMAGENET CHALLE

[3]

[Anonymous], 2016, ARXIV PREPRINT ARXIV

[4]

[Anonymous], 2017, ADV NEURAL INFORM PR

[5]

Ba LJ, 2014, ADV NEUR IN, V27

[6] The Do's and Don'ts for CNN-based Face Verification [J].

Bansal, Ankan ;

Castillo, Carlos ;

Ranjan, Rajeev ;

Chellappa, Rama .

2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW 2017), 2017, :2545-2554

[7]

Basha SHS, 2018, I C CONT AUTOMAT ROB, P1222, DOI 10.1109/ICARCV.2018.8581147

[8]

Cybenko G., 1989, Mathematics of Control, Signals, and Systems, V2, P303, DOI 10.1007/BF02551274

[9]

Delalleau O., 2011, Advances in Neural Information Processing Systems, V24, P666

[10]

Deng J, 2009, PROC CVPR IEEE, P248, DOI 10.1109/CVPRW.2009.5206848

← 1 2 3 →