Scene Level Image Classification: A Literature Review

被引：0

作者：

Sagar Chavda

Mahesh Goyani

机构：

[1] Gujarat Technological University,

[2] Government Engineering College,undefined

来源：

Neural Processing Letters | 2023年 / 55卷

关键词：

Scene classification; Remote sensing; Multi-label; Low–mid–high level research practices; CNNs; Attentions and ViTs; CapsNet and GANs; Losses; activations; optimization and regularization;

D O I：

暂无

中图分类号：

学科分类号：

摘要：

Convolutional neural networks (CNNs) have made significant contributions to natural and remote sensing imaging since the development of deep learning. Scene-level image classification is a challenge that affects both the natural and remote sensing domains and has numerous applications. The number of possible scene entities in the image content that could match the dataset images is the main focus. Scene-level classification is significant and fascinating because of open problems like intraclass heterogeneity, interclass homogeneity, background cluttering, high spatial resolution, and different imaging conditions. Additionally, the multi-label scene dataset’s imbalance, lack of preservation of complex semantic relations, and higher label-to-label correlation are all apparent. The article discusses a meta-analysis of the state-of-the-art scene classification literature practices. We discuss CNNs, attention mechanisms, capsule networks, and generative adversarial networks. The article also delivers an overview of the various activations, losses, optimization techniques, and regularization schemes pertinent to the scene domain. The standard benchmark datasets based on single- and multi-label themes are collated. The performance metrics for scene classification are explained as well. The implementation of the multi-label scene classification utilizing several CNN models on the UC Merced multi-label dataset is also covered in the paper. The proposed MobileNet-based model performs better than the recognized cutting-edge methodologies.

引用

页码：2471 / 2520

页数：49

共 293 条

[1]

Aksoy S(2005)Learning Bayesian classifiers for scene classification with a visual grammar IEEE Trans Geosci Remote Sens 43 581-589

[2]

Koperski K(2020)BoVSG: bag of visual SubGraphs for remote sensing scene classification Int J Remote Sens 41 1986-2003

[3]

Tusk C(2021)A survey on modern trainable activation functions Neural Netw 138 14-32

[4]

Marchisio G(2020)Impact of fully connected layers on performance of convolutional neural networks for image classification Neurocomputing 378 112-119

[5]

Tilton JC(2021)Uav image multi-labeling with data-efficient transformers Appl Sci 11 3974-20

[6]

Amiri K(2021)Vision transformers for remote sensing image classification Remote Sens 13 516:1-359

[7]

Farah M(2020)RADC-Net: a residual attention based convolution network for aerial scene classification Neurocomputing 377 345-17

[8]

Leloglu UM(2001)What’s wrong with pixels? Some recent developments interfacing remote sensing and GIS Z Geoinformationssyst 14 12-727

[9]

Apicella A(2008)Scene classification using a hybrid generative/discriminative approach IEEE Trans Pattern Anal Mach Intell 30 712-311

[10]

Donnarumma F(2018)Optimization methods for large-scale machine learning SIAM Rev 60 223-630

← 1 2 3 4 5 6 7 8 9 10 →