Semantic embedding: scene image classification using scene-specific objects

被引:2
作者
Parseh, Mohammad Javad [1 ]
Rahmanimanesh, Mohammad [1 ]
Keshavarzi, Parviz [1 ]
Azimifar, Zohreh [2 ]
机构
[1] Semnan Univ, Dept Elect & Comp Engn, Semnan, Iran
[2] Shiraz Univ, Dept Comp Sci & Engn, Shiraz, Iran
关键词
Scene classification; Semantic embedding; Scene-specific objects; RECOGNITION; MODEL; REPRESENTATION; FEATURES; CATEGORIZATION; NETWORKS; BANK;
D O I
10.1007/s00530-022-01010-9
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Visual scene understanding is a hot and challenging topic in image processing that aims to understand the general (global) concept of a scene image. In this paper, we propose a novel image embedding algorithm using a learned embedded space, which introduces a high-level semantic representation of the scene images. The learned embedded space as a suitable semantic framework for visual concepts can be used in most applications such as image captioning, Visual Question Answering (VQA), and scene recognition or classification. Inspired by the human inference mechanism in visual scene understanding, the proposed method learns a semantic embedded space of visual concepts using prior semantic knowledge. Prior knowledge is extracted from ConceptNet as one of the most comprehensive knowledge graphs in the form of semantic vectors and is transformed to the learned embedded space with a transformation function. The transformation function is learned by solving a minimization problem. To evaluate our proposed approach, we introduce a scene image dataset called "Scene23", which is based on the VisualGenome dataset. A non-linear SVM classifier is utilized to classify the representations of images to the scene categories. The experimental results show 99.44% classification accuracy on the "Scene23" dataset. Also, we evaluated our proposed method by the "UIUC Sports" and "MIT67" datasets. Experimental results indicate that our proposed method outperforms other state-of-the-art methods on the "UIUC Sports" dataset and achieves competitive results on the "MIT67" dataset.
引用
收藏
页码:669 / 691
页数:23
相关论文
共 50 条
[41]   Robust Learning of Mislabeled Training Samples for Remote Sensing Image Scene Classification [J].
Tu, Bing ;
Kuang, Wenlan ;
He, Wangquan ;
Zhang, Guoyun ;
Peng, Yishu .
IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2020, 13 :5623-5639
[42]   An Attribute-Based High-Level Image Representation for Scene Classification [J].
Liu, Wenhua ;
Li, Yidong ;
Wu, Qi .
IEEE ACCESS, 2019, 7 :4629-4640
[43]   An Approach for Construct Semantic Map with Scene Classification and Object Semantic Segmentation [J].
Wang, Peng ;
Cheng, Jun ;
Feng, Wei .
PROCEEDINGS OF 2018 IEEE INTERNATIONAL CONFERENCE ON REAL-TIME COMPUTING AND ROBOTICS (IEEE RCAR), 2018, :270-275
[44]   Remote sensing image scene classification using CNN-MLP with data augmentation [J].
Shawky, Osama A. ;
Hagag, Ahmed ;
El-Dahshan, El-Sayed A. ;
Ismail, Manal A. .
OPTIK, 2020, 221
[45]   A Multi-Level Convolution Pyramid Semantic Fusion Framework for High-Resolution Remote Sensing Image Scene Classification and Annotation [J].
Sun, Xiongli ;
Zhu, Qiqi ;
Qin, Qianqing .
IEEE ACCESS, 2021, 9 (09) :18195-18208
[46]   Scene Image Classification using a Wigner-Based Local Binary Patterns Descriptor [J].
Sinha, Atreyee ;
Banerji, Sugata ;
Liu, Chengjun .
PROCEEDINGS OF THE 2014 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2014, :1614-1621
[47]   A novel topic feature for image scene classification [J].
Zang, Mujun ;
Wen, Dunwei ;
Wang, Ke ;
Liu, Tong ;
Song, Weiwei .
NEUROCOMPUTING, 2015, 148 :467-476
[48]   Scene Level Image Classification: A Literature Review [J].
Sagar Chavda ;
Mahesh Goyani .
Neural Processing Letters, 2023, 55 :2471-2520
[49]   Scene classification with respect to image quality measurements [J].
Oh, Kyung Hoon ;
Triantaphillidou, Sophie ;
Jacobson, Ralph E. .
IMAGE QUALITY AND SYSTEM PERFORMANCE VII, 2010, 7529
[50]   Novel Color HWML Descriptors for Scene and Object Image Classification [J].
Banerji, Sugata ;
Sinha, Atreyee ;
Liu, Chengjun .
2012 3RD INTERNATIONAL CONFERENCE ON IMAGE PROCESSING THEORY, TOOLS AND APPLICATIONS, 2012, :330-335