Semantic embedding: scene image classification using scene-specific objects

Cited by: 2

Authors:

Parseh, Mohammad Javad [1]

Rahmanimanesh, Mohammad [1]

Keshavarzi, Parviz [1]

Azimifar, Zohreh [2]

Affiliations:

[1] Semnan Univ, Dept Elect & Comp Engn, Semnan, Iran

[2] Shiraz Univ, Dept Comp Sci & Engn, Shiraz, Iran

Source:

MULTIMEDIA SYSTEMS | 2023 / Vol. 29 / Issue 02

Keywords:

Scene classification; Semantic embedding; Scene-specific objects; RECOGNITION; MODEL; REPRESENTATION; FEATURES; CATEGORIZATION; NETWORKS; BANK;

DOI:

10.1007/s00530-022-01010-9

Chinese Library Classification (CLC) number:

TP [Automation technology, computer technology];

Discipline classification code:

0812 ;

Abstract:

Visual scene understanding is a hot and challenging topic in image processing that aims to understand the general (global) concept of a scene image. In this paper, we propose a novel image embedding algorithm using a learned embedded space, which introduces a high-level semantic representation of the scene images. The learned embedded space as a suitable semantic framework for visual concepts can be used in most applications such as image captioning, Visual Question Answering (VQA), and scene recognition or classification. Inspired by the human inference mechanism in visual scene understanding, the proposed method learns a semantic embedded space of visual concepts using prior semantic knowledge. Prior knowledge is extracted from ConceptNet as one of the most comprehensive knowledge graphs in the form of semantic vectors and is transformed to the learned embedded space with a transformation function. The transformation function is learned by solving a minimization problem. To evaluate our proposed approach, we introduce a scene image dataset called "Scene23", which is based on the VisualGenome dataset. A non-linear SVM classifier is utilized to classify the representations of images to the scene categories. The experimental results show 99.44% classification accuracy on the "Scene23" dataset. Also, we evaluated our proposed method by the "UIUC Sports" and "MIT67" datasets. Experimental results indicate that our proposed method outperforms other state-of-the-art methods on the "UIUC Sports" dataset and achieves competitive results on the "MIT67" dataset.
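The abstract says that a transformation function is learned by solving a minimization problem, but it does not give the objective. The sketch below is one possible reading, not the authors' formulation: it assumes a linear map and a ridge-regularized least-squares objective, and it pools the transformed object vectors by averaging. The names `learn_transformation`, `embed_image`, and the toy data are all hypothetical.

```python
import numpy as np

def learn_transformation(S, E, reg=1e-3):
    """Fit W minimizing ||S @ W - E||_F^2 + reg * ||W||_F^2 (assumed objective).

    S: (n_concepts, d_sem) prior semantic vectors, e.g. taken from a knowledge graph.
    E: (n_concepts, d_emb) target coordinates in the learned embedded space.
    """
    d_sem = S.shape[1]
    # Closed-form ridge solution of (S^T S + reg * I) W = S^T E
    return np.linalg.solve(S.T @ S + reg * np.eye(d_sem), S.T @ E)

def embed_image(object_vectors, W):
    """Map the semantic vectors of an image's objects and average them (assumed pooling)."""
    return (object_vectors @ W).mean(axis=0)

# Toy data: 50 concepts, 8-dimensional semantic vectors, 4-dimensional embedded space.
rng = np.random.default_rng(0)
S = rng.normal(size=(50, 8))
W_true = rng.normal(size=(8, 4))
E = S @ W_true

W = learn_transformation(S, E, reg=1e-8)
image_repr = embed_image(S[:5], W)  # representation of an image containing 5 objects
```

In the pipeline the abstract describes, representations such as `image_repr` would then be passed to a non-linear SVM classifier to predict the scene category.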


Pages: 669-691

Page count: 23
