Semantic embedding: scene image classification using scene-specific objects

被引:2
|
作者
Parseh, Mohammad Javad [1 ]
Rahmanimanesh, Mohammad [1 ]
Keshavarzi, Parviz [1 ]
Azimifar, Zohreh [2 ]
机构
[1] Semnan Univ, Dept Elect & Comp Engn, Semnan, Iran
[2] Shiraz Univ, Dept Comp Sci & Engn, Shiraz, Iran
关键词
Scene classification; Semantic embedding; Scene-specific objects; RECOGNITION; MODEL; REPRESENTATION; FEATURES; CATEGORIZATION; NETWORKS; BANK;
D O I
10.1007/s00530-022-01010-9
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Visual scene understanding is a hot and challenging topic in image processing that aims to understand the general (global) concept of a scene image. In this paper, we propose a novel image embedding algorithm using a learned embedded space, which introduces a high-level semantic representation of the scene images. The learned embedded space as a suitable semantic framework for visual concepts can be used in most applications such as image captioning, Visual Question Answering (VQA), and scene recognition or classification. Inspired by the human inference mechanism in visual scene understanding, the proposed method learns a semantic embedded space of visual concepts using prior semantic knowledge. Prior knowledge is extracted from ConceptNet as one of the most comprehensive knowledge graphs in the form of semantic vectors and is transformed to the learned embedded space with a transformation function. The transformation function is learned by solving a minimization problem. To evaluate our proposed approach, we introduce a scene image dataset called "Scene23", which is based on the VisualGenome dataset. A non-linear SVM classifier is utilized to classify the representations of images to the scene categories. The experimental results show 99.44% classification accuracy on the "Scene23" dataset. Also, we evaluated our proposed method by the "UIUC Sports" and "MIT67" datasets. Experimental results indicate that our proposed method outperforms other state-of-the-art methods on the "UIUC Sports" dataset and achieves competitive results on the "MIT67" dataset.
引用
收藏
页码:669 / 691
页数:23
相关论文
共 50 条
  • [1] Semantic embedding: scene image classification using scene-specific objects
    Mohammad Javad Parseh
    Mohammad Rahmanimanesh
    Parviz Keshavarzi
    Zohreh Azimifar
    Multimedia Systems, 2023, 29 : 669 - 691
  • [2] Underwater Image Enhancement using Scene-Specific Red Channel Prior and Fusion
    Sivaanpu, Anparasy
    Thanikasalam, Kokul
    2021 5TH INTERNATIONAL CONFERENCE ON ELECTRICAL, ELECTRONICS, COMMUNICATION, COMPUTER TECHNOLOGIES AND OPTIMIZATION TECHNIQUES (ICEECCOT), 2021, : 580 - 586
  • [3] Research on the Classification of Image Semantic Scene
    Zhang Fang
    Guo Huiling
    Jia Lingshan
    PROCEEDINGS OF THE 2015 INTERNATIONAL INDUSTRIAL INFORMATICS AND COMPUTER ENGINEERING CONFERENCE, 2015, : 162 - 165
  • [4] Training a Scene-Specific Pedestrian Detector Using Tracklets
    Mao, Yunxiang
    Yin, Zhaozheng
    2015 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2015, : 170 - 176
  • [5] Scene Classification Algorithm Based on Semantic Segmented Objects
    Yeo, Woon-Ha
    Heo, Young-Jin
    Choi, Young-Ju
    Park, Seo-Jeon
    Kim, Byung-Gyu
    2021 IEEE INTERNATIONAL CONFERENCE ON CONSUMER ELECTRONICS (ICCE), 2021,
  • [6] Semantic Scene Classification for Image Annotation and Retrieval
    Cavus, Oezge
    Aksoy, Selim
    STRUCTURAL, SYNTACTIC, AND STATISTICAL PATTERN RECOGNITION, 2008, 5342 : 402 - 410
  • [7] Semantic-based Scene Image Classification
    Wang, Xiaoru
    Du, Junping
    Liu, Jie
    PROCEEDINGS OF 2011 INTERNATIONAL CONFERENCE ON ADVANCED INTELLIGENCE AND AWARENESS INTERNET, IET AIAI2011, 2011, : 150 - 153
  • [8] Scene-specific crowd counting using synthetic training images
    Delussu, Rita
    Putzu, Lorenzo
    Fumera, Giorgio
    Pattern Recognition, 2022, 124
  • [9] Single Image Defogging using Depth Estimation and Scene-Specific Dark Channel Prior
    Kokul, T.
    Anparasy, S.
    2020 20TH INTERNATIONAL CONFERENCE ON ADVANCES IN ICT FOR EMERGING REGIONS (ICTER-2020), 2020, : 190 - 195
  • [10] Automatic Generation of Scene-Specific Person Trackers
    Holzbach, Gerrit
    van de Camp, Florian
    Stiefelhagen, Rainer
    2016 13TH IEEE INTERNATIONAL CONFERENCE ON ADVANCED VIDEO AND SIGNAL BASED SURVEILLANCE (AVSS), 2016, : 408 - 415