Semantic embedding: scene image classification using scene-specific objects

Cited by: 2
Authors
Parseh, Mohammad Javad [1]
Rahmanimanesh, Mohammad [1]
Keshavarzi, Parviz [1]
Azimifar, Zohreh [2]
Affiliations
[1] Semnan Univ, Dept Elect & Comp Engn, Semnan, Iran
[2] Shiraz Univ, Dept Comp Sci & Engn, Shiraz, Iran
Keywords
Scene classification; Semantic embedding; Scene-specific objects; RECOGNITION; MODEL; REPRESENTATION; FEATURES; CATEGORIZATION; NETWORKS; BANK;
DOI
10.1007/s00530-022-01010-9
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology];
Discipline classification code
0812 ;
Abstract
Visual scene understanding is an active and challenging topic in image processing that aims to capture the general (global) concept of a scene image. In this paper, we propose a novel image embedding algorithm that uses a learned embedded space to provide a high-level semantic representation of scene images. As a semantic framework for visual concepts, the learned embedded space can be used in many applications, such as image captioning, Visual Question Answering (VQA), and scene recognition or classification. Inspired by the human inference mechanism in visual scene understanding, the proposed method learns a semantic embedded space of visual concepts using prior semantic knowledge. This prior knowledge is extracted, in the form of semantic vectors, from ConceptNet, one of the most comprehensive knowledge graphs, and is mapped into the learned embedded space by a transformation function. The transformation function is learned by solving a minimization problem. To evaluate the proposed approach, we introduce a scene image dataset called "Scene23", which is based on the VisualGenome dataset. A non-linear SVM classifier is used to assign the image representations to scene categories. The experimental results show 99.44% classification accuracy on the "Scene23" dataset. We also evaluated the proposed method on the "UIUC Sports" and "MIT67" datasets. The results indicate that the proposed method outperforms other state-of-the-art methods on the "UIUC Sports" dataset and achieves competitive results on the "MIT67" dataset.
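The abstract states only that the transformation function is learned by solving a minimization problem; it does not give the objective. As a rough illustration of what such a step can look like, the sketch below assumes a linear map W fitted by regularized least squares, and assumes an image is represented by averaging the transformed vectors of its objects. The names learn_transformation and embed_image, the regularizer reg, and the synthetic data are all hypothetical and are not taken from the paper.

```python
import numpy as np


def learn_transformation(prior, target, reg=1e-3):
    """Closed-form solution of min_W ||prior @ W - target||_F^2 + reg * ||W||_F^2.

    This ridge-regression objective is an assumption for illustration only.
    """
    d = prior.shape[1]
    gram = prior.T @ prior + reg * np.eye(d)
    return np.linalg.solve(gram, prior.T @ target)


def embed_image(object_vectors, W):
    """Map each object's prior vector into the learned space and average them.

    Mean pooling is an assumed aggregation, not the paper's stated method.
    """
    return (object_vectors @ W).mean(axis=0)


# Synthetic stand-ins: 200 concepts with 16-dimensional prior vectors
# and 8-dimensional target embeddings generated by a known linear map.
rng = np.random.default_rng(0)
prior = rng.normal(size=(200, 16))
true_W = rng.normal(size=(16, 8))
target = prior @ true_W

W = learn_transformation(prior, target, reg=1e-8)
image_vec = embed_image(prior[:5], W)  # an "image" containing 5 objects
```

In a full pipeline, vectors such as image_vec would then be passed to a classifier; the abstract names a non-linear SVM for that stage.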
Pages: 669 - 691
Number of pages: 23
Related papers
50 in total
  • [11] Knowledge Transfer for Scene-Specific Motion Prediction
    Ballan, Lamberto
    Castaldo, Francesco
    Alahi, Alexandre
    Palmieri, Francesco
    Savarese, Silvio
    COMPUTER VISION - ECCV 2016, PT I, 2016, 9905 : 697 - 713
  • [12] Knowledge transfer for scene-specific motion prediction
    Ballan, Lamberto
    Castaldo, Francesco
    Alahi, Alexandre
    Palmieri, Francesco
    Savarese, Silvio
    Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 2016, 9905 LNCS : 697 - 713
  • [13] Scene-specific crowd counting using synthetic training images
    Delussu, Rita
    Putzu, Lorenzo
    Fumera, Giorgio
    PATTERN RECOGNITION, 2022, 124
  • [14] SCENE-SPECIFIC MEMORY FOR OBJECTS - A MODEL OF EPISODIC MEMORY IMPAIRMENT IN MONKEYS WITH FORNIX TRANSECTION
    GAFFAN, D
    JOURNAL OF COGNITIVE NEUROSCIENCE, 1994, 6 (04) : 305 - 320
  • [15] Scene-Specific Multiple Cues Integration for Multiperson Tracking
    Dong, Yanmei
    Pei, Mingtao
    Liu, Xiaofeng
    Zhao, Meng
    IEEE TRANSACTIONS ON COGNITIVE AND DEVELOPMENTAL SYSTEMS, 2020, 12 (03) : 511 - 518
  • [16] Semantic Image Manipulation Using Scene Graphs
    Dhamo, Helisa
    Farshad, Azade
    Laina, Iro
    Navab, Nassir
    Hager, Gregory D.
    Tombari, Federico
    Rupprecht, Christian
    2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, : 5212 - 5221
  • [17] Predicting Image Aesthetics using Objects in the Scene
    Roy, Hiya
    Yamasaki, Toshihiko
    Hashimoto, Tatsuaki
    PROCEEDINGS OF THE 2018 INTERNATIONAL JOINT WORKSHOP ON MULTIMEDIA ARTWORKS ANALYSIS AND ATTRACTIVENESS COMPUTING IN MULTIMEDIA (MMART&ACM'18), 2018, : 14 - 19
  • [18] Deep Learning of Scene-Specific Classifier for Pedestrian Detection
    Zeng, Xingyu
    Ouyang, Wanli
    Wang, Meng
    Wang, Xiaogang
    COMPUTER VISION - ECCV 2014, PT III, 2014, 8691 : 472 - 487
  • [19] A parallel vision approach to scene-specific pedestrian detection
    Zhang, Wenwen
    Wang, Kunfeng
    Liu, Yating
    Lu, Yue
    Wang, Fei-Yue
    NEUROCOMPUTING, 2020, 394 (394) : 114 - 126
  • [20] Deep Background Subtraction with Scene-Specific Convolutional Neural Networks
    Braham, Marc
    Van Droogenbroeck, Marc
    PROCEEDINGS OF THE 23RD INTERNATIONAL CONFERENCE ON SYSTEMS, SIGNALS AND IMAGE PROCESSING, (IWSSIP 2016), 2016, : 113 - 116