Bridging semantic gap between high-level and low-level features in content-based video retrieval using multi-stage ESN-SVM classifier

被引:6
|
作者
Brindha, N. [1 ]
Visalakshi, P. [1 ]
机构
[1] PSG Coll Technol, Coimbatore 641004, Tamil Nadu, India
来源
SADHANA-ACADEMY PROCEEDINGS IN ENGINEERING SCIENCES | 2017年 / 42卷 / 01期
关键词
Classification; feature selection; SVM; ESN; spatio-temporal structure;
D O I
10.1007/s12046-016-0574-8
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
Content-based video retrieval system aims at assisting a user to retrieve targeted video sequence in a large database. Most of the search engines use textual annotations to retrieve videos. These types of engines offer a low-level abstraction while the user seeks high-level semantics. Bridging this type of semantic gap in video retrieval remains an important challenge. In this paper, colour, texture and shapes are considered to be low-level features and motion is a high-level feature. Colour histograms convert the RGB colour space into YcbCr and extract hue and saturation values from frames. After colour extraction, filter mask is applied and gradient value is computed. Gradient and threshold values are compared to draw the edge map. Edges are smoothed for sharpening to remove the unnecessary connected components. These diverse shapes are then extracted and stored in shape feature vectors. Finally, an SVM classifier is used for classification of low-level features. For high-level features, depth images are extracted for motion feature identification and classification is done via echo state neural networks (ESN). ESN are a supervised learning technique and follow the principle of recurrent neural networks. ESN are well known for time series classification and also proved their effective performance in gesture detection. By combining the existing algorithms, a high-performance multimedia event detection system is constructed. The effectiveness and efficiency of proposed event detection mechanism is validated using MSR 3D action pair dataset. Experimental results show that the detection accuracy of proposed combination is better than those of other algorithms.
引用
收藏
页码:1 / 10
页数:10
相关论文
共 23 条
  • [1] Bridging semantic gap between high-level and low-level features in content-based video retrieval using multi-stage ESN–SVM classifier
    N Brindha
    P Visalakshi
    Sādhanā, 2017, 42 : 1 - 10
  • [2] Using high-level semantic features in video retrieval
    Zheng, Wujie
    Li, Jianmin
    Si, Zhangzhang
    Lin, Fuzong
    Zhang, Bo
    IMAGE AND VIDEO RETRIEVAL, PROCEEDINGS, 2006, 4071 : 370 - 379
  • [3] Sternum image retrieval based on high-level semantic information and low-level features
    Chen, Qin
    Tai, Xiaoying
    BMEI 2008: PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON BIOMEDICAL ENGINEERING AND INFORMATICS, VOL 1, 2008, : 362 - 366
  • [4] Bridging the Gap between High-Level Reasoning and Low-Level Control
    Caldiran, Ozan
    Haspalamutgil, Kadir
    Ok, Abdullah
    Palaz, Can
    Erdem, Esra
    Patoglu, Volkan
    LOGIC PROGRAMMING AND NONMONOTONIC REASONING, PROCEEDINGS, 2009, 5753 : 342 - 354
  • [5] From low-level features to high-level semantics: Are we bridging the gap?
    Chen, TH
    ISM 2005: SEVENTH IEEE INTERNATIONAL SYMPOSIUM ON MULTIMEDIA, PROCEEDINGS, 2005, : 179 - 179
  • [6] Mapping low-level features to high-level semantic concepts in region-based image retrieval
    Jiang, W
    Chan, KL
    Li, MJ
    Zhang, HJ
    2005 IEEE COMPUTER SOCIETY CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, VOL 2, PROCEEDINGS, 2005, : 244 - 249
  • [7] Automatic classification of tennis video for high-level content-based retrieval
    Sudhir, G
    Lee, JCM
    Jain, AK
    1998 IEEE INTERNATIONAL WORKSHOP ON CONTENT-BASED ACCESS OF IMAGE AND VIDEO DATABASE, PROCEEDINGS, 1998, : 81 - 90
  • [8] Overview of Research on Finding Semantic Meanings From Low-level Features in Content-based Image Retrieval
    Deb, Sagarmay
    JCPC: 2009 JOINT CONFERENCE ON PERVASIVE COMPUTING, 2009, : 203 - 207
  • [9] Content-Based Medical Image Retrieval Using Low-Level Visual Features and Modality Identification
    Caicedo, Juan C.
    Gonzalez, Fabio A.
    Romero, Eduardo
    ADVANCES IN MULTILINGUAL AND MULTIMODAL INFORMATION RETRIEVAL, 2008, 5152 : 615 - 622
  • [10] Interaction between high-level and low-level image analysis for semantic video object extraction
    Cavallaro, A
    Ebrahimi, T
    EURASIP JOURNAL ON APPLIED SIGNAL PROCESSING, 2004, 2004 (06) : 786 - 797