Bridging semantic gap between high-level and low-level features in content-based video retrieval using multi-stage ESN-SVM classifier

Cited by: 6

Authors:

Brindha, N. [1]

Visalakshi, P. [1]

Affiliations:

[1] PSG Coll Technol, Coimbatore 641004, Tamil Nadu, India

Source:

SADHANA-ACADEMY PROCEEDINGS IN ENGINEERING SCIENCES | 2017, Vol. 42, Issue 01

Keywords:

Classification; feature selection; SVM; ESN; spatio-temporal structure

DOI:

10.1007/s12046-016-0574-8

Chinese Library Classification (CLC) number:

T [Industrial Technology]

Discipline classification code:

08

Abstract:

A content-based video retrieval system aims to help a user retrieve a targeted video sequence from a large database. Most search engines use textual annotations to retrieve videos. Such engines offer a low-level abstraction, whereas the user seeks high-level semantics. Bridging this semantic gap in video retrieval remains an important challenge. In this paper, colour, texture and shape are treated as low-level features and motion as a high-level feature. Colour histograms convert the RGB colour space into YCbCr and extract hue and saturation values from frames. After colour extraction, a filter mask is applied and the gradient value is computed. The gradient and threshold values are compared to draw the edge map. Edges are smoothed for sharpening to remove unnecessary connected components. The resulting shapes are then extracted and stored in shape feature vectors. Finally, an SVM classifier is used to classify the low-level features. For high-level features, depth images are extracted for motion feature identification, and classification is done via echo state neural networks (ESN). ESN are a supervised learning technique and follow the principle of recurrent neural networks. ESN are well known for time-series classification and have also proved effective in gesture detection. By combining these existing algorithms, a high-performance multimedia event detection system is constructed. The effectiveness and efficiency of the proposed event detection mechanism are validated using the MSR 3D action pair dataset. Experimental results show that the detection accuracy of the proposed combination is better than that of other algorithms.
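
The edge-map step described in the abstract (apply a filter mask, compute the gradient, compare it with a threshold) can be illustrated with a minimal sketch. The abstract does not name the filter mask or the threshold, so the Sobel kernels, the function name `edge_map`, the threshold value and the toy frame below are all assumptions made for illustration, not the authors' implementation.

```python
# Assumption: Sobel kernels stand in for the unnamed "filter mask".
SOBEL_X = [[-1, 0, 1], [-2, 0, 2], [-1, 0, 1]]
SOBEL_Y = [[-1, -2, -1], [0, 0, 0], [1, 2, 1]]


def edge_map(gray, threshold):
    """Return a binary edge map: 1 where the gradient magnitude exceeds threshold."""
    height, width = len(gray), len(gray[0])
    edges = [[0] * width for _ in range(height)]
    # Border pixels stay 0 because the 3x3 mask does not fit there.
    for r in range(1, height - 1):
        for c in range(1, width - 1):
            gx = gy = 0
            # Slide the 3x3 masks over the neighbourhood of pixel (r, c).
            for dr in range(3):
                for dc in range(3):
                    pixel = gray[r + dr - 1][c + dc - 1]
                    gx += SOBEL_X[dr][dc] * pixel
                    gy += SOBEL_Y[dr][dc] * pixel
            magnitude = (gx * gx + gy * gy) ** 0.5
            # Compare the gradient with the threshold to mark an edge pixel.
            if magnitude > threshold:
                edges[r][c] = 1
    return edges


# Hypothetical 5x5 grayscale frame: a vertical step from dark (0) to bright (255).
frame = [[0, 0, 255, 255, 255] for _ in range(5)]
edges = edge_map(frame, threshold=100)
```

In this toy frame the interior pixels on either side of the intensity step are marked as edges, while the uniform bright region and the border are not. A real pipeline would presumably follow this with the smoothing and connected-component filtering that the abstract mentions.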


Pages: 1-10

Page count: 10

23 items in total

[1] Bridging semantic gap between high-level and low-level features in content-based video retrieval using multi-stage ESN–SVM classifier
N Brindha
P Visalakshi
Sādhanā, 2017, 42 : 1 - 10
[2] Using high-level semantic features in video retrieval
Zheng, Wujie
Li, Jianmin
Si, Zhangzhang
Lin, Fuzong
Zhang, Bo
IMAGE AND VIDEO RETRIEVAL, PROCEEDINGS, 2006, 4071 : 370 - 379
[3] Sternum image retrieval based on high-level semantic information and low-level features
Chen, Qin
Tai, Xiaoying
BMEI 2008: PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON BIOMEDICAL ENGINEERING AND INFORMATICS, VOL 1, 2008, : 362 - 366
[4] Bridging the Gap between High-Level Reasoning and Low-Level Control
Caldiran, Ozan
Haspalamutgil, Kadir
Ok, Abdullah
Palaz, Can
Erdem, Esra
Patoglu, Volkan
LOGIC PROGRAMMING AND NONMONOTONIC REASONING, PROCEEDINGS, 2009, 5753 : 342 - 354
[5] From low-level features to high-level semantics: Are we bridging the gap?
Chen, TH
ISM 2005: SEVENTH IEEE INTERNATIONAL SYMPOSIUM ON MULTIMEDIA, PROCEEDINGS, 2005, : 179 - 179
[6] Mapping low-level features to high-level semantic concepts in region-based image retrieval
Jiang, W
Chan, KL
Li, MJ
Zhang, HJ
2005 IEEE COMPUTER SOCIETY CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, VOL 2, PROCEEDINGS, 2005, : 244 - 249
[7] Automatic classification of tennis video for high-level content-based retrieval
Sudhir, G
Lee, JCM
Jain, AK
1998 IEEE INTERNATIONAL WORKSHOP ON CONTENT-BASED ACCESS OF IMAGE AND VIDEO DATABASE, PROCEEDINGS, 1998, : 81 - 90
[8] Overview of Research on Finding Semantic Meanings From Low-level Features in Content-based Image Retrieval
Deb, Sagarmay
JCPC: 2009 JOINT CONFERENCE ON PERVASIVE COMPUTING, 2009, : 203 - 207
[9] Content-Based Medical Image Retrieval Using Low-Level Visual Features and Modality Identification
Caicedo, Juan C.
Gonzalez, Fabio A.
Romero, Eduardo
ADVANCES IN MULTILINGUAL AND MULTIMODAL INFORMATION RETRIEVAL, 2008, 5152 : 615 - 622
[10] Interaction between high-level and low-level image analysis for semantic video object extraction
Cavallaro, A
Ebrahimi, T
EURASIP JOURNAL ON APPLIED SIGNAL PROCESSING, 2004, 2004 (06) : 786 - 797
