Automatic detection of salient objects and spatial relations in videos for a video database system

被引:25
作者
Sevilmis, Tarkan [1 ]
Bastan, Muhammet [1 ]
Gudukbay, Ugur [1 ]
Ulusoy, Oezguer [1 ]
机构
[1] Bilkent Univ, Dept Comp Engn, TR-06800 Ankara, Turkey
关键词
multimedia databases; salient object detection and tracking; camera focus estimation; object labeling; knowledge-base construction; spatio-temporal queries;
D O I
10.1016/j.imavis.2008.01.001
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Multimedia databases have gained popularity due to rapidly growing quantities of multimedia data and the need to perform efficient indexing, retrieval and analysis of this data. One downside of multimedia databases is the necessity to process the data for feature extraction and labeling prior to storage and querying. Huge amount of data makes it impossible to complete this task manually. We propose a toot for the automatic detection and tracking of salient objects, and derivation of spatio-temporal relations between them in video. Our system aims to reduce the work for manual selection and labeling of objects significantly by detecting and tracking the salient objects, and hence, requiring to enter the label for each object only once within each shot instead of specifying the labels for each object in every frame they appear. This is also required as a first step in a fully-automatic video database management system in which the labeling should also be done automatically. The proposed framework covers a scalable architecture for video processing and stages of shot boundary detection, salient object detection and tracking, and knowledge-base construction for effective spatio-temporal object querying. (c) 2008 Elsevier B.V. All rights reserved.
引用
收藏
页码:1384 / 1396
页数:13
相关论文
共 40 条
[1]  
[Anonymous], 2006, 2006 IEEE COMPUTER S, DOI DOI 10.1109/CVRR.2006.147
[2]  
ASHLEY J, 1995, P SPIE STORAGE RETRI, V3, P24
[3]   Comparison of video shot boundary detection techniques [J].
Boreczky, JS ;
Rowe, LA .
STORAGE AND RETRIEVAL FOR STILL IMAGE AND VIDEO DATABASES IV, 1996, 2670 :170-179
[4]   Blobworld: Image segmentation using expectation-maximization and its application to image querying [J].
Carson, C ;
Belongie, S ;
Greenspan, H ;
Malik, J .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2002, 24 (08) :1026-1038
[5]  
CAVALLARO A, 2002, P 10 ACM INT C MULT, P523
[6]   Using hidden scale for salient object detection [J].
Chalmond, Bernard ;
Francesconi, Benjamin ;
Herbin, Stephane .
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2006, 15 (09) :2644-2656
[7]   VideoQ: An automated content based video search system using visual cues [J].
Chang, SF ;
Chen, W ;
Meng, HJ ;
Sundaram, H ;
Zhong, D .
ACM MULTIMEDIA 97, PROCEEDINGS, 1997, :313-324
[8]   Kernel-based object tracking [J].
Comaniciu, D ;
Ramesh, V ;
Meer, P .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2003, 25 (05) :564-577
[9]   Unsupervised segmentation of color-texture regions in images and video [J].
Deng, YN ;
Manjunath, BS .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2001, 23 (08) :800-810
[10]   Image subtraction for real time moving object extraction [J].
Desa, SM ;
Salih, QA .
INTERNATIONAL CONFERENCE ON COMPUTER GRAPHICS, IMAGING AND VISUALIZATION, PROCEEDINGS, 2004, :41-45