Detecting semantic concepts from video using temporal gradients and audio classification

被引:0
作者
Rautiainen, M
Seppänen, T
Penttilä, J
Peltola, J
机构
[1] Univ Oulu, MediaTeam Oulu, FIN-90014 Oulu, Finland
[2] VTT Tech Res Ctr Finland, FIN-90571 Oulu, Finland
来源
IMAGE AND VIDEO RETRIEVAL, PROCEEDINGS | 2003年 / 2728卷
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper we describe new methods to detect semantic concepts from digital video based on audible and visual content. Temporal Gradient Correlogram captures temporal correlations of gradient edge directions from sampled shot frames. Power-related physical features are extracted from short audio samples in video shots. Video shots containing people, cityscape, landscape, speech or instrumental sound are detected with trained self-organized maps and kNN classification results of audio samples. Test runs and evaluations in TREC 2002 Video Track show consistent performance for Temporal Gradient Correlogram and state-of-the-art precision in audio-based instrumental sound detection.
引用
收藏
页码:260 / 270
页数:11
相关论文
共 27 条
[1]  
[Anonymous], P INF OUL INT WORKSH
[2]  
CAREY MJ, 1999, P ICASSP
[3]  
Chang SF, 1998, 1998 INTERNATIONAL CONFERENCE ON IMAGE PROCESSING - PROCEEDINGS, VOL 3, P531, DOI 10.1109/ICIP.1998.727321
[4]  
DELBIMBO A, 2000, IEEE INT C MULTIMEDI, V2, P671
[5]  
DEVALOIS RL, 1975, HDB PERCEPTION, V5, P117
[6]   QUERY BY IMAGE AND VIDEO CONTENT - THE QBIC SYSTEM [J].
FLICKNER, M ;
SAWHNEY, H ;
NIBLACK, W ;
ASHLEY, J ;
HUANG, Q ;
DOM, B ;
GORKANI, M ;
HAFNER, J ;
LEE, D ;
PETKOVIC, D ;
STEELE, D ;
YANKER, P .
COMPUTER, 1995, 28 (09) :23-32
[7]   TEXTURAL FEATURES FOR IMAGE CLASSIFICATION [J].
HARALICK, RM ;
SHANMUGAM, K ;
DINSTEIN, I .
IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS, 1973, SMC3 (06) :610-621
[8]  
HOYT J, 1994, P ICASSP
[9]   Image indexing using color correlograms [J].
Huang, J ;
Kumar, SR ;
Mitra, M ;
Zhu, WJ ;
Zabih, R .
1997 IEEE COMPUTER SOCIETY CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, PROCEEDINGS, 1997, :762-768
[10]  
*IBM, 2003, IBM CUEV TOOLK