Shot boundary detection using scale invariant feature matching

被引：2

作者：

Park, MH ^{[1
]}

Park, RH ^{[1
]}

Lee, SW ^{[1
]}

机构：

[1] Sogang Univ, Dept Elect Engn, Sinsu Dong, Seoul 121742, South Korea

来源：

VISUAL COMMUNICATIONS AND IMAGE PROCESSING 2006, PTS 1 AND 2 | 2006年 / 6077卷

关键词：

shot boundary detection (SBD); scale invariant feature transform (SIFT); hard-cut; gradual-transition; object recognition;

D O I：

10.1117/12.642244

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

This paper presents a shot boundary detection (SBD) method that finds boundaries between shots using the changes in visual content elements' such as objects, actors, and background. Our work presented in this paper is based on the property that the features do not change significantly within a shot whereas they change substantially across a shot boundary. Noticing this characteristic of shot boundaries, we propose a SBD algorithm using the scale- and rotation-invariant local image descriptors. To obtain information of the content elements, we employ the scale invariant feature transform (SIFT) that has been commonly used in object recognition. The number of matched points is large within the same shot whereas zero or the small number of matched points is detected at the shot boundary because all the elements in the previous shot change abruptly in the next shot. Thus we can determine the existence of shot boundaries by the number of matched points. We identify two types of shot boundaries (hard-cut and gradual-transition such as tiling, panning, and fade in/out) with a adjustable frame distance between consecutive frames. Experimental results with four test videos show the effectiveness of the proposed SBD algorithm using scale invariant feature matching.

引用

页数：9

共 11 条

[1] Automated high-level movie segmentation for advanced video-retrieval systems
Hanjalic, A
Lagendijk, RL
Biemond, J
[J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 1999, 9 (04) : 580 - 588
[2] LAN DJ, 2003, P INT C MULT EXP BAL, V3, P469
[3] Lowe D.G., 1999, P IEEE INT C COMP VI, P1150, DOI DOI 10.1109/ICCV.1999.790410
[4] Distinctive image features from scale-invariant keypoints
Lowe, DG
[J]. INTERNATIONAL JOURNAL OF COMPUTER VISION, 2004, 60 (02) : 91 - 110
[5] Ma YF, 2002, INT C PATT RECOG, P548, DOI 10.1109/ICPR.2002.1048361
[6] Motion analysis and segmentation through spatio-temporal slices processing
Ngo, CW
Pong, TC
Zhang, HJ
[J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2003, 12 (03) : 341 - 355
[7] Rasheed Z, 2003, PROC CVPR IEEE, P343
[8] Sivic J, 2005, LECT NOTES COMPUT SC, V3568, P226
[9] Sivic J, 2004, PROC CVPR IEEE, P488
[10] SVIC J, 2004, P EUR C COMP VIS PRA, V2, P85

← 1 2 →