Adaptive weighted fusion with new spatial and temporal fingerprints for improved video copy detection

Cited by: 18
Authors
Kim, Semin [1]
Choi, Jae Young [1]
Han, Seungwan [2]
Ro, Yong Man [1]
Affiliations
[1] Korea Adv Inst Sci & Technol, Dept Elect Engn, Image & Video Syst Lab, Taejon 305701, South Korea
[2] Elect & Telecommun Res Inst, Cyber Secur Convergence Res Lab, Taejon 305700, South Korea
Keywords
Video copy detection; Video sequence matching; Modality fusion; Video fingerprint; Weighted adaptive fusion; Spatial and temporal information; ROBUST; EFFICIENT; DISTANCE; COLOR; SURF;
DOI
10.1016/j.image.2014.05.002
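Chinese Library Classification number
TM [Electrical engineering]; TN [Electronic and communication technology];
Discipline classification code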
0808 ; 0809 ;
Abstract
In this paper, we propose a novel modality fusion method that combines spatial and temporal fingerprint information to improve video copy detection performance. Most previously developed methods are limited to using pre-specified weights to combine spatial and temporal modality information. Hence, they cannot adaptively adjust the significance of the temporal fingerprints, which depends on the difference between the temporal variances of the compared videos, and this leads to performance degradation in video copy detection. To overcome this limitation, the proposed method extracts two types of fingerprint information: (1) a spatial fingerprint that consists of the signs of DCT coefficients in local areas of a keyframe and (2) a temporal fingerprint that computes the temporal variances in local areas across consecutive keyframes. In addition, a so-called temporal strength measurement technique is developed to quantify the amount of temporal variance; it is used to adaptively weight the significance of the compared temporal fingerprints. The experimental results show that the proposed modality fusion method outperforms other state-of-the-art fusion methods and popular spatio-temporal fingerprints in terms of video copy detection. Furthermore, the proposed method reduces the time needed to perform video fingerprint matching by 39.0%, 25.1%, and 46.1%, without a significant loss of detection accuracy, for our synthetic dataset, the TRECVID 2009 CCD Task, and MUSCLE-VCD 2007, respectively. This result indicates that the proposed method can be readily incorporated into real-life video copy detection systems. (C) 2014 Elsevier B.V. All rights reserved.
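The abstract names three ingredients: a spatial fingerprint built from the signs of block DCT coefficients, a temporal fingerprint built from per-block temporal variances, and a temporal strength that adapts the fusion weight. The sketch below illustrates that idea only and is not the authors' method. The block size, the number of coefficients kept, the distance measures, the weighting rule, and every function and parameter name (`spatial_fingerprint`, `temporal_strength`, `w_max`, and so on) are assumptions made for illustration, since the abstract does not give the actual formulas. Both videos are assumed to be grayscale arrays with the same number of keyframes and the same frame size.

```python
import numpy as np

def dct_matrix(n):
    """Orthonormal DCT-II basis matrix of size n x n."""
    k = np.arange(n)[:, None]
    i = np.arange(n)[None, :]
    m = np.sqrt(2.0 / n) * np.cos(np.pi * (2 * i + 1) * k / (2 * n))
    m[0, :] = np.sqrt(1.0 / n)
    return m

def blocks(frame, block):
    """Split a 2-D frame into non-overlapping block x block tiles."""
    h, w = frame.shape
    h, w = h - h % block, w - w % block
    tiles = frame[:h, :w].reshape(h // block, block, w // block, block)
    return tiles.swapaxes(1, 2).reshape(-1, block, block)

def spatial_fingerprint(keyframe, block=8, n_coeffs=4):
    """Signs of a few AC DCT coefficients per local block (assumed layout)."""
    d = dct_matrix(block)
    coeffs = d @ blocks(keyframe, block) @ d.T  # 2-D DCT of every tile
    # Skip the DC term; take the next n_coeffs in row-major order (arbitrary choice).
    ac = coeffs.reshape(len(coeffs), -1)[:, 1:1 + n_coeffs]
    return (ac >= 0).ravel()

def temporal_fingerprint(keyframes, block=8):
    """Variance over time of each block's mean intensity."""
    means = np.stack([blocks(f, block).mean(axis=(1, 2)) for f in keyframes])
    return means.var(axis=0)

def temporal_strength(t_fp):
    """Scalar summary of temporal variance (assumed: the mean over blocks)."""
    return float(t_fp.mean())

def fused_distance(q_frames, r_frames, w_max=0.5, eps=1e-9):
    """Fuse spatial and temporal distances with an adaptive temporal weight."""
    s_q = np.concatenate([spatial_fingerprint(f) for f in q_frames])
    s_r = np.concatenate([spatial_fingerprint(f) for f in r_frames])
    d_s = float(np.mean(s_q != s_r))  # normalized Hamming distance

    t_q = temporal_fingerprint(q_frames)
    t_r = temporal_fingerprint(r_frames)
    d_t = float(np.abs(t_q - t_r).sum() / (t_q.sum() + t_r.sum() + eps))

    # Assumed rule: trust the temporal cue less when the two temporal
    # strengths differ strongly or when either video is nearly static.
    a, b = temporal_strength(t_q), temporal_strength(t_r)
    w_t = w_max * min(a, b) / (max(a, b) + eps)
    return (1.0 - w_t) * d_s + w_t * d_t
```

With this rule the temporal weight falls toward zero when one of the compared videos is almost static, so the decision rests on the spatial fingerprint alone. A fixed, pre-specified weight would not adapt in that case, which is the limitation the abstract attributes to earlier fusion methods.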
Pages: 788-806
Number of pages: 19