A C3D-based Convolutional Neural Network for Frame Dropping Detection in a Single Video Shot

被引:32
作者
Long, Chengjiang [1 ]
Smith, Eric [1 ]
Basharat, Arslan [1 ]
Hoogs, Anthony [1 ]
机构
[1] Kitware Inc, 28 Corp Dr, Clifton Pk, NY 12065 USA
来源
2017 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW) | 2017年
关键词
D O I
10.1109/CVPRW.2017.237
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Frame dropping is a type of video manipulation where consecutive frames are deleted to omit content from the original video. Automatically detecting dropped frames across a large archive of videos while maintaining a low false alarm rate is a challenging task in digital video forensics. We propose a new approach for forensic analysis by exploiting the local spatio-temporal relationships within a portion of a video to robustly detect frame removals. In this paper, we propose to adapt the Convolutional 3D Neural Network (C3D) for frame drop detection. In order to further suppress the errors due by the network, we produce a refined video-level confidence score and demonstrate that it is superior to the raw output scores from the network. We conduct experiments on two challenging video datasets containing rapid camera motion and zoom changes. The experimental results clearly demonstrate the efficacy of the proposed approach.
引用
收藏
页码:1898 / 1906
页数:9
相关论文
共 15 条
[1]  
[Anonymous], NATL TELECOMMUNICATI
[2]  
[Anonymous], 2014, CVPR
[3]  
[Anonymous], LEARNING BASED REFER
[4]  
[Anonymous], 2014, 13 INT WORKSH DIG FO
[5]  
[Anonymous], 2014, NIPS
[6]  
Chao J., 2013, INT WORKSH DIG WAT, P267, DOI DOI 10.1007/978-3-642-40099-5_22
[7]   Exposing Digital Forgeries in Ballistic Motion [J].
Conotter, Valentina ;
O'Brien, James F. ;
Farid, Hany .
IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY, 2012, 7 (01) :283-296
[8]   Learning Spatiotemporal Features with 3D Convolutional Networks [J].
Du Tran ;
Bourdev, Lubomir ;
Fergus, Rob ;
Torresani, Lorenzo ;
Paluri, Manohar .
2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, :4489-4497
[9]   ImageNet Classification with Deep Convolutional Neural Networks [J].
Krizhevsky, Alex ;
Sutskever, Ilya ;
Hinton, Geoffrey E. .
COMMUNICATIONS OF THE ACM, 2017, 60 (06) :84-90
[10]   Video shot boundary detection: Seven years of TRECVid activity [J].
Smeaton, Alan F. ;
Over, Paul ;
Doherty, Aiden R. .
COMPUTER VISION AND IMAGE UNDERSTANDING, 2010, 114 (04) :411-418