CNN-based camera motion classification using HSI color model for compressed videos

被引：0

作者：

Pavan Sandula

Harish Reddy Kolanu

Manish Okade

机构：

[1] National Institute of Technology (NIT),Department of Electronics and Communication Engineering

来源：

Signal, Image and Video Processing | 2022年 / 16卷

关键词：

Convolutional neural network; Camera motion classification; HSI color model; Compressed domain; Block motion vectors;

D O I：

暂无

中图分类号：

学科分类号：

摘要：

This paper presents a novel camera motion classification framework based on modeling the compressed domain block motion vectors using the HSI color model. The input to the proposed method is the interframe block motion vectors decoded from the compressed bitstream. The block motion vector’s magnitude and orientation are estimated, followed by assigning motion vector orientation to Hue, motion vector magnitude to Saturation, and keeping Intensity at a fixed value. The HSI assignment is then converted into an RGB image followed by supervised learning utilizing a convolutional neural network to recognize eleven camera motion patterns comprising seven pure camera motion patterns and four mixed camera patterns. The proposed method’s premise is based on posing the camera motion classification problem as a color recognition task. Detailed experimental analysis that includes a comparison with state-of-the-art methods, ablation study, and robustness analysis is carried out utilizing block motion vectors obtained from H.264/AVC encoded videos. Results demonstrate accuracies of over 98 % in recognizing eleven camera patterns for the proposed method.

引用

页码：103 / 110

页数：7

共 27 条

[1]

Duan L-Y(2006)Nonparametric motion characterization for robust classification of camera motion patterns IEEE Trans. Multimed. 8 323-340

[2]

Jin JS(2017)Classification and retrieval of radiology images in h.264/avc compressed domain Signal Image Video Process. 11 573-580

[3]

Tian Q(2014)Camhid: camera motion histogram descriptor and its application to cinematographic shot classification IEEE Trans. Circuits Syst. Video Technol. 24 1682-1695

[4]

Xu C-S(2016)Robust learning-based camera motion characterization scheme with applications to video stabilization IEEE Trans. Circuits Syst. Video Technol. 26 453-466

[5]

Yamaghani M(2019)Predict vehicle collision by ttc from motion using a single video camera IEEE Trans. Intell. Transp. Syst. 20 522-533

[6]

Zargari F(2016)Saliency detection in mpeg and hevc video using intra-frame and inter-frame distances Signal Image Video Process. 10 703-709

[7]

Hasan MA(2020)Static video summarization using multi-cnn with sparse autoencoder and random forest classifier Signal Image Video Process. 15 735-885

[8]

Xu M(2019)Compressed domain zoom motion classification using local tetra patterns Signal Image Video Process 13 879-313

[9]

He X(2013)Video object tracking in the compressed domain using spatio-temporal markov random fields IEEE Trans. Image Process. 22 300-520

[10]

Xu C(2014)Tsallis entropy-based information measures for shot boundary detection and keyframe selection Signal Image Video Process. 7 507-undefined

← 1 2 3 →