Video classification and retrieval through spatio-temporal Radon features

被引:14
作者
Sasithradevi, A. [1 ]
Roomi, S. Mohamed Mansoor [2 ]
机构
[1] VV Coll Engn, Elect & Commun Engn, Tuticorin 627657, Tamil Nadu, India
[2] Thiagarajar Coll Engn, Elect & Commun Engn, Madurai 625015, Tamil Nadu, India
关键词
Bagged trees classification model; Linear discriminant analysis; Radon Projections; Spatio temporal feature; Video classification and retrieval; HUMAN ACTION RECOGNITION;
D O I
10.1016/j.patcog.2019.107099
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The rise in the availability of video content for access via the Internet and the medium of television has resulted in the development of automatic search procedures to retrieve the desired video. Searches can be simplified and hastened by employing automatic classification of videos. This paper proposes a descriptor called the Spatio-Temporal Histogram of Radon Projections (STHRP) for representing the temporal pattern of the contents of a video and demonstrates its application to video classification and retrieval. The first step in STHRP pattern computation is to represent any video as Three Orthogonal Planes (TOPs), i.e., XY, XT and YT, signifying the spatial and temporal contents. Frames corresponding to each plane are partitioned into overlapping blocks. Radon projections are obtained over these blocks at different orientations, resulting in weighted transform coefficients that are normalized and grouped into bins. Linear Discriminant Analysis (LDA) is performed over these coefficients of the TOPs to arrive at a compact description of STHRP pattern. Compared to existing classification and retrieval approaches, the proposed descriptor is highly robust to translation, rotation and illumination variations in videos. To evaluate the capabilities of the invariant STHRP pattern, we analyse the performance by conducting experiments on the UCF-101, HMDB51, 10contexts and TRECVID data sets for classification and retrieval using a bagged tree model. Experimental evaluation of video classification reveals that STHRP pattern can achieve classification rates of 96.15%, 71.7%, 93.24% and 97.3% for the UCF-101, HMDB51,10contexts and TRECVID 2005 data sets respectively. We conducted retrieval experiments on the TRECVID 2005, JHMDB and 10contexts data sets and the results revealed that STHRP pattern is able to provide the videos relevant to the user's query in minimal time (0.05s) with a good precision rate. (C) 2019 Elsevier Ltd. All rights reserved.
引用
收藏
页数:13
相关论文
共 60 条
[1]  
[Anonymous], IEEE T MULTIMEDIA
[2]  
[Anonymous], IEEE 5 NAT C COMP VI
[3]  
[Anonymous], EEE T ACTIONS SYSTEM
[4]  
[Anonymous], INT C DIG IM COMP TE
[5]  
[Anonymous], 19 INT C PATT REC IC
[6]  
[Anonymous], 2014, ABS14054506 CORR
[7]  
[Anonymous], 2016, CVPR
[8]  
[Anonymous], P CHIN C CCBR
[9]  
[Anonymous], 1983, RADON TRANSFORM ITS
[10]  
[Anonymous], 2015, IJCV