Camera Motion-Based Analysis of User Generated Video

被引:43
作者
Abdollahian, Golnaz [1 ]
Taskiran, Cuneyt M. [2 ]
Pizlo, Zygmunt [3 ]
Delp, Edward J. [1 ]
机构
[1] Purdue Univ, Sch Elect & Comp Engn, W Lafayette, IN 47907 USA
[2] Motorola Inc, Applicat Res & Technol Ctr, Schaumburg, IL 60196 USA
[3] Purdue Univ, Dept Psychol Sci, W Lafayette, IN 47907 USA
关键词
Content-based video analysis; eye tracking; home video; motion-based analysis; regions of interest; saliency maps; user generated video; video summarization; COMPRESSED VIDEO; VISUAL-ATTENTION; MODEL;
D O I
10.1109/TMM.2009.2036286
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In this paper we propose a system for the analysis of user generated video (UGV). UGV often has a rich camera motion structure that is generated at the time the video is recorded by the person taking the video, i.e., the "camera person." We exploit this structure by defining a new concept known as camera view for temporal segmentation of UGV. The segmentation provides a video summary with unique properties that is useful in applications such as video annotation. Camera motion is also a powerful feature for identification of keyframes and regions of interest (ROIs) since it is an indicator of the camera person's interests in the scene and can also attract the viewers' attention. We propose a new location-based saliency map which is generated based on camera motion parameters. This map is combined with other saliency maps generated using features such as color contrast, object motion and face detection to determine the ROIs. In order to evaluate our methods we conducted several user studies. A subjective evaluation indicated that our system produces results that is consistent with viewers' preferences. We also examined the effect of camera motion on human visual attention through an eye tracking experiment. The results showed a high dependency between the distribution of fixation points of the viewers and the direction of camera movement which is consistent with our location-based saliency map.
引用
收藏
页码:28 / 41
页数:14
相关论文
共 49 条
[1]  
ABDOLLAHIAN G, 2009, P IEEE INT C MULT EX
[2]   Analysis of unstructured video based on camera motion [J].
Abdollahian, Golnaz ;
Delp, Edward J. .
MULTIMEDIA CONTENT ACCESS: ALGORITHMS AND SYSTEMS, 2007, 6506
[3]  
ALBIOL A, 2003, P IEEE INT C AC SPEE
[4]  
[Anonymous], 2003, P 11 ACM INT C MULTI, DOI DOI 10.1145/957013.957094
[5]  
BABAGUCHI N, 2000, P ACM MULT 2000 WORK, P205
[6]   A GENERAL DISTRIBUTION THEORY FOR A CLASS OF LIKELIHOOD CRITERIA [J].
BOX, GEP .
BIOMETRIKA, 1949, 36 (3-4) :317-346
[7]   A tutorial on Support Vector Machines for pattern recognition [J].
Burges, CJC .
DATA MINING AND KNOWLEDGE DISCOVERY, 1998, 2 (02) :121-167
[8]   LIBSVM: A Library for Support Vector Machines [J].
Chang, Chih-Chung ;
Lin, Chih-Jen .
ACM TRANSACTIONS ON INTELLIGENT SYSTEMS AND TECHNOLOGY, 2011, 2 (03)
[9]   A visual attention model for adapting images on small displays [J].
Chen, LQ ;
Xie, X ;
Fan, X ;
Ma, WY ;
Zhang, HJ ;
Zhou, HQ .
MULTIMEDIA SYSTEMS, 2003, 9 (04) :353-364
[10]  
Choi JW, 2000, PROCEEDINGS OF THE 43RD IEEE MIDWEST SYMPOSIUM ON CIRCUITS AND SYSTEMS, VOLS I-III, P758, DOI 10.1109/MWSCAS.2000.952867