A Graph-Theoretic Framework for Summarizing First-Person Videos

被引：4

作者：

Sahu, Abhimanyu ^{[1
]}

Chowdhury, Ananda S. ^{[1
]}

机构：

[1] Jadavpur Univ, Dept Elect & Telecommun Engn, Kolkata 700032, India

来源：

GRAPH-BASED REPRESENTATIONS IN PATTERN RECOGNITION, GBRPR 2019 | 2019年 / 11510卷

关键词：

First-person video; Center-surround model; Spectral graph dissimilarity; Video similarity graph; MST; Inadmissible edge;

D O I：

10.1007/978-3-030-20081-7_18

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

First-person video summarization has emerged as an important problem in the areas of computer vision and multimedia communities. In this paper, we present a graph-theoretic framework for summarizing first-person (egocentric) videos at frame level. We first develop a new way of characterizing egocentric video frames by building a center-surround model based on spectral measures of dissimilarity between two graphs representing the center and the surrounding regions in a frame. The frames in a video are next represented by a weighted graph (video similarity graph) in the feature space constituting center-surround differences in entropy and optic flow values along with PHOG (Pyramidal HOG) features. The frames are finally clustered using a MST based approach with a new measure of inadmissibility for edges based on neighbourhood analysis. Frames closest to the centroid of each cluster are used to build the summary. Experimental comparisons on two standard datasets clearly indicate the advantage of our solution.

引用

页码：183 / 193

页数：11

共 18 条

[11] Lee YJ, 2012, PROC CVPR IEEE, P1346, DOI 10.1109/CVPR.2012.6247820
[12] Story-Driven Summarization for Egocentric Video
Lu, Zheng
Grauman, Kristen
[J]. 2013 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2013, : 2714 - 2721
[13] Unsupervised Video Summarization with Adversarial LSTM Networks
Mahasseni, Behrooz
Lam, Michael
Todorovic, Sinisa
[J]. 30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 2982 - 2991
[14] Video Summarization Using Deep Semantic Features
Otani, Mayu
Nakashima, Yuta
Rahtu, Esa
Heikkila, Janne
Yokoya, Naokazu
[J]. COMPUTER VISION - ACCV 2016, PT V, 2017, 10115 : 361 - 377
[15] Sahu A, 2018, INT C PATT RECOG, P2887, DOI 10.1109/ICPR.2018.8546119
[16] Song YL, 2015, PROC CVPR IEEE, P5179, DOI 10.1109/CVPR.2015.7299154
[17] GRAPH-THEORETICAL METHODS FOR DETECTING AND DESCRIBING GESTALT CLUSTERS
ZAHN, CT
[J]. IEEE TRANSACTIONS ON COMPUTERS, 1971, C 20 (01) : 68 - &
[18] Video Summarization with Long Short-Term Memory
Zhang, Ke
Chao, Wei-Lun
Sha, Fei
Grauman, Kristen
[J]. COMPUTER VISION - ECCV 2016, PT VII, 2016, 9911 : 766 - 782

← 1 2 →