A Graph-Theoretic Framework for Summarizing First-Person Videos

被引:4
作者
Sahu, Abhimanyu [1 ]
Chowdhury, Ananda S. [1 ]
机构
[1] Jadavpur Univ, Dept Elect & Telecommun Engn, Kolkata 700032, India
来源
GRAPH-BASED REPRESENTATIONS IN PATTERN RECOGNITION, GBRPR 2019 | 2019年 / 11510卷
关键词
First-person video; Center-surround model; Spectral graph dissimilarity; Video similarity graph; MST; Inadmissible edge;
D O I
10.1007/978-3-030-20081-7_18
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
First-person video summarization has emerged as an important problem in the areas of computer vision and multimedia communities. In this paper, we present a graph-theoretic framework for summarizing first-person (egocentric) videos at frame level. We first develop a new way of characterizing egocentric video frames by building a center-surround model based on spectral measures of dissimilarity between two graphs representing the center and the surrounding regions in a frame. The frames in a video are next represented by a weighted graph (video similarity graph) in the feature space constituting center-surround differences in entropy and optic flow values along with PHOG (Pyramidal HOG) features. The frames are finally clustered using a MST based approach with a new measure of inadmissibility for edges based on neighbourhood analysis. Frames closest to the centroid of each cluster are used to build the summary. Experimental comparisons on two standard datasets clearly indicate the advantage of our solution.
引用
收藏
页码:183 / 193
页数:11
相关论文
共 18 条
  • [11] Lee YJ, 2012, PROC CVPR IEEE, P1346, DOI 10.1109/CVPR.2012.6247820
  • [12] Story-Driven Summarization for Egocentric Video
    Lu, Zheng
    Grauman, Kristen
    [J]. 2013 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2013, : 2714 - 2721
  • [13] Unsupervised Video Summarization with Adversarial LSTM Networks
    Mahasseni, Behrooz
    Lam, Michael
    Todorovic, Sinisa
    [J]. 30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 2982 - 2991
  • [14] Video Summarization Using Deep Semantic Features
    Otani, Mayu
    Nakashima, Yuta
    Rahtu, Esa
    Heikkila, Janne
    Yokoya, Naokazu
    [J]. COMPUTER VISION - ACCV 2016, PT V, 2017, 10115 : 361 - 377
  • [15] Sahu A, 2018, INT C PATT RECOG, P2887, DOI 10.1109/ICPR.2018.8546119
  • [16] Song YL, 2015, PROC CVPR IEEE, P5179, DOI 10.1109/CVPR.2015.7299154
  • [18] Video Summarization with Long Short-Term Memory
    Zhang, Ke
    Chao, Wei-Lun
    Sha, Fei
    Grauman, Kristen
    [J]. COMPUTER VISION - ECCV 2016, PT VII, 2016, 9911 : 766 - 782