A Graph-Theoretic Framework for Summarizing First-Person Videos

被引:4
作者
Sahu, Abhimanyu [1 ]
Chowdhury, Ananda S. [1 ]
机构
[1] Jadavpur Univ, Dept Elect & Telecommun Engn, Kolkata 700032, India
来源
GRAPH-BASED REPRESENTATIONS IN PATTERN RECOGNITION, GBRPR 2019 | 2019年 / 11510卷
关键词
First-person video; Center-surround model; Spectral graph dissimilarity; Video similarity graph; MST; Inadmissible edge;
D O I
10.1007/978-3-030-20081-7_18
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
First-person video summarization has emerged as an important problem in the areas of computer vision and multimedia communities. In this paper, we present a graph-theoretic framework for summarizing first-person (egocentric) videos at frame level. We first develop a new way of characterizing egocentric video frames by building a center-surround model based on spectral measures of dissimilarity between two graphs representing the center and the surrounding regions in a frame. The frames in a video are next represented by a weighted graph (video similarity graph) in the feature space constituting center-surround differences in entropy and optic flow values along with PHOG (Pyramidal HOG) features. The frames are finally clustered using a MST based approach with a new measure of inadmissibility for edges based on neighbourhood analysis. Frames closest to the centroid of each cluster are used to build the summary. Experimental comparisons on two standard datasets clearly indicate the advantage of our solution.
引用
收藏
页码:183 / 193
页数:11
相关论文
共 18 条
  • [1] VISON: Video Summarization for ONline applications
    Almeida, Jurandy
    Leite, Neucimar J.
    Torres, Ricardo da S.
    [J]. PATTERN RECOGNITION LETTERS, 2012, 33 (04) : 397 - 409
  • [2] [Anonymous], 2007, P 6 ACM INT C IM VID, DOI DOI 10.1145/1282280.1282340
  • [3] An extensive comparative study of cluster validity indices
    Arbelaitz, Olatz
    Gurrutxaga, Ibai
    Muguerza, Javier
    Perez, Jesus M.
    Perona, Inigo
    [J]. PATTERN RECOGNITION, 2013, 46 (01) : 243 - 256
  • [4] Summarizing video sequence using a graph-based hierarchical approach
    Belo, Luciana dos Santos
    Caetano, Carlos Antonio, Jr.
    do Patrocinio, Zenilton Kleber Goncalves, Jr.
    Ferzoli Guimaraes, Silvio Jamil
    [J]. NEUROCOMPUTING, 2016, 173 : 1001 - 1016
  • [5] Summarization of Egocentric Videos: A Comprehensive Survey
    del Molino, Ana Garcia
    Tan, Cheston
    Lim, Joo-Hwee
    Tan, Ah-Hwee
    [J]. IEEE TRANSACTIONS ON HUMAN-MACHINE SYSTEMS, 2017, 47 (01) : 65 - 76
  • [6] Gangapure Vijay N., 2015, Graph-Based Representations in Pattern Recognition. 10th IAPR-TC-15 International Workshop, GbRPR 2015. Proceedings: LNCS 9069, P282, DOI 10.1007/978-3-319-18224-7_28
  • [7] Guimaraes SJF, 2010, LECT NOTES COMPUT SC, V6419, P46
  • [8] Spatial and temporal scoring for egocentric video summarization
    Guo, Zhao
    Gao, Lianli
    Zhen, Xiantong
    Zou, Fuhao
    Shen, Fumin
    Zheng, Kai
    [J]. NEUROCOMPUTING, 2016, 208 : 299 - 308
  • [9] Gygli M, 2015, PROC CVPR IEEE, P3090, DOI 10.1109/CVPR.2015.7298928
  • [10] Creating Summaries from User Videos
    Gygli, Michael
    Grabner, Helmut
    Riemenschneider, Hayko
    Van Gool, Luc
    [J]. COMPUTER VISION - ECCV 2014, PT VII, 2014, 8695 : 505 - 520