A review on video summarization techniques

被引:21
作者
Meena, Preeti [1 ]
Kumar, Himanshu [1 ]
Yadav, Sandeep Kumar [1 ]
机构
[1] Indian Inst Technol, Dept Elect Engn, Jodhpur 342037, Rajasthan, India
关键词
Video summarization; Single view; Multi-view; Multi-modal; Modality fusion; KEY-FRAME EXTRACTION; SCENE DETECTION; SALIENCY; RECOGNITION; SYSTEM; VISUALIZATION; SELECTION; GUIDANCE; SYNOPSIS; FUSION;
D O I
10.1016/j.engappai.2022.105667
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The exponential growth of technology has resulted in a profusion of advanced imaging devices and eases internet accessibility, leading to an increase in the creation and use of multimedia content. Analyzing representative or meaningful information from such massive data is a time-consuming task that impacts the efficiency of various video processing applications, including video searching, retrieval, indexing, sharing, and many more. In literature, numerous video summarization techniques which extract key-frames or key-shots from the original video to generate a concise yet informative summary have been proposed to address these issues. This paper presents a discussion of the state-of-the-art video summarization techniques along with limitations and challenges. The paper examines summarization techniques in a holistic manner based upon the distinct attributes of evolving video data types on the basis of parameters such as the number of views, dimensions, modality, and content. Such a categorization framework enables us to critically analyze the recent progress, future directions, limitations, datasets, application domains etc., in a better comprehensible manner.
引用
收藏
页数:23
相关论文
共 212 条
  • [1] CNN-Based Prediction of Frame-Level Shot Importance for Video Summarization
    Al Nahian, Mohaiminul
    Iftekhar, A. S. M.
    Islam, Mohammad Tariqul
    Rahman, S. M. Mahbubur
    Hatzinakos, Dimitrios
    [J]. 2017 INTERNATIONAL CONFERENCE ON NEW TRENDS IN COMPUTING SCIENCES (ICTCS), 2017, : 24 - 29
  • [2] Almeida J., 2010, Proceedings 2010 IEEE International Symposium on Multimedia (ISM 2010), P113, DOI 10.1109/ISM.2010.25
  • [3] [Anonymous], 2010, Image Analysis for Multimedia Interactive Services (WIAMIS), 2010 11th International Workshop on, DOI DOI 10.1109/WIC0M.2010.5601233
  • [4] [Anonymous], 2014, COMPUT NOW
  • [5] [Anonymous], YOUTUBE 8 M
  • [6] [Anonymous], 2007, COMPUTER ANIMATION S
  • [7] Apostolidis E., 2021, arXiv
  • [8] Apostolidis E., 2019, P 1 INT WORKSHOP AI, P17
  • [9] AC-SUM-GAN: Connecting Actor-Critic and Generative Adversarial Networks for Unsupervised Video Summarization
    Apostolidis, Evlampios
    Adamantidou, Eleni
    Metsai, Alexandros, I
    Mezaris, Vasileios
    Patras, Ioannis
    [J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2021, 31 (08) : 3278 - 3292
  • [10] Unsupervised Video Summarization via Attention-Driven Adversarial Learning
    Apostolidis, Evlampios
    Adamantidou, Eleni
    Metsai, Alexandros, I
    Mezaris, Vasileios
    Patras, Ioannis
    [J]. MULTIMEDIA MODELING (MMM 2020), PT I, 2020, 11961 : 492 - 504