A review on video summarization techniques

被引：21

作者：

Meena, Preeti ^{[1
]}

Kumar, Himanshu ^{[1
]}

Yadav, Sandeep Kumar ^{[1
]}

机构：

[1] Indian Inst Technol, Dept Elect Engn, Jodhpur 342037, Rajasthan, India

来源：

ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE | 2023年 / 118卷

关键词：

Video summarization; Single view; Multi-view; Multi-modal; Modality fusion; KEY-FRAME EXTRACTION; SCENE DETECTION; SALIENCY; RECOGNITION; SYSTEM; VISUALIZATION; SELECTION; GUIDANCE; SYNOPSIS; FUSION;

D O I：

10.1016/j.engappai.2022.105667

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

The exponential growth of technology has resulted in a profusion of advanced imaging devices and eases internet accessibility, leading to an increase in the creation and use of multimedia content. Analyzing representative or meaningful information from such massive data is a time-consuming task that impacts the efficiency of various video processing applications, including video searching, retrieval, indexing, sharing, and many more. In literature, numerous video summarization techniques which extract key-frames or key-shots from the original video to generate a concise yet informative summary have been proposed to address these issues. This paper presents a discussion of the state-of-the-art video summarization techniques along with limitations and challenges. The paper examines summarization techniques in a holistic manner based upon the distinct attributes of evolving video data types on the basis of parameters such as the number of views, dimensions, modality, and content. Such a categorization framework enables us to critically analyze the recent progress, future directions, limitations, datasets, application domains etc., in a better comprehensible manner.

引用

页数：23

共 212 条

[1] CNN-Based Prediction of Frame-Level Shot Importance for Video Summarization
Al Nahian, Mohaiminul
Iftekhar, A. S. M.
Islam, Mohammad Tariqul
Rahman, S. M. Mahbubur
Hatzinakos, Dimitrios
[J]. 2017 INTERNATIONAL CONFERENCE ON NEW TRENDS IN COMPUTING SCIENCES (ICTCS), 2017, : 24 - 29
[2] Almeida J., 2010, Proceedings 2010 IEEE International Symposium on Multimedia (ISM 2010), P113, DOI 10.1109/ISM.2010.25
[3] [Anonymous], 2010, Image Analysis for Multimedia Interactive Services (WIAMIS), 2010 11th International Workshop on, DOI DOI 10.1109/WIC0M.2010.5601233
[4] [Anonymous], 2014, COMPUT NOW
[5] [Anonymous], YOUTUBE 8 M
[6] [Anonymous], 2007, COMPUTER ANIMATION S
[7] Apostolidis E., 2021, arXiv
[8] Apostolidis E., 2019, P 1 INT WORKSHOP AI, P17
[9] AC-SUM-GAN: Connecting Actor-Critic and Generative Adversarial Networks for Unsupervised Video Summarization
Apostolidis, Evlampios
Adamantidou, Eleni
Metsai, Alexandros, I
Mezaris, Vasileios
Patras, Ioannis
[J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2021, 31 (08) : 3278 - 3292
[10] Unsupervised Video Summarization via Attention-Driven Adversarial Learning
Apostolidis, Evlampios
Adamantidou, Eleni
Metsai, Alexandros, I
Mezaris, Vasileios
Patras, Ioannis
[J]. MULTIMEDIA MODELING (MMM 2020), PT I, 2020, 11961 : 492 - 504

← 1 2 3 4 5 6 7 8 9 10 →