Effective Video Summarization Approach Based on Visual Attention

Cited by: 5

Authors:

Ahmad, Hilal [1]

Khan, Habib Ullah [2]

Ali, Sikandar [3]

Rahman, Syed Ijaz Ur [1]

Wahid, Fazli [3]

Khattak, Hizbullah [4]

Affiliations:

[1] Islamia Coll Peshawar Khyber, Dept Comp Sci, Pakhtunkhwa, Pakistan

[2] Qatar Univ, Dept Accounting & Informat Syst, Coll Business & Econ, Doha 2713, Qatar

[3] Univ Haripur, Dept Informat Technol, Khyber Pakhtunkhwa, Pakistan

[4] Hazara Univ Mansehra, Dept Informat Technol, Khyber Pakhtunkhwa, Pakistan

Source:

CMC-COMPUTERS MATERIALS & CONTINUA | 2022, Vol. 71, No. 1

Keywords:

KFE; video summarization; visual saliency; visual attention model; KEY-FRAME EXTRACTION; RETRIEVAL; FUSION; MODEL;

DOI:

10.32604/cmc.2022.021158

Chinese Library Classification (CLC) number:

TP [Automation technology, computer technology]

Discipline classification code:

0812

Abstract:

Video summarization reduces redundancy by producing a concise representation of a video in the form of key frames. More recently, video summaries have been generated through visual attention modeling: frames that stand out visually are extracted as key frames, based on theories of human attention. Visual attention models have proven effective for video summarization. Nevertheless, their high computational cost restricts their usability in everyday situations. In this context, we propose a key frame extraction (KFE) method built on an efficient and accurate visual attention model. Computational effort is reduced by deriving dynamic visual saliency from the temporal gradient instead of traditional optical flow techniques. In addition, an efficient technique based on the discrete cosine transform is used for static visual saliency. The dynamic and static visual attention measures are merged by means of a non-linear weighted fusion technique. The results of the system are compared with those of existing state-of-the-art techniques in terms of accuracy. The experimental results indicate that the proposed model extracts key frames efficiently and with high quality.
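The abstract names three ingredients (temporal-gradient dynamic saliency, DCT-based static saliency, and a non-linear weighted fusion) without giving formulas. The following is a minimal sketch of how such a pipeline could be wired together, not the authors' implementation. Everything specific in it is an assumption made for illustration: the sign-of-DCT saliency map, the use of the map's peak as the per-frame static score, the fusion weight `w_dyn = a_dyn * exp(1 - a_dyn)`, the top-k selection rule, and all function names.

```python
import numpy as np

def dct_matrix(n):
    """Orthonormal DCT-II basis matrix of size n x n."""
    k = np.arange(n)[:, None]
    i = np.arange(n)[None, :]
    m = np.sqrt(2.0 / n) * np.cos(np.pi * (2 * i + 1) * k / (2 * n))
    m[0, :] = np.sqrt(1.0 / n)
    return m

def static_score(frame):
    """Static saliency score of one grayscale frame (assumed scheme).

    The map is the squared inverse DCT of the sign of the DCT coefficients.
    Its total energy is roughly constant across frames (Parseval), so the
    peak, not the mean, is used as the per-frame score.
    """
    h, w = frame.shape
    ch, cw = dct_matrix(h), dct_matrix(w)
    coeffs = ch @ frame @ cw.T                # forward 2-D DCT
    recon = ch.T @ np.sign(coeffs) @ cw       # inverse 2-D DCT of the signs
    return float((recon ** 2).max())

def dynamic_scores(frames):
    """Mean absolute temporal gradient per frame; the first frame gets 0."""
    diffs = np.abs(np.diff(frames, axis=0)).mean(axis=(1, 2))
    return np.concatenate([[0.0], diffs])

def min_max(x):
    """Rescale scores to [0, 1]; all-equal input maps to zeros."""
    span = x.max() - x.min()
    return (x - x.min()) / span if span > 0 else np.zeros_like(x)

def fuse(a_dyn, a_sta):
    """Non-linear weighted fusion (assumed form): the dynamic weight grows
    with the dynamic score itself, and the static score gets the remainder."""
    w_dyn = a_dyn * np.exp(1.0 - a_dyn)
    return w_dyn * a_dyn + (1.0 - w_dyn) * a_sta

def key_frames(frames, k):
    """Return the sorted indices of the k highest-attention frames and the
    fused attention curve."""
    frames = np.asarray(frames, dtype=float)
    a_dyn = min_max(dynamic_scores(frames))
    a_sta = min_max(np.array([static_score(f) for f in frames]))
    attention = fuse(a_dyn, a_sta)
    top = np.argsort(attention)[-k:]
    return sorted(int(i) for i in top), attention

# Usage on a synthetic clip: six blank 8x8 frames, one with a bright block.
clip = np.zeros((6, 8, 8))
clip[3, 2:4, 2:4] = 1.0
indices, curve = key_frames(clip, k=2)
print(indices, np.round(curve, 3))
```

The temporal gradient here is a plain frame difference, which costs one subtraction per pixel and is the reason the abstract contrasts it with optical flow. A practical system would likely pick local maxima of the attention curve per shot instead of a global top-k, but that choice is outside what the abstract specifies.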


Pages: 1427-1442

Page count: 16

