Optimized deep learning enabled lecture audio video summarization

被引:0
|
作者
Kaur, Preet Chandan [1 ]
Ragha, Leena [1 ,2 ]
机构
[1] DY PATIL Deemed Univ, Ramrao Adik Inst Technol, Comp Engn Fac, Navi Mumbai 400706, Maharashtra, India
[2] BLDEAs VP Dr PGH Coll Engn & Technol, Dept Comp Sci & Engn, Ashram Rd, Vijayapur 586103, Karnataka, India
关键词
Audio Video Summarization; Deep Residual Network; Video Shot Segmentation; E; -learning; YCbCr Space Colour Model;
D O I
10.1016/j.jvcir.2024.104309
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Video summarization plays an important role in multiple applications by compressing lengthy video content into compressed representation. The purpose is to present a fine-tuned deep model for lecture audio video summarization. Initially, the input lecture audio-visual video is taken from the dataset. Then, the video shot segmentation (slide segmentation) is done using the YCbCr space colour model. From each video shot, the audio and video within the video shot are segmented using the Honey Badger-based Bald Eagle Algorithm (HBBEA). The HBBEA is obtained by combining the Bald Eagle Search (BES) and Honey Badger Algorithm (HBA). The DRN training is executed by HBBEA to select the finest DRN weights. The relevant video frames are merged with the audio. The proposed HBBEA-based DRN outperformed with a better F1-Score of 91.9 %, Negative predictive value (NPV) of 89.6 %, Positive predictive value (PPV) of 90.7 %, Accuracy of 91.8 %, precision of 91 %, and recall of 92.8 %.
引用
收藏
页数:15
相关论文
共 50 条
  • [31] Visual Summarization of Lecture Video Segments for Enhanced Navigation
    Rahman, Mohammad Rajiur
    Shah, Shishir
    Subhlok, Jaspal
    2020 IEEE INTERNATIONAL SYMPOSIUM ON MULTIMEDIA (ISM 2020), 2020, : 154 - 157
  • [32] A audio-visual model for efficient video summarization
    El-Nagar, Gamal
    El-Sawy, Ahmed
    Rashad, Metwally
    JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2024, 100
  • [33] Auto-summarization of audio-video presentations
    He, LW
    Sanocki, E
    Gupta, A
    Grudin, J
    ACM MULTIMEDIA 99, PROCEEDINGS, 1999, : 489 - 498
  • [34] Automated MPEG audio-video summarization and description
    Sugano, M
    Nakajima, Y
    Yanagihara, H
    2002 INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, VOL I, PROCEEDINGS, 2002, : 956 - 959
  • [35] Research on perceptual fusion of audio and video based on deep learning
    An, Qing
    Chen, Yanhua
    Wu, Shusen
    2020 INTERNATIONAL CONFERENCE ON IMAGE, VIDEO PROCESSING AND ARTIFICIAL INTELLIGENCE, 2020, 11584
  • [36] AUTOMATIC CONSUMER VIDEO SUMMARIZATION BY AUDIO AND VISUAL ANALYSIS
    Jiang, Wei
    Cotton, Courtenay
    Loui, Alexander C.
    2011 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME), 2011,
  • [37] Video Summarization with Supervised Learning
    Basak, Jayanta
    Luthra, Varun
    Chaudhury, Santanu
    19TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION, VOLS 1-6, 2008, : 863 - +
  • [38] Deep Learning Enabled Semantic Communication Systems for Video Transmission
    Zhang, Zhenguo
    Yang, Qianqian
    He, Shibo
    Chen, Jiming
    2023 IEEE 98TH VEHICULAR TECHNOLOGY CONFERENCE, VTC2023-FALL, 2023,
  • [39] Unsupervised video summarization using deep Non-Local video summarization networks
    Zang, Sha-Sha
    Yu, Hui
    Song, Yan
    Zeng, Ru
    NEUROCOMPUTING, 2023, 519 : 26 - 35
  • [40] An Effective Video Summarization Framework Based on the Object of Interest Using Deep Learning
    Ul Haq, Hafiz Burhan
    Asif, Muhammad
    Ahmad, Maaz Bin
    Ashraf, Rehan
    Mahmood, Toqeer
    MATHEMATICAL PROBLEMS IN ENGINEERING, 2022, 2022