Optimized deep learning enabled lecture audio video summarization

Cited by: 0
Authors
Kaur, Preet Chandan [1]
Ragha, Leena [1, 2]
Affiliations
[1] DY PATIL Deemed Univ, Ramrao Adik Inst Technol, Comp Engn Fac, Navi Mumbai 400706, Maharashtra, India
[2] BLDEAs VP Dr PGH Coll Engn & Technol, Dept Comp Sci & Engn, Ashram Rd, Vijayapur 586103, Karnataka, India
Keywords
Audio Video Summarization; Deep Residual Network; Video Shot Segmentation; E-learning; YCbCr Space Colour Model
DOI
10.1016/j.jvcir.2024.104309
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Video summarization plays an important role in many applications by condensing lengthy video content into a compact representation. This work presents a fine-tuned deep model for lecture audio-video summarization. First, the input lecture video, containing both audio and visual streams, is taken from the dataset. Video shot segmentation (slide segmentation) is then performed using the YCbCr colour space model. Within each video shot, the audio and video are segmented using the Honey Badger-based Bald Eagle Algorithm (HBBEA), which is obtained by combining Bald Eagle Search (BES) with the Honey Badger Algorithm (HBA). HBBEA is used to train the Deep Residual Network (DRN), selecting the best DRN weights. The relevant video frames are then merged with the audio. The proposed HBBEA-based DRN achieved an F1-score of 91.9%, a negative predictive value (NPV) of 89.6%, a positive predictive value (PPV) of 90.7%, an accuracy of 91.8%, a precision of 91%, and a recall of 92.8%.
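The abstract does not state how the YCbCr colour model is used to detect slide changes. As a rough illustration only, the sketch below converts each frame to full-range YCbCr with the standard ITU-R BT.601 (JPEG-style) coefficients and flags a shot boundary when the mean colour shifts sharply between consecutive frames. The function names, the mean-colour distance, and the threshold value are all assumptions made for this example and are not taken from the paper.

```python
from typing import List, Sequence, Tuple

Pixel = Tuple[int, int, int]   # (R, G, B), each channel in 0..255
Frame = Sequence[Pixel]        # a frame flattened into a list of pixels


def rgb_to_ycbcr(pixel: Pixel) -> Tuple[float, float, float]:
    """Convert one 8-bit RGB pixel to full-range YCbCr (BT.601, JPEG form)."""
    r, g, b = pixel
    y = 0.299 * r + 0.587 * g + 0.114 * b
    cb = 128.0 - 0.168736 * r - 0.331264 * g + 0.5 * b
    cr = 128.0 + 0.5 * r - 0.418688 * g - 0.081312 * b
    return y, cb, cr


def mean_ycbcr(frame: Frame) -> Tuple[float, float, float]:
    """Average Y, Cb and Cr over all pixels of a non-empty frame."""
    sums = [0.0, 0.0, 0.0]
    for pixel in frame:
        for i, channel in enumerate(rgb_to_ycbcr(pixel)):
            sums[i] += channel
    n = len(frame)
    return sums[0] / n, sums[1] / n, sums[2] / n


def shot_boundaries(frames: Sequence[Frame], threshold: float = 20.0) -> List[int]:
    """Return every index i at which frame i is judged to start a new shot.

    Assumption for this sketch: a boundary is declared when the L1 distance
    between the mean YCbCr vectors of frames i-1 and i exceeds `threshold`.
    """
    boundaries: List[int] = []
    previous = None
    for i, frame in enumerate(frames):
        current = mean_ycbcr(frame)
        if previous is not None:
            distance = sum(abs(a - b) for a, b in zip(current, previous))
            if distance > threshold:
                boundaries.append(i)
        previous = current
    return boundaries


if __name__ == "__main__":
    # Toy input: three all-white "slides" followed by two all-black ones.
    white = [(255, 255, 255)] * 4
    black = [(0, 0, 0)] * 4
    print(shot_boundaries([white, white, white, black, black]))
```

A real lecture video would need decoded frames from a video library and a threshold tuned on the data; per-channel histograms are a common alternative to the mean-colour comparison used here.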
Pages: 15