Modality correlation-based video summarization

被引:7
|
作者
Wang, Xingrun [1 ]
Nie, Xiushan [2 ]
Liu, Xingbo [1 ]
Wang, Binze [3 ]
Yin, Yilong [4 ]
机构
[1] Shandong Univ, Sch Comp Sci & Technol, Jinan 250101, Shandong, Peoples R China
[2] Shandong Jianzhu Univ, Sch Comp Sci & Technol, Jinan 250101, Shandong, Peoples R China
[3] Changan Univ, Coll Geol Engn & Geomat, Xian 710054, Peoples R China
[4] Shandong Univ, Sch Software Engn, Jinan 250101, Shandong, Peoples R China
基金
中国国家自然科学基金;
关键词
Video summarization; Modality correlation; Modality-specific information; Attention mechanism;
D O I
10.1007/s11042-020-08690-3
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Video summarization is an important technique to help us browse, store, and retrieve a rapidly increasing amount of video data, which extracts frames or shots from the original video. Text information covers important content of a video, and thus a summarization can be generated by exploring the correlation between the frame and text. In this study, we propose a video summarization method based on the modality correlation. With this method, we first learn the correlation between the text and frame in the respective space, and then fuse two correlations to obtain the importance score of each shot. Finally, video shots that have a high importance score are chosen as the video summarization. Compared to previous methods that seldom apply text to generate the video summarization, or only use the latent common information between text and frame, the proposed method fully utilizes not only the latent common but also modality-specific information for a video summarization. Experiments were conducted on the TVSum50 dataset, and the results verify the effectiveness of our proposed approach.
引用
收藏
页码:33875 / 33890
页数:16
相关论文
共 50 条
  • [31] Thermal roots of correlation-based complexity
    Fraundorf, Philip
    COMPLEXITY, 2008, 13 (03) : 18 - 26
  • [32] Penalized regression with correlation-based penalty
    Gerhard Tutz
    Jan Ulbricht
    Statistics and Computing, 2009, 19 : 239 - 253
  • [33] Correlation-based phase noise measurements
    Rubiola, E
    Giordano, V
    REVIEW OF SCIENTIFIC INSTRUMENTS, 2000, 71 (08): : 3085 - 3091
  • [34] Smart Surveillance Based on Video Summarization
    Thomas, Sinnu Susan
    Gupta, Sumana
    Subramanian, Venkatesh K.
    2017 IEEE REGION 10 INTERNATIONAL SYMPOSIUM ON TECHNOLOGIES FOR SMART CITIES (IEEE TENSYMP 2017), 2017,
  • [35] Gesture-based video summarization
    Kosmopoulos, D
    Doulamis, A
    Doulamis, N
    2005 INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), VOLS 1-5, 2005, : 3213 - 3216
  • [36] Human Based Surveillance Video Summarization
    Aydemir, M. Said
    Karsligil, M. Elif
    2013 21ST SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE (SIU), 2013,
  • [37] Video Summarization Based on Multimodal Features
    Zhang, Yu
    Liu, Ju
    Liu, Xiaoxi
    Gao, Xuesong
    INTERNATIONAL JOURNAL OF MULTIMEDIA DATA ENGINEERING & MANAGEMENT, 2020, 11 (04): : 60 - 76
  • [38] VIDEO SUMMARIZATION BASED ON LOCAL FEATURES
    Massaoudi, Mohamed
    Bahroun, Sahbi
    Zagrouba, Ezzeddine
    25. INTERNATIONAL CONFERENCE IN CENTRAL EUROPE ON COMPUTER GRAPHICS, VISUALIZATION AND COMPUTER VISION (WSCG 2017), 2017, 2701 : 13 - 17
  • [39] Video summarization based on semantic representation
    Carlos, RP
    Uehara, K
    ADVANCED MULTIMEDIA CONTENT PROCESSING, 1999, 1554 : 1 - 16
  • [40] Video Summarization Based on Optical Flow
    Jadhav, Dipti
    Bhosle, Udhav
    ADVANCED COMPUTING AND INTELLIGENT ENGINEERING, 2020, 1082 : 333 - 342