Video Summarization with Long Short-Term Memory

Cited by: 412
Authors
Zhang, Ke [1]
Chao, Wei-Lun [1]
Sha, Fei [2]
Grauman, Kristen [3]
Affiliations
[1] Univ Southern Calif, Dept Comp Sci, Los Angeles, CA 90007 USA
[2] Univ Calif Los Angeles, Dept Comp Sci, Los Angeles, CA 90024 USA
[3] Univ Texas Austin, Dept Comp Sci, Austin, TX 78712 USA
Source
COMPUTER VISION - ECCV 2016, PT VII | 2016 / Vol. 9911
Keywords
Video summarization; Long short-term memory; Speech recognition
DOI
10.1007/978-3-319-46478-7_47
Chinese Library Classification
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
We propose a novel supervised learning technique for summarizing videos by automatically selecting keyframes or key subshots. Casting the task as a structured prediction problem, our main idea is to use Long Short-Term Memory (LSTM) to model the variable-range temporal dependency among video frames, so as to derive both representative and compact video summaries. The proposed model successfully accounts for the sequential structure crucial to generating meaningful video summaries, leading to state-of-the-art results on two benchmark datasets. In addition to advances in modeling techniques, we introduce a strategy to address the need for a large amount of annotated data for training complex learning approaches to summarization. There, our main idea is to exploit auxiliary annotated video summarization datasets, in spite of their heterogeneity in visual styles and contents. Specifically, we show that domain adaptation techniques can improve learning by reducing the discrepancies in the original datasets' statistical properties.
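The abstract's core idea, scoring frames with an LSTM that sees temporal context and then keeping the most important ones, can be illustrated with a small sketch. This is not the authors' model or training procedure: the weights are random and untrained, the forward/backward combination and the top-k selection rule are assumptions made for illustration, and the names `TinyLSTM`, `score_frames`, and `select_keyframes`, along with all dimensions, are hypothetical.

```python
import math
import random


def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))


def matvec(W, v):
    """Multiply a matrix (list of rows) by a vector."""
    return [sum(w * x for w, x in zip(row, v)) for row in W]


def rand_matrix(rows, cols, rng, scale=0.1):
    return [[rng.uniform(-scale, scale) for _ in range(cols)] for _ in range(rows)]


class TinyLSTM:
    """Single-layer LSTM with random, untrained weights (illustration only)."""

    def __init__(self, input_dim, hidden_dim, rng):
        self.hidden_dim = hidden_dim
        # One weight matrix per gate (input, forget, output, candidate),
        # applied to the concatenation [x_t, h_{t-1}].
        self.W = {g: rand_matrix(hidden_dim, input_dim + hidden_dim, rng) for g in "ifoc"}
        self.b = {g: [0.0] * hidden_dim for g in "ifoc"}

    def run(self, frames):
        """Return the hidden state after each frame feature vector."""
        h = [0.0] * self.hidden_dim
        c = [0.0] * self.hidden_dim
        states = []
        for x in frames:
            z = list(x) + h
            pre = {g: [a + b for a, b in zip(matvec(self.W[g], z), self.b[g])] for g in "ifoc"}
            i = [sigmoid(v) for v in pre["i"]]
            f = [sigmoid(v) for v in pre["f"]]
            o = [sigmoid(v) for v in pre["o"]]
            cand = [math.tanh(v) for v in pre["c"]]
            # The cell state carries information across a variable range of frames.
            c = [fk * ck + ik * gk for fk, ck, ik, gk in zip(f, c, i, cand)]
            h = [ok * math.tanh(ck) for ok, ck in zip(o, c)]
            states.append(h)
        return states


def score_frames(frames, fwd, bwd, w_out):
    """Return one importance score in (0, 1) per frame.

    Assumption: past and future context are combined by concatenating a
    forward pass and a time-reversed backward pass before a linear readout.
    """
    h_fwd = fwd.run(frames)
    h_bwd = bwd.run(frames[::-1])[::-1]
    return [sigmoid(sum(w * v for w, v in zip(w_out, hf + hb)))
            for hf, hb in zip(h_fwd, h_bwd)]


def select_keyframes(scores, k):
    """Assumed selection rule: keep the k highest-scoring frames, in temporal order."""
    top = sorted(range(len(scores)), key=lambda t: scores[t], reverse=True)[:k]
    return sorted(top)


rng = random.Random(0)
# 20 frames with 8-dimensional stand-in features (real systems would use CNN features).
frames = [[rng.gauss(0.0, 1.0) for _ in range(8)] for _ in range(20)]
fwd, bwd = TinyLSTM(8, 4, rng), TinyLSTM(8, 4, rng)
w_out = [rng.uniform(-1.0, 1.0) for _ in range(8)]
scores = score_frames(frames, fwd, bwd, w_out)
keyframes = select_keyframes(scores, k=5)
print(keyframes)
```

In a supervised setting such as the one the abstract describes, the weights would be fitted to human-annotated frame importance or keyframe labels rather than drawn at random.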
Pages: 766 - 782
Number of pages: 17