Video Description Using Bidirectional Recurrent Neural Networks

Cited by: 18
Authors
Peris, Alvaro [1]
Bolanos, Marc [2, 3]
Radeva, Petia [2, 3]
Casacuberta, Francisco [1]
Affiliations
[1] Univ Politecn Valencia, PRHLT Res Ctr, Valencia, Spain
[2] Univ Barcelona, Barcelona, Spain
[3] Comp Vision Ctr, Bellaterra, Spain
Keywords
Video description; Neural Machine Translation; Bidirectional Recurrent Neural Networks; LSTM; Convolutional Neural Networks
DOI
10.1007/978-3-319-44781-0_1
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Although traditionally used in the machine translation field, the encoder-decoder framework has recently been applied to the generation of video and image descriptions. The combination of Convolutional and Recurrent Neural Networks in these models has been shown to outperform the previous state of the art, obtaining more accurate video descriptions. In this work we propose pushing this model further by introducing two contributions into the encoding stage: first, producing richer image representations by combining object and location information from Convolutional Neural Networks; and second, introducing Bidirectional Recurrent Neural Networks to capture both forward and backward temporal relationships in the input frames.
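The two encoder ideas named in the abstract can be illustrated with a minimal NumPy sketch. It is not the authors' implementation. It assumes per-frame "object" and "location" feature vectors are already available as arrays (random placeholders here), and it swaps in a plain tanh recurrent layer for the LSTM to stay short. All function names, parameter names, and dimensions are hypothetical.

```python
import numpy as np

def rnn_pass(inputs, w_in, w_rec, reverse=False):
    """Run a plain tanh RNN over a (time, features) array.

    Returns one hidden state per time step. With reverse=True the
    sequence is read from the last frame to the first.
    """
    num_steps = len(inputs)
    order = range(num_steps - 1, -1, -1) if reverse else range(num_steps)
    hidden = np.zeros(w_rec.shape[0])
    states = np.zeros((num_steps, w_rec.shape[0]))
    for t in order:
        hidden = np.tanh(inputs[t] @ w_in + hidden @ w_rec)
        states[t] = hidden
    return states

def encode_video(object_feats, location_feats, params):
    """Concatenate the two feature types per frame, then encode bidirectionally."""
    # Richer frame representation: object and location features side by side.
    frames = np.concatenate([object_feats, location_feats], axis=1)
    # Forward pass sees past frames; backward pass sees future frames.
    forward = rnn_pass(frames, params["w_in_f"], params["w_rec_f"])
    backward = rnn_pass(frames, params["w_in_b"], params["w_rec_b"], reverse=True)
    # Each frame's annotation joins both directions.
    return np.concatenate([forward, backward], axis=1)

# Hypothetical sizes and random stand-ins for CNN outputs.
rng = np.random.default_rng(0)
num_frames, obj_dim, loc_dim, hidden_dim = 6, 8, 4, 5
in_dim = obj_dim + loc_dim
params = {
    "w_in_f": rng.normal(scale=0.1, size=(in_dim, hidden_dim)),
    "w_rec_f": rng.normal(scale=0.1, size=(hidden_dim, hidden_dim)),
    "w_in_b": rng.normal(scale=0.1, size=(in_dim, hidden_dim)),
    "w_rec_b": rng.normal(scale=0.1, size=(hidden_dim, hidden_dim)),
}
object_feats = rng.normal(size=(num_frames, obj_dim))
location_feats = rng.normal(size=(num_frames, loc_dim))

encoded = encode_video(object_feats, location_feats, params)
print(encoded.shape)  # (6, 10): one forward+backward annotation per frame
```

In an encoder-decoder setup, a decoder would then consume these per-frame annotations to generate the description; that stage is outside this sketch.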
Pages: 3-11
Page count: 9
Related papers (10 of 50 shown)
  • [1] Bidirectional recurrent neural networks
    Schuster, M
    Paliwal, KK
    IEEE TRANSACTIONS ON SIGNAL PROCESSING, 1997, 45 (11) : 2673 - 2681
  • [2] Attention-Based Bidirectional Recurrent Neural Networks for Description Generation of Videos
    Du, Xiaotong
    Yuan, Jiabin
    Liu, Hu
    CLOUD COMPUTING AND SECURITY, PT VI, 2018, 11068 : 440 - 451
  • [3] Localisation in Wireless Networks using Deep Bidirectional Recurrent Neural Networks
    Lynch, David
    Ho, Lester
    MacDonald, Michael
    O'Neill, Michael
    2020 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2020,
  • [4] Cyclist Trajectory Prediction Using Bidirectional Recurrent Neural Networks
    Saleh, Khaled
    Hossny, Mohammed
    Nahavandi, Saeid
    AI 2018: ADVANCES IN ARTIFICIAL INTELLIGENCE, 2018, 11320 : 284 - 295
  • [5] Metaphor Detection using Ensembles of Bidirectional Recurrent Neural Networks
    Brooks, Jennifer
    Youssef, Abdou
    FIGURATIVE LANGUAGE PROCESSING, 2020, : 244 - 249
  • [6] Deepfake Video Detection Using Recurrent Neural Networks
    Guera, David
    Delp, Edward J.
    2018 15TH IEEE INTERNATIONAL CONFERENCE ON ADVANCED VIDEO AND SIGNAL BASED SURVEILLANCE (AVSS), 2018, : 127 - 132
  • [7] Confidence Estimation and Deletion Prediction Using Bidirectional Recurrent Neural Networks
    Ragni, A.
    Li, Q.
    Gales, M. J. F.
    Wang, Y.
    2018 IEEE WORKSHOP ON SPOKEN LANGUAGE TECHNOLOGY (SLT 2018), 2018, : 204 - 211
  • [8] Bidirectional Recurrent Neural Networks as Generative Models
    Berglund, Mathias
    Raiko, Tapani
    Honkala, Mikko
    Karkkainen, Leo
    Vetek, Akos
    Karhunen, Juha
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 28 (NIPS 2015), 2015, 28
  • [9] Bidirectional Molecule Generation with Recurrent Neural Networks
    Grisoni, Francesca
    Moret, Michael
    Lingwood, Robin
    Schneider, Gisbert
    JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2020, 60 (03) : 1175 - 1183
  • [10] Video Genre Classification using Convolutional Recurrent Neural Networks
    Lakshmi, K. Prasanna
    Solanki, Mihir
    Dara, Jyothi Swaroop
    Kompalli, Avinash Bhargav
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2020, 11 (03) : 170 - 176