Temporal capsule networks for video motion estimation and error concealment

被引:8
作者
Sankisa, Arun [1 ]
Punjabi, Arjun [1 ]
Katsaggelos, Aggelos K. [1 ]
机构
[1] Northwestern Univ, Dept Elect & Comp Engn, Evanston, IL 60208 USA
关键词
Capsule networks; Conv3D; ConvLSTM; Error concealment; Motion estimation;
D O I
10.1007/s11760-020-01671-x
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
In this paper, we present a temporal capsule network architecture to encode motion in videos as an instantiation parameter. The extracted motion is used to perform motion-compensated error concealment. We modify the original architecture and use a carefully curated dataset to enable the training of capsules spatially and temporally. First, we add the temporal dimension by taking co-located "patches" from three consecutive frames obtained from standard video sequences to form input data "cubes." Second, the network is designed with an initial feature extraction layer that operates on all three dimensions to generate spatiotemporal features. Additionally, we implement the PrimaryCaps module with a recurrent layer, instead of a conventional convolutional layer, to extract short-term motion-related temporal dependencies and encode them as activation vectors in the capsule output. Finally, the capsule output is combined with the most-recent past frame and passed through a fully connected reconstruction network to perform motion-compensated error concealment. We study the effectiveness of temporal capsules by comparing the proposed model with architectures that do not include capsules. Although the quality of the reconstruction shows room for improvement, we successfully demonstrate that capsules-based architectures can be designed to operate in the temporal dimension to encode motion-related attributes as instantiation parameters. The accuracy of motion estimation is evaluated by comparing both the reconstructed frame outputs and the corresponding optical flow estimates with ground truth data.
引用
收藏
页码:1369 / 1377
页数:9
相关论文
共 50 条
  • [21] Temporal Error Concealment Technique for MPEG-4 Video Streams
    丁学文
    杨兆选
    郭迎春
    [J]. Transactions of Tianjin University, 2006, (04) : 291 - 296
  • [22] Combined and iterative form of spatial and temporal error concealment for video signals
    Tang, Li
    [J]. IEEE TRANSACTIONS ON BROADCASTING, 2006, 52 (03) : 356 - 361
  • [23] Enhanced Temporal Error Concealment for 1Seg Video Broadcasting
    Wang, Jun
    Tang, Yichun
    Goto, Satoshi
    [J]. ADVANCES IN MULTIMEDIA MODELING, PROCEEDINGS, 2010, 5916 : 45 - 54
  • [24] Temporal shape error concealment by global motion compensation with local refinement
    Soares, Luis Ducla
    Pereira, Fernando
    [J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2006, 15 (06) : 1331 - 1348
  • [25] A novel motion recovery using temporal and spatial correlation for a fast temporal error concealment over H.264 video sequences
    Cholman Nam
    Changgon Chu
    Taeguk Kim
    Sokmin Han
    [J]. Multimedia Tools and Applications, 2020, 79 : 1221 - 1240
  • [26] A novel motion recovery using temporal and spatial correlation for a fast temporal error concealment over H.264 video sequences
    Nam, Cholman
    Chu, Changgon
    Kim, Taeguk
    Han, Sokmin
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2020, 79 (1-2) : 1221 - 1240
  • [27] Multiview video error concealment with improved pixel estimation and illumination compensation
    Lin, Ting-Lan
    Chang, Tsung-En
    Huang, Guei-Shiang
    Chou, Chi-Chan
    [J]. 2013 INTERNATIONAL SYMPOSIUM ON INTELLIGENT SIGNAL PROCESSING AND COMMUNICATIONS SYSTEMS (ISPACS), 2013, : 157 - 162
  • [28] End to End Video Distortion Estimation with Advanced Error Concealment Considerations
    Cheng, Qin
    Agrafiotis, Dimitris
    [J]. 2014 IEEE VISUAL COMMUNICATIONS AND IMAGE PROCESSING CONFERENCE, 2014, : 303 - 306
  • [29] Influence of Block Size on Motion Vector Estimation Error in Enhancement of Video Temporal Resolution
    Vranjes, Denis
    Rimac-Drlje, Snjezana
    Vranjes, Mario
    [J]. PROCEEDINGS OF 2017 INTERNATIONAL CONFERENCE ON SMART SYSTEMS AND TECHNOLOGIES (SST), 2017, : 263 - 267
  • [30] GENERATIVE ADVERSARIAL NETWORKS BASED ERROR CONCEALMENT FOR LOW RESOLUTION VIDEO
    Xiang, Chongyang
    Xu, Jiajun
    Yan, Chuan
    Peng, Qiang
    Wu, Xiao
    [J]. 2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 1827 - 1831