Temporal capsule networks for video motion estimation and error concealment

Cited by: 8

Authors:

Sankisa, Arun [1]

Punjabi, Arjun [1]

Katsaggelos, Aggelos K. [1]

Affiliations:

[1] Northwestern Univ, Dept Elect & Comp Engn, Evanston, IL 60208 USA

Source:

SIGNAL IMAGE AND VIDEO PROCESSING | 2020 / Vol. 14 / Issue 07

Keywords:

Capsule networks; Conv3D; ConvLSTM; Error concealment; Motion estimation

DOI:

10.1007/s11760-020-01671-x

Chinese Library Classification:

TM [Electrical engineering]; TN [Electronics and communication technology]

Discipline classification codes:

0808; 0809

Abstract:

In this paper, we present a temporal capsule network architecture to encode motion in videos as an instantiation parameter. The extracted motion is used to perform motion-compensated error concealment. We modify the original architecture and use a carefully curated dataset to enable the training of capsules spatially and temporally. First, we add the temporal dimension by taking co-located "patches" from three consecutive frames obtained from standard video sequences to form input data "cubes." Second, the network is designed with an initial feature extraction layer that operates on all three dimensions to generate spatiotemporal features. Additionally, we implement the PrimaryCaps module with a recurrent layer, instead of a conventional convolutional layer, to extract short-term motion-related temporal dependencies and encode them as activation vectors in the capsule output. Finally, the capsule output is combined with the most-recent past frame and passed through a fully connected reconstruction network to perform motion-compensated error concealment. We study the effectiveness of temporal capsules by comparing the proposed model with architectures that do not include capsules. Although the quality of the reconstruction shows room for improvement, we successfully demonstrate that capsules-based architectures can be designed to operate in the temporal dimension to encode motion-related attributes as instantiation parameters. The accuracy of motion estimation is evaluated by comparing both the reconstructed frame outputs and the corresponding optical flow estimates with ground truth data.
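The cube-forming step described in the abstract (co-located patches taken from three consecutive frames) might look roughly like the sketch below. It assumes grayscale frames held in a NumPy array of shape (T, H, W). The function name `extract_cubes`, the 16-pixel patch size, and the non-overlapping stride are illustrative assumptions and are not taken from the paper.

```python
import numpy as np

def extract_cubes(frames, patch=16, stride=16):
    """Stack co-located patches from three consecutive frames into cubes.

    frames: array of shape (T, H, W), grayscale, with T >= 3.
    patch, stride: hypothetical values chosen for illustration only.
    Returns an array of shape (N, 3, patch, patch).
    """
    frames = np.asarray(frames)
    if frames.ndim != 3 or frames.shape[0] < 3:
        raise ValueError("expected a (T, H, W) array with T >= 3")
    num_frames, height, width = frames.shape
    cubes = []
    # Slide a window of three consecutive frames along the time axis.
    for t in range(num_frames - 2):
        # Take the same spatial location in all three frames.
        for y in range(0, height - patch + 1, stride):
            for x in range(0, width - patch + 1, stride):
                cubes.append(frames[t:t + 3, y:y + patch, x:x + patch])
    if not cubes:
        return np.empty((0, 3, patch, patch), dtype=frames.dtype)
    return np.stack(cubes)

# Example: 4 synthetic frames of 32x32 pixels.
video = np.arange(4 * 32 * 32, dtype=np.float32).reshape(4, 32, 32)
cubes = extract_cubes(video)
```

In a setup like this, the first two frames of each cube would serve as context and the third as the frame to be concealed or predicted; that split is likewise an assumption of this sketch.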


Pages: 1369-1377

Page count: 9
