Temporal capsule networks for video motion estimation and error concealment

被引:8
作者
Sankisa, Arun [1 ]
Punjabi, Arjun [1 ]
Katsaggelos, Aggelos K. [1 ]
机构
[1] Northwestern Univ, Dept Elect & Comp Engn, Evanston, IL 60208 USA
关键词
Capsule networks; Conv3D; ConvLSTM; Error concealment; Motion estimation;
D O I
10.1007/s11760-020-01671-x
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
In this paper, we present a temporal capsule network architecture to encode motion in videos as an instantiation parameter. The extracted motion is used to perform motion-compensated error concealment. We modify the original architecture and use a carefully curated dataset to enable the training of capsules spatially and temporally. First, we add the temporal dimension by taking co-located "patches" from three consecutive frames obtained from standard video sequences to form input data "cubes." Second, the network is designed with an initial feature extraction layer that operates on all three dimensions to generate spatiotemporal features. Additionally, we implement the PrimaryCaps module with a recurrent layer, instead of a conventional convolutional layer, to extract short-term motion-related temporal dependencies and encode them as activation vectors in the capsule output. Finally, the capsule output is combined with the most-recent past frame and passed through a fully connected reconstruction network to perform motion-compensated error concealment. We study the effectiveness of temporal capsules by comparing the proposed model with architectures that do not include capsules. Although the quality of the reconstruction shows room for improvement, we successfully demonstrate that capsules-based architectures can be designed to operate in the temporal dimension to encode motion-related attributes as instantiation parameters. The accuracy of motion estimation is evaluated by comparing both the reconstructed frame outputs and the corresponding optical flow estimates with ground truth data.
引用
收藏
页码:1369 / 1377
页数:9
相关论文
共 50 条
[31]   Error concealment for scalable motion-compensated subband/wavelet video coders [J].
Bajic, Ivan V. ;
Woods, John W. .
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2007, 17 (04) :508-514
[32]   A joint motion-image inpainting method for error concealment in video coding [J].
Chen, L. Y. ;
Chan, S. C. ;
Shum, H. Y. .
2006 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP 2006, PROCEEDINGS, 2006, :2241-+
[33]   Error protection and concealment of motion vectors in MCTF-based video coding [J].
Stoufs, M ;
Barbarien, J ;
Verdicchio, F ;
Munteanu, A ;
Cornelis, J ;
Schelkens, P .
WAVELET APPLICATIONS IN INDUSTRIAL PROCESSING II, 2004, 5607 :71-80
[34]   An edge-based temporal error concealment for MPEG-coded video [J].
Huang, YL ;
Lien, HY .
VISUAL COMMUNICATIONS AND IMAGE PROCESSING 2005, PTS 1-4, 2005, 5960 :992-1000
[35]   Hybrid Spatio-Temporal Error Concealment technique for Image/Video transmission [J].
Patel, Dheeraj ;
Patel, Jigisha .
PROCEEDINGS ON 2014 2ND INTERNATIONAL CONFERENCE ON EMERGING TECHNOLOGY TRENDS IN ELECTRONICS, COMMUNICATION AND NETWORKING (ET2ECN), 2014,
[36]   Improved fading scheme for spatio-temporal error concealment in video transmission [J].
Hwang, Min-Cheol ;
Kim, Jun-Hyung ;
Park, Chun-Su ;
Ko, Sung-Jea .
IEICE TRANSACTIONS ON FUNDAMENTALS OF ELECTRONICS COMMUNICATIONS AND COMPUTER SCIENCES, 2008, E91A (03) :740-748
[37]   Fuzzy logic based temporal error concealment for H.264 video [J].
Lee, Pei-Jun ;
Lin, Ming-Long .
ETRI JOURNAL, 2006, 28 (05) :574-582
[38]   An efficient spatio-temporal boundary matching algorithm for video error concealment [J].
Xiang, Youjun ;
Feng, Liangmou ;
Xie, Shengli ;
Zhou, Zhiheng .
MULTIMEDIA TOOLS AND APPLICATIONS, 2011, 52 (01) :91-103
[39]   An efficient spatio-temporal boundary matching algorithm for video error concealment [J].
Youjun Xiang ;
Liangmou Feng ;
Shengli Xie ;
Zhiheng Zhou .
Multimedia Tools and Applications, 2011, 52 :91-103
[40]   A Temporal-domain Error Concealment Algorithm Effective In Motion Boundary Improvement [J].
Zhao, De-fang ;
Gong, Sheng-rong ;
Zhang, Shu-kui .
2009 INTERNATIONAL CONFERENCE ON COMPUTER ENGINEERING AND TECHNOLOGY, VOL II, PROCEEDINGS, 2009, :252-255