Video Compressed Sensing Using a Convolutional Neural Network

被引:41
作者
Shi, Wuzhen [1 ,2 ]
Liu, Shaohui [1 ,2 ]
Jiang, Feng [1 ,2 ]
Zhao, Debin [1 ,2 ]
机构
[1] Harbin Inst Technol, Sch Comp Sci & Technol, Harbin 150001, Peoples R China
[2] Peng Cheng Lab, Shenzhen 518055, Peoples R China
基金
美国国家科学基金会;
关键词
Image reconstruction; Correlation; Compressed sensing; Convolutional neural networks; Computer architecture; Video sequences; Machine learning; video compressed sensing; video reconstruction; multilevel feature compensation; convolutional neural network;
D O I
10.1109/TCSVT.2020.2978703
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Recently, a few image compressed sensing (CS) methods based on deep learning have been developed, which achieve remarkable reconstruction quality with low computational complexity. However, these existing deep learning-based image CS methods focus on exploring intraframe correlation while ignoring interframe cues, resulting in inefficiency when directly applied to video CS. In this paper, we propose a novel video CS framework based on a convolutional neural network (dubbed VCSNet) to explore both intraframe and interframe correlations. Specifically, VCSNet divides the video sequence into multiple groups of pictures (GOPs), of which the first frame is a keyframe that is sampled at a higher sampling ratio than the other nonkeyframes. In a GOP, the block-based framewise sampling by a convolution layer is proposed, which leads to the sampling matrix being automatically optimized. In the reconstruction process, the framewise initial reconstruction by using a linear convolutional neural network is first presented, which effectively utilizes the intraframe correlation. Then, the deep reconstruction with multilevel feature compensation is proposed, which compensates the nonkeyframes with the keyframe in a multilevel feature compensation manner. Such multilevel feature compensation allows the network to better explore both intraframe and interframe correlations. Extensive experiments on six benchmark videos show that VCSNet provides better performance over state-of-the-art video CS methods and deep learning-based image CS methods in both objective and subjective reconstruction quality.
引用
收藏
页码:425 / 438
页数:14
相关论文
共 38 条
[21]   Compressed-Sensing-Enabled Video Streaming for Wireless Multimedia Sensor Networks [J].
Pudlewski, Scott ;
Melodia, Tommaso ;
Prasanna, Arvind .
IEEE TRANSACTIONS ON MOBILE COMPUTING, 2012, 11 (06) :1060-1072
[22]  
Reddy D, 2011, PROC CVPR IEEE, P329, DOI 10.1109/CVPR.2011.5995542
[23]   Motion Tuned Spatio-Temporal Quality Assessment of Natural Videos [J].
Seshadrinathan, Kalpana ;
Bovik, Alan Conrad .
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2010, 19 (02) :335-350
[24]  
Shi W., 2019, P IEEE CVF C COMP VI, P12290
[25]   Image Compressed Sensing Using Convolutional Neural Network [J].
Shi, Wuzhen ;
Jiang, Feng ;
Liu, Shaohui ;
Zhao, Debin .
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2020, 29 :375-388
[26]  
Shi WZ, 2017, IEEE INT CON MULTI, P877, DOI 10.1109/ICME.2017.8019428
[27]   Video Quality Assessment by Reduced Reference Spatio-Temporal Entropic Differencing [J].
Soundararajan, Rajiv ;
Bovik, Alan C. .
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2013, 23 (04) :684-694
[28]   Video Compressed Sensing with Multihypothesis [J].
Tramel, Eric W. ;
Fowler, James E. .
2011 DATA COMPRESSION CONFERENCE (DCC), 2011, :193-202
[29]   Modified-CS: Modifying Compressive Sensing for Problems With Partially Known Support [J].
Vaswani, Namrata ;
Lu, Wei .
IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2010, 58 (09) :4595-4607
[30]   LS-CS-Residual (LS-CS): Compressive Sensing on Least Squares Residual [J].
Vaswani, Namrata .
IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2010, 58 (08) :4108-4120