Reconstruction Algorithm for Lost Frame of Multiview Videos in Wireless Multimedia Sensor Network Based on Deep Learning Multilayer Perceptron Regression

被引：25

作者：

Lin, Ting-Lan ^{[1
]}

Tseng, Hua-Wei ^{[2
]}

Wen, Yangming ^{[3
]}

Lai, Fu-Wei ^{[2
]}

Lin, Ching-Hsuan ^{[2
]}

Wang, Chuan-Jia ^{[2
]}

机构：

[1] Natl Taipei Univ Technol, Dept Elect Engn, Taipei 10608, Taiwan

[2] Chung Yuan Christian Univ, Dept Elect Engn, Taoyuan 32023, Taiwan

[3] Univ Calif Davis, Dept Comp Sci, Davis, CA 95616 USA

来源：

IEEE SENSORS JOURNAL | 2018年 / 18卷 / 23期

关键词：

Wireless multimedia sensor network (WMSN); multiview video system; frame loss recovery; error concealment; multilayer perceptron regression (MPR); deep learning; inpainting; optical flow; ERROR CONCEALMENT; NEURAL-NETWORKS;

D O I：

10.1109/JSEN.2018.2865916

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

Wireless multimedia sensor network (WMSN) is important for environmental monitoring. When the sensors are used as cameras, the network can be regarded as a multiview video system. The Packet loss may occur when the multiview videos are transmitted wirelessly. When the video frames are last during transmission, a frame reconstruction method is needed in the decoder to estimate the missing pixels. In the proposed work, a reconstruction algorithm for lost frame of multiview videos in the WMSN based on deep learning methods is presented. A novel pixel estimation algorithm is developed using multilayer perceptron regression (MPR) with the deep learning method. Furthermore, a modified inpainting method is proposed with the use of the information from the optical flow algorithm with the neighboring available frames. Compared with the state-of-the-art method, the proposed MPR method with the traditional inpainting method increased the average peak signalto-noise ratio up to 5.62 dB. The combination of the proposed MPR method with the proposed inpainting method outperformed previous proposed combination up to 832 dB on average, showing the significance of the proposed inpainting method.

引用

页码：9792 / 9801

页数：10

共 27 条

[1] [Anonymous], 2009, Ph.D. dissertation
[2] Eigenfaces vs. Fisherfaces: Recognition using class specific linear projection
Belhumeur, PN
Hespanha, JP
Kriegman, DJ
[J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 1997, 19 (07) : 711 - 720
[3] Learning Deep Architectures for AI
Bengio, Yoshua
[J]. FOUNDATIONS AND TRENDS IN MACHINE LEARNING, 2009, 2 (01): : 1 - 127
[4] Frame Loss Concealment for Stereoscopic Video Plus Depth Sequences
Chung, Tae-Young
Sull, Sanghoon
Kim, Chang-Su
[J]. IEEE TRANSACTIONS ON CONSUMER ELECTRONICS, 2011, 57 (03) : 1336 - 1344
[5] Region filling and object removal by exemplar-based image inpainting
Criminisi, A
Pérez, P
Toyama, K
[J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2004, 13 (09) : 1200 - 1212
[6] Deng J, 2009, PROC CVPR IEEE, P248, DOI 10.1109/CVPRW.2009.5206848
[7] Hinton G. E., 2009, Deep belief networks, V4, P5947, DOI DOI 10.4249/SCHOLARPEDIA.5947
[8] Deep Neural Networks for Acoustic Modeling in Speech Recognition
Hinton, Geoffrey
Deng, Li
Yu, Dong
Dahl, George E.
Mohamed, Abdel-rahman
Jaitly, Navdeep
Senior, Andrew
Vanhoucke, Vincent
Patrick Nguyen
Sainath, Tara N.
Kingsbury, Brian
[J]. IEEE SIGNAL PROCESSING MAGAZINE, 2012, 29 (06) : 82 - 97
[9] Artificial neural networks vs linear regression in a fluid mechanics and chemical modelling problem: Elimination of hydrogen sulphide in a lab-scale biofilter
Ibarra-Berastegi, G.
Elias, A.
Arias, R.
Barona, A.
[J]. 2007 IEEE/ACS INTERNATIONAL CONFERENCE ON COMPUTER SYSTEMS AND APPLICATIONS, VOLS 1 AND 2, 2007, : 584 - 587
[10] Kingma D. P., P 3 INT C LEARN REPR

← 1 2 3 →