Reconstruction Algorithm for Lost Frame of Multiview Videos in Wireless Multimedia Sensor Network Based on Deep Learning Multilayer Perceptron Regression

被引:25
作者
Lin, Ting-Lan [1 ]
Tseng, Hua-Wei [2 ]
Wen, Yangming [3 ]
Lai, Fu-Wei [2 ]
Lin, Ching-Hsuan [2 ]
Wang, Chuan-Jia [2 ]
机构
[1] Natl Taipei Univ Technol, Dept Elect Engn, Taipei 10608, Taiwan
[2] Chung Yuan Christian Univ, Dept Elect Engn, Taoyuan 32023, Taiwan
[3] Univ Calif Davis, Dept Comp Sci, Davis, CA 95616 USA
关键词
Wireless multimedia sensor network (WMSN); multiview video system; frame loss recovery; error concealment; multilayer perceptron regression (MPR); deep learning; inpainting; optical flow; ERROR CONCEALMENT; NEURAL-NETWORKS;
D O I
10.1109/JSEN.2018.2865916
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Wireless multimedia sensor network (WMSN) is important for environmental monitoring. When the sensors are used as cameras, the network can be regarded as a multiview video system. The Packet loss may occur when the multiview videos are transmitted wirelessly. When the video frames are last during transmission, a frame reconstruction method is needed in the decoder to estimate the missing pixels. In the proposed work, a reconstruction algorithm for lost frame of multiview videos in the WMSN based on deep learning methods is presented. A novel pixel estimation algorithm is developed using multilayer perceptron regression (MPR) with the deep learning method. Furthermore, a modified inpainting method is proposed with the use of the information from the optical flow algorithm with the neighboring available frames. Compared with the state-of-the-art method, the proposed MPR method with the traditional inpainting method increased the average peak signalto-noise ratio up to 5.62 dB. The combination of the proposed MPR method with the proposed inpainting method outperformed previous proposed combination up to 832 dB on average, showing the significance of the proposed inpainting method.
引用
收藏
页码:9792 / 9801
页数:10
相关论文
共 27 条
  • [1] [Anonymous], 2009, Ph.D. dissertation
  • [2] Eigenfaces vs. Fisherfaces: Recognition using class specific linear projection
    Belhumeur, PN
    Hespanha, JP
    Kriegman, DJ
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 1997, 19 (07) : 711 - 720
  • [3] Learning Deep Architectures for AI
    Bengio, Yoshua
    [J]. FOUNDATIONS AND TRENDS IN MACHINE LEARNING, 2009, 2 (01): : 1 - 127
  • [4] Frame Loss Concealment for Stereoscopic Video Plus Depth Sequences
    Chung, Tae-Young
    Sull, Sanghoon
    Kim, Chang-Su
    [J]. IEEE TRANSACTIONS ON CONSUMER ELECTRONICS, 2011, 57 (03) : 1336 - 1344
  • [5] Region filling and object removal by exemplar-based image inpainting
    Criminisi, A
    Pérez, P
    Toyama, K
    [J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2004, 13 (09) : 1200 - 1212
  • [6] Deng J, 2009, PROC CVPR IEEE, P248, DOI 10.1109/CVPRW.2009.5206848
  • [7] Hinton G. E., 2009, Deep belief networks, V4, P5947, DOI DOI 10.4249/SCHOLARPEDIA.5947
  • [8] Deep Neural Networks for Acoustic Modeling in Speech Recognition
    Hinton, Geoffrey
    Deng, Li
    Yu, Dong
    Dahl, George E.
    Mohamed, Abdel-rahman
    Jaitly, Navdeep
    Senior, Andrew
    Vanhoucke, Vincent
    Patrick Nguyen
    Sainath, Tara N.
    Kingsbury, Brian
    [J]. IEEE SIGNAL PROCESSING MAGAZINE, 2012, 29 (06) : 82 - 97
  • [9] Artificial neural networks vs linear regression in a fluid mechanics and chemical modelling problem: Elimination of hydrogen sulphide in a lab-scale biofilter
    Ibarra-Berastegi, G.
    Elias, A.
    Arias, R.
    Barona, A.
    [J]. 2007 IEEE/ACS INTERNATIONAL CONFERENCE ON COMPUTER SYSTEMS AND APPLICATIONS, VOLS 1 AND 2, 2007, : 584 - 587
  • [10] Kingma D. P., P 3 INT C LEARN REPR